Generation of predictive natural language processing models
First Claim
Patent Images
1. A system comprising:
- a computer-readable memory storing executable instructions; and
one or more processors in communication with the computer-readable memory, wherein the one or more processors are programmed by the executable instructions to at least;
obtain natural language processing personalization data associated with a user, the natural language processing personalization data comprising data regarding items in a user-specific content catalog associated with the user;
generate a personal language model using at least the data regarding items in the user-specific content catalog, wherein the personal language model is specific to the user, wherein the personal language model includes a first subset of items in a general language model, and wherein the general language model is not associated with any specific user;
determine, using at least the data regarding items in the user-specific content catalog, a plurality of user-specific predicted items about which the user is predicted to make a future utterance, wherein the plurality of user-specific predicted items are not in the user-specific content catalog;
generate a predictive language model based at least on the plurality of user-specific predicted items, wherein the predictive language model is associated with the user, and wherein the predictive language model includes a second subset of items in the general language model;
generate a weighting factor for the general language model, wherein the weighting factor, when applied to the general language model, reduces probabilities associated with individual items in the general language model that are determined to be acoustically confusable with at least portion of the user-specific predicted items; and
subsequently;
process an utterance using the personal language model, the predictive language model, the general language model, and the weighting factor, wherein the utterance includes a first item of the plurality of user-specific predicted items;
recognize the first item based at least on the personal language model, the predictive language model, and the general language model, wherein the first item is recognized based at least partly on a first probability for the first item being higher than a second probability for a second item and a third probability for a third item, wherein a value of the first probability comprises a probability value from the personal language model, wherein a value of the second probability comprises a probability value from the predictive language model, and wherein a value of the third probability comprises a product of the weighting factor and a probability value from the general language model; and
play, on a user computing device, audio content associated with the first item.
1 Assignment
0 Petitions
Accused Products
Abstract
Features are disclosed for generating predictive personal natural language processing models based on user-specific profile information. The predictive personal models can provide broader coverage of the various terms, named entities, and/or intents of an utterance by the user than a personal model, while providing better accuracy than a general model. Profile information may be obtained from various data sources. Predictions regarding the content or subject of future user utterances may be made from the profile information. Predictive personal models may be generated based on the predictions. Future user utterances may be processed using the predictive personal models.
40 Citations
22 Claims
-
1. A system comprising:
-
a computer-readable memory storing executable instructions; and one or more processors in communication with the computer-readable memory, wherein the one or more processors are programmed by the executable instructions to at least; obtain natural language processing personalization data associated with a user, the natural language processing personalization data comprising data regarding items in a user-specific content catalog associated with the user; generate a personal language model using at least the data regarding items in the user-specific content catalog, wherein the personal language model is specific to the user, wherein the personal language model includes a first subset of items in a general language model, and wherein the general language model is not associated with any specific user; determine, using at least the data regarding items in the user-specific content catalog, a plurality of user-specific predicted items about which the user is predicted to make a future utterance, wherein the plurality of user-specific predicted items are not in the user-specific content catalog; generate a predictive language model based at least on the plurality of user-specific predicted items, wherein the predictive language model is associated with the user, and wherein the predictive language model includes a second subset of items in the general language model; generate a weighting factor for the general language model, wherein the weighting factor, when applied to the general language model, reduces probabilities associated with individual items in the general language model that are determined to be acoustically confusable with at least portion of the user-specific predicted items; and subsequently; process an utterance using the personal language model, the predictive language model, the general language model, and the weighting factor, wherein the utterance includes a first item of the plurality of user-specific predicted items; recognize the first item based at least on the personal language model, the predictive language model, and the general language model, wherein the first item is recognized based at least partly on a first probability for the first item being higher than a second probability for a second item and a third probability for a third item, wherein a value of the first probability comprises a probability value from the personal language model, wherein a value of the second probability comprises a probability value from the predictive language model, and wherein a value of the third probability comprises a product of the weighting factor and a probability value from the general language model; and play, on a user computing device, audio content associated with the first item. - View Dependent Claims (2, 3, 17, 20)
-
-
4. A computer-implemented method comprising:
-
under control of one or more computing devices configured to execute specific instructions, obtaining content catalog information specific to a user profile; generating a personal language model based at least on the content catalog information, wherein the personal language model is specific to the user profile, wherein the personal language model includes a first subset of items in a general language model, and wherein the general language model is not associated with any specific user profile; determining, based at least on the content catalog information, predicted utterance content, wherein the content catalog information does not include the predicted utterance content; generating a predictive language model based at least on the predicted utterance content, wherein the predictive language model is specific to the user profile, and wherein the predictive language model includes a second subset of items in the general language model; generating a weighting factor that, when applied to the general language model, reduces probabilities associated with at least a portion of items in the general language model that are determined to be acoustically confusable for at least;
a portion of items of the predicted utterance content, or a portion of items of the content catalog information;recognizing a first item in a first utterance using the general language model in combination with the personal language model and the predictive language model, wherein the first item is recognized based at least partly on a first probability for the first item being higher than a second probability for a second item and a third probability for a third item, wherein a value of the first probability comprises a probability value from of the personal language model, wherein a value of the second probability comprises a probability value from the predictive language model, and wherein a value of the third probability comprises a product of the weighting factor and a probability value from the general language model; and causing audio content associated with the first item to be played by an output device. - View Dependent Claims (5, 6, 7, 8, 9, 10, 11, 15, 16, 18, 21, 22)
-
-
12. One or more non-transitory computer readable media comprising executable code that, when executed, cause one or more computing devices to perform a process comprising:
-
obtaining catalog information associated with a user profile; generating a personal language model based at least on the catalog information, wherein the personal language model is specific to the user profile, wherein the personal language model includes a first subset of items in a general language model, and wherein the general language model is not associated with any specific user profile; determining, based at least on the catalog information, predicted natural language processing input content, wherein the catalog information does not include the predicted natural language processing input content; generating a predictive language model based at least on the predicted natural language processing input content, wherein the predictive language model is associated with the user profile, and wherein the predictive language model includes a subset of items in the general language model; generating a weighting factor that, when applied to the predictive language model, modifies probabilities associated with at least a portion of items of the predicted natural language processing input content that are determined to be acoustically confusable for at least a portion of items in the general language model; recognizing a first item in a first utterance using the general language model in combination with the personal language model and the predictive language model, wherein the first item is recognized based at least partly on a first probability for the first item being higher than a second probability for a second item and a third probability for a third item, wherein a value of the first probability comprises a product of the weighting factor and a probability value from the personal language model, wherein a value of the second probability comprises a probability value from the predictive language model, and wherein a value of the third probability comprises a probability value from the general language model; and causing audio content associated with the first item to be played by an output device. - View Dependent Claims (13, 14, 19)
-
Specification