Addressing Missing Features in Models
First Claim
1. A method performed by one or more computers, comprising:
- receiving data indicating a candidate transcription for an utterance and a context for the utterance;
accessing a language model that includes a respective score for each of a plurality of features, each feature corresponding to a word or phrase occurring in an associated context that includes one or more preceding words;
determining that the language model does not include a score for a feature corresponding to the candidate transcription in the particular context;
determining a score corresponding to the candidate transcription in the particular context, wherein the score is determined based on one or more scores included in the language model for one or more of the plurality of features that are associated with the particular context;
determining, using the language model and the determined score, a probability score indicating a likelihood of occurrence of the candidate transcription in the particular context;
selecting, based on the probability score, a transcription for the utterance from among a plurality of candidate transcriptions; and
providing the selected transcription to a client device.
2 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods for addressing missing features in models are provided. In some implementations, a model configured to indicate likelihoods of different outcomes is accessed. The model includes a respective score for each of a plurality of features, and each feature corresponds to an outcome in an associated context. It is determined that the model does not include a score for a feature corresponding to a potential outcome in a particular context. A score is determined for the potential outcome in the particular context based on the scores for one or more features in the model that correspond to different outcomes in the particular context. The model and the score are used to determine a likelihood of occurrence of the potential outcome.
-
Citations
20 Claims
-
1. A method performed by one or more computers, comprising:
-
receiving data indicating a candidate transcription for an utterance and a context for the utterance; accessing a language model that includes a respective score for each of a plurality of features, each feature corresponding to a word or phrase occurring in an associated context that includes one or more preceding words; determining that the language model does not include a score for a feature corresponding to the candidate transcription in the particular context; determining a score corresponding to the candidate transcription in the particular context, wherein the score is determined based on one or more scores included in the language model for one or more of the plurality of features that are associated with the particular context; determining, using the language model and the determined score, a probability score indicating a likelihood of occurrence of the candidate transcription in the particular context; selecting, based on the probability score, a transcription for the utterance from among a plurality of candidate transcriptions; and providing the selected transcription to a client device. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A system comprising:
-
one or more computers and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising; receiving data indicating a candidate transcription for an utterance and a context for the utterance; accessing a language model that includes a respective score for each of a plurality of features, each feature corresponding to a word or phrase occurring in an associated context that includes one or more preceding words; determining that the language model does not include a score for a feature corresponding to the candidate transcription in the particular context; determining a score corresponding to the candidate transcription in the particular context, wherein the score is determined based on one or more scores included in the language model for one or more of the plurality of features that are associated with the particular context; determining, using the language model and the determined score, a probability score indicating a likelihood of occurrence of the candidate transcription in the particular context; selecting, based on the probability score, a transcription for the utterance from among a plurality of candidate transcriptions; and providing the selected transcription to a client device. - View Dependent Claims (16, 17)
-
-
18. A non-transitory computer readable storage medium storing instructions that when executed by one or more computers cause the one or more computers to perform operations comprising:
-
receiving data indicating a candidate transcription for an utterance and a context for the utterance; accessing a language model that includes a respective score for each of a plurality of features, each feature corresponding to a word or phrase occurring in an associated context that includes one or more preceding words; determining that the language model does not include a score for a feature corresponding to the candidate transcription in the particular context; determining a score corresponding to the candidate transcription in the particular context, wherein the score is determined based on one or more scores included in the language model for one or more of the plurality of features that are associated with the particular context; determining, using the language model and the determined score, a probability score indicating a likelihood of occurrence of the candidate transcription in the particular context; selecting, based on the probability score, a transcription for the utterance from among a plurality of candidate transcriptions; and providing the selected transcription to a client device. - View Dependent Claims (19, 20)
-
Specification