Dynamic speech sharpening
Abstract
An enhanced system for speech interpretation is provided. The system may include receiving a user verbalization and generating one or more preliminary interpretations of the verbalization by identifying one or more phonemes in the verbalization. An acoustic grammar may be used to map the phonemes to syllables or words, and the acoustic grammar may include one or more linking elements to reduce a search space associated with the grammar. The preliminary interpretations may be subject to various post-processing techniques to sharpen accuracy of the preliminary interpretation. A heuristic model may assign weights to various parameters based on a context, a user profile, or other domain knowledge. A probable interpretation may be identified based on a confidence score for each of a set of candidate interpretations generated by the heuristic model. The model may be augmented or updated based on various information associated with the interpretation of the verbalization.
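The confidence-scoring step described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the `Candidate` fields, the three scoring parameters, and the weight values are all hypothetical, chosen only to show how weighted parameters might be combined into one confidence score per candidate interpretation.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    """One candidate interpretation (all fields are hypothetical, in 0..1)."""
    text: str
    acoustic_score: float  # how well the recognized phonemes matched
    context_score: float   # fit with the current conversational context
    profile_score: float   # fit with the user's profile


# Hypothetical weights; the abstract only says weights may depend on
# context, a user profile, or other domain knowledge.
WEIGHTS = {"acoustic": 0.5, "context": 0.3, "profile": 0.2}


def confidence(candidate, weights=WEIGHTS):
    """Combine the weighted parameters into a single confidence score."""
    return (
        weights["acoustic"] * candidate.acoustic_score
        + weights["context"] * candidate.context_score
        + weights["profile"] * candidate.profile_score
    )


def most_probable(candidates, weights=WEIGHTS):
    """Return the candidate with the highest confidence score."""
    return max(candidates, key=lambda c: confidence(c, weights))


candidates = [
    Candidate("call Rob", acoustic_score=0.85, context_score=0.3, profile_score=0.2),
    Candidate("call Bob", acoustic_score=0.80, context_score=0.9, profile_score=0.7),
]
best = most_probable(candidates)
print(best.text)
```

In this toy data the acoustically stronger candidate loses because the context and profile parameters favor the other one, which is the kind of sharpening the abstract describes.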
16 Claims
1. A method for providing out-of-vocabulary interpretation capabilities and for tolerating noise when interpreting natural language speech utterances, the method comprising:
- receiving an utterance from a user;
- recognizing a stream of phonemes contained in the utterance on an electronic device;
- mapping the recognized stream of phonemes to an acoustic grammar that phonemically represents one or more syllables, the recognized stream of phonemes mapped to a series of one or more of the phonemically represented syllables; and
- generating at least one interpretation of the utterance, wherein the generated interpretation includes the series of syllables mapped to the recognized stream of phonemes.
Dependent claims: 2, 3, 4, 5, 6, 7, 8.
9. A system for providing out-of-vocabulary interpretation capabilities and for tolerating noise when interpreting natural language speech utterances, the system comprising:
- at least one input device that receives an utterance from a user and generates an electronic signal corresponding to the utterance; and
- a speech interpretation engine that receives the electronic signal corresponding to the utterance, the speech interpretation engine operable to:
  - recognize a stream of phonemes contained in the utterance;
  - map the recognized stream of phonemes to an acoustic grammar that phonemically represents one or more syllables, the recognized stream of phonemes mapped to a series of one or more of the phonemically represented syllables; and
  - generate at least one interpretation of the utterance, wherein the generated interpretation includes the series of syllables mapped to the recognized stream of phonemes.
Dependent claims: 10, 11, 12, 13, 14, 15, 16.
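The mapping step recited in the independent claims can be sketched as follows. This is an illustration under stated assumptions, not the claimed implementation: the toy `ACOUSTIC_GRAMMAR`, its syllable labels, and the ARPAbet-style phoneme symbols are hypothetical, and exact matching is used, so noise tolerance is not modeled. The sketch shows how a recognized phoneme stream could be segmented into series of phonemically represented syllables, so that an interpretation can be produced even when no dictionary word matches.

```python
from functools import lru_cache

# Hypothetical toy acoustic grammar: syllable label -> phoneme sequence.
ACOUSTIC_GRAMMAR = {
    "see": ("S", "IY"),
    "at": ("AE", "T"),
    "a": ("AE",),
    "ul": ("AH", "L"),
    "tul": ("T", "AH", "L"),
}


def map_to_syllables(phonemes, grammar=ACOUSTIC_GRAMMAR):
    """Return every way to cover the phoneme stream with grammar syllables."""
    phonemes = tuple(phonemes)

    @lru_cache(maxsize=None)
    def segment(start):
        # Reaching the end of the stream yields one empty continuation.
        if start == len(phonemes):
            return ((),)
        results = []
        for syllable, pattern in grammar.items():
            end = start + len(pattern)
            if phonemes[start:end] == pattern:
                for rest in segment(end):
                    results.append((syllable,) + rest)
        return tuple(results)

    return [list(series) for series in segment(0)]


def generate_interpretations(phonemes, grammar=ACOUSTIC_GRAMMAR):
    """Build one interpretation string per syllable series."""
    return ["-".join(series) for series in map_to_syllables(phonemes, grammar)]


stream = ["S", "IY", "AE", "T", "AH", "L"]  # hypothetical recognizer output
print(generate_interpretations(stream))
```

Returning every segmentation rather than a single one leaves room for a later scoring stage to choose among the candidate interpretations.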
Specification