Speech recognition employing a permissive recognition criterion for a repeated phrase utterance
First Claim
1. A method of recognizing a spoken phrase, the phrase including one or more words, the method comprising the steps of:
- performing a first speech recognition process in an attempt to recognize a first utterance of the phrase, said first speech recognition process employing a first speech recognition criterion;
if said first speech recognition process does not result in recognition of said first utterance in accordance with said first recognition criterion, establishing a time interval in which to receive a second utterance of the phrase; and
if said first speech recognition process does not result in recognition of said first utterance in accordance with said first recognition criterion, and if said second utterance is received during said time interval, performing a second speech recognition process in an attempt to recognize said second utterance, said second speech recognition process employing a second speech recognition criterion, wherein said second speech recognition criterion is more likely to be satisfied than said first speech recognition criterion.
6 Assignments
0 Petitions
Accused Products
Abstract
The invention relates to a method and apparatus for speech recognition, the speech to be recognized including one or more words. Recognition is based on an analysis of a first and a second utterance. In accordance with the invention, the first utterance is compared to one or more models of speech to determine a similarity metric for each such comparison. The model of speech which most closely matches the first utterance is determined based on the one or more similarity metrics. The similarity metric corresponding to the most closely matching model of speech is analyzed to determine whether the similarity metric satisfies a first recognition criterion. The second utterance is compared to one or more models of speech associated with the most closely matching model (which may include the most closely matching model) to determine a second utterance similarity metric for each such comparison. The one or more second utterance similarity metrics are analyzed to determine whether the one or more metrics satisfies a second recognition criteria. The second utterance is recognized has the phrase corresponding to the most closely matching model of speech when the first and second recognition criteria are satisfied. The present invention has application to many problems in speech recognition including isolated word recognition and command spotting. An illustrative embodiment of the invention in the context of a cellular telephone is provided. Other embodiments are also discussed.
104 Citations
36 Claims
-
1. A method of recognizing a spoken phrase, the phrase including one or more words, the method comprising the steps of:
-
performing a first speech recognition process in an attempt to recognize a first utterance of the phrase, said first speech recognition process employing a first speech recognition criterion; if said first speech recognition process does not result in recognition of said first utterance in accordance with said first recognition criterion, establishing a time interval in which to receive a second utterance of the phrase; and if said first speech recognition process does not result in recognition of said first utterance in accordance with said first recognition criterion, and if said second utterance is received during said time interval, performing a second speech recognition process in an attempt to recognize said second utterance, said second speech recognition process employing a second speech recognition criterion, wherein said second speech recognition criterion is more likely to be satisfied than said first speech recognition criterion. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. An apparatus for recognizing a spoken phrase, the phrase including one or more words, the apparatus comprising:
-
means for performing a first speech recognition process in an attempt to recognize a first utterance of the phrase, said first speech recognition process employing a first speech recognition criterion; means for establishing a time interval in which to receive a second utterance of the phrase when said first speech recognition process does not result in recognition of said first utterance in accordance with said first recognition criterion; and means for performing a second speech recognition process in an attempt to recognize said second utterance if said first speech recognition process does not result in recognition of said first utterance in accordance with said first recognition criterion and if said second utterance is received during said time interval, said second speech recognition process employing a second speech recognition criterion, wherein said second speech recognition criterion is more likely to be satisfied than said first speech recognition criterion. - View Dependent Claims (21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36)
-
Specification