Voice recognition rejection scheme
First Claim
1. A method of capturing an utterance in a voice recognition system, comprising the steps of:
- comparing the utterance to a first stored word to generate a first score;
comparing the utterance to a second stored word to generate a second score;
determining a difference between the first score end the second score;
determining a ratio of the first score to the difference; and
processing the utterance based on the ratio.
1 Assignment
0 Petitions
Accused Products
Abstract
A voice recognition rejection scheme for capturing an utterance includes the steps accepting the utterance, applying an N-best algorithm to the utterance, or rejecting the utterance. The utterance is accepted if a first predefined relationship exists between one or more closest comparison results for the utterance with respect to a stored word and one or more differences between the one or more closest comparison results and one or more other comparison results between the utterance and one or more other stored words. An N-best algorithm is applied to the utterance if a second predefined relationship exists between the one or more closest comparison results and the one or more differences between the one or more closest comparison results and the one or more other comparison results. The utterance is rejected if a third predefined relationship exists between the one or more closest comparison results and the one or more differences between the one or more closest comparison results and the one or more other comparison results. One of the one or more other comparison results may advantageously be a next-closest comparison result for the utterance and another store word. The first, second, and third predefined relationships may advantageously be linear relationships.
-
Citations
40 Claims
-
1. A method of capturing an utterance in a voice recognition system, comprising the steps of:
-
comparing the utterance to a first stored word to generate a first score;
comparing the utterance to a second stored word to generate a second score;
determining a difference between the first score end the second score;
determining a ratio of the first score to the difference; and
processing the utterance based on the ratio. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
if the ratio of the first score to the difference is within a first range of values, accepting the utterance;
if the ratio of the first score to the difference is within a second range of values, applying an N-best algorithm to verify the utterance; and
if the ratio of the first score to the difference is within a third range of values, rejecting the utterance.
-
-
3. The method of claim 1, wherein the difference corresponds to a change of score between the first score and the second score.
-
4. The method of claim 1, wherein the first stored word comprises a best candidate in a vocabulary of a voice recognition system, and the second stored word comprises a next-best candidate in a vocabulary of a voice recognition system.
-
5. The method of claim 1, wherein the first score comprises a closest comparison result, and the second score comprises a next-closest comparison result.
-
6. The method of claim 1, wherein the first score and the second score comprise linear predictive coding coefficients.
-
7. The method of claim 1, wherein the first score and the second score comprise cepstral coefficients.
-
8. The method of claim 1, wherein the first score and the second score comprise bandpass filter outputs.
-
9. The method of claim 2, wherein the first, second, and third range of values define linear relationships between the first score and the difference.
-
10. The method of claim 1, wherein the difference comprises a difference between a closest comparison result and a next-closest comparison result.
-
11. A voice recognition system, comprising:
-
an acoustic processor configured to extract speech parameters from digitized speech samples of an utterance; and
a processor coupled to the acoustic processor and configured to compare the utterance to a first stored word to generate a first score;
compare the utterance to a second stored word to generate a second score;
determine a difference between the first score and the second score;
determine a ratio of the first score to the difference; and
process the utterance based on the relationship. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
accept the utterance if the ratio of the first score to the difference is within a first range of values;
apply an N-best algorithm to verify the utterance if the ratio of the first score to the difference is within a second range of values; and
reject the utterance if the ratio of the first score to the difference is within a third range of values, rejecting the utterance.
-
-
13. The voice recognition system of claim 11, wherein the difference corresponds to a change of score between the first score and the second score.
-
14. The voice recognition system of claim 11, wherein the first stored word comprises a best candidate in a vocabulary of the voice recognition system, and the second stored word comprises a next-best candidate in a vocabulary of the voice recognition system.
-
15. The voice recognition system of claim 11, wherein the first score comprises a closest comparison result, and the second score comprises a next-closest comparison result.
-
16. The voice recognition system of claim 11, wherein the first and the second score comprise linear predictive coding coefficients.
-
17. The voice recognition system of claim 11, wherein the first score and the second score comprise cepstral coefficients.
-
18. The voice recognition system of claim 11, wherein the first score and the second score comprise bandpass filter outputs.
-
19. The voice recognition system of claim 12, wherein the first, second, and third range of values define linear relationships between the first score and the difference.
-
20. The voice recognition system of claim 11, wherein the difference comprises a difference between a closest comparison result and a next-closest comparison result.
-
21. A voice recognition system, comprising:
-
means for comparing the utterance to a first stored word to generate a first score;
means for comparing the utterance to a second stored word to generate a second score;
means for determining a difference between the first score and the second score;
means for determining a ratio of the first score to the difference; and
means for processing the utterance based on the relationship. - View Dependent Claims (22, 23, 24, 25, 26, 27, 28, 29, 30)
means for accepting the utterance if the ratio of the first score to the difference is within a first range of values;
means for applying an N-best algorithm to verify the utterance if the ratio of the first score to the difference is within a second range of values; and
means for rejecting the utterance if the ratio of the first score to the difference is within a third range of values, rejecting the utterance.
-
-
23. The voice recognition system of claim 22, wherein the first, second, and third range of values define linear relationships between the first score and the difference.
-
24. The voice recognition system of claim 21, wherein the difference corresponds to a change of score between the first score and the second score.
-
25. The voice recognition system of claim 21, wherein the first stored word comprises a best candidate in a vocabulary of the voice recognition system, and the second stored word comprises a next-best candidate in a vocabulary of the voice recognition system.
-
26. The voice recognition system of claim 21, wherein the first score comprises a closest comparison result, and the second comprises a next-closest comparison result.
-
27. The voice recognition system of claim 21, wherein the first score and the second comprise linear predictive coding coefficients.
-
28. The voice recognition system of claim 21, wherein the first score and the second score comprise cepstral coefficients.
-
29. The voice recognition system of claim 21, wherein the first score and the second score comprise bandpass filter outputs.
-
30. The voice recognition system of claim 21, wherein difference comprises a difference between a closest comparison result and a next-closest comparison result.
-
31. A voice recognition system, comprising:
-
means for extracting speech parameters from digitized speech samples of an utterance; and
means for;
comparing the utterance to a first stored word to generate a first score;
comparing the utterance to a second stored word to generate a second score;
determining a difference between the first score and the second score;
determining a ratio of the first score to the difference; and
processing the utterance based on the relationship. - View Dependent Claims (32, 33, 34, 35, 36, 37, 38, 39, 40)
accepting the utterance if the ratio of the first score to the difference is within a first range of values;
applying an N-best algorithm to verify the utterance if the ratio of the first score to the difference is within a second range of values; and
rejecting the utterance if the ratio of the first score to the difference is within a third range of values, rejecting the utterance.
-
-
33. The voice recognition system of claim 31, wherein the difference comprises a difference between a closest comparison result and a next-closest comparison result.
-
34. The voice recognition system of claim 31, wherein the difference corresponds to a change of score between the first score and the second score.
-
35. The voice recognition system of claim 31, wherein the first stored word comprises a best candidate in a vocabulary of the voice recognition system, and the second stored word comprises a next-best candidate in a vocabulary of the voice recognition system.
-
36. The voice recognition system of claim 31, wherein the first score comprises a closest comparison result, and the second score comprises a next-closest comparison result.
-
37. The voice recognition system of claim 31, wherein the first score and the second score comprise linear predictive coding coefficients.
-
38. The voice recognition system of claim 31, wherein the first score and the second score comprise cepstral coefficients.
-
39. The voice recognition system of claim 31, wherein the first score and the second score comprise bandpass filter outputs.
-
40. The voice recognition system of claim 32, wherein the first, second, and third range of values define linear relationships between the first score and the difference.
Specification