Multiple pass speech recognition method and system
First Claim
1. A method of recognizing speech, the method comprising:
- receiving an input speech signal;
performing an initial recognition on the input speech signal to generate a first pass result;
generating a first grammar based upon the first pass result, the first grammar having a portion set to match a first part of the input speech signal; and
applying the first grammar to the input speech signal to generate a second pass result, wherein generating a first grammar comprises;
determining a context of the first pass result;
determining the portion of the first grammar to be set to match the first part of the input speech signal based upon the determined context of the first pass result; and
generating the first grammar with the portion set to match the first part of the input speech signal.
1 Assignment
0 Petitions
Accused Products
Abstract
A multiple pass speech recognition method includes a first pass and a second pass. The first pass recognizes an input speech signal to generate a first pass result. The second pass generates a first grammar having a portion set to match a first part of the input speech signal, based upon the context of the first pass result, and generate a second pass result. The method may further include a third pass grammar limiting the second part of the input speech signal to the second pass result. The third pass grammar includes a model corresponding to the first part of the input speech signal and varying within the second pass result. The third pass compares the first part of the input speech signal to the model while limiting the second part of the input speech signal to the second pass result.
139 Citations
35 Claims
-
1. A method of recognizing speech, the method comprising:
-
receiving an input speech signal; performing an initial recognition on the input speech signal to generate a first pass result; generating a first grammar based upon the first pass result, the first grammar having a portion set to match a first part of the input speech signal; and applying the first grammar to the input speech signal to generate a second pass result, wherein generating a first grammar comprises; determining a context of the first pass result; determining the portion of the first grammar to be set to match the first part of the input speech signal based upon the determined context of the first pass result; and generating the first grammar with the portion set to match the first part of the input speech signal. - View Dependent Claims (2, 3, 4)
-
-
5. A method of recognizing speech, the method comprising:
-
receiving an input speech signal; performing an initial recognition on the input speech signal to generate a first pass result; generating a first grammar based upon the first pass result, the first grammar having a portion set to match a first part of the input speech signal; applying the first grammar to the input speech signal to generate a second pass result; generating a second grammar based upon the second pass result, the second grammar limiting the second part of the input speech signal to the second pass result and configured to recognize the first part of the input speech signal within the second pass result; and applying the second grammar to the input speech signal to generate a third pass result. - View Dependent Claims (6, 7, 8)
-
-
9. A computer program product for recognizing speech, the computer program product stored on a computer readable medium and adapted to perform a method comprising:
-
receiving an input speech signal; performing an initial recognition on the input speech signal to generate a first pass result; generating a first grammar based upon the first pass result, the first grammar having a portion set to match a first part of the input speech signal; and applying the first grammar to the input speech signal to generate a second pass result, wherein generating a first grammar comprises; determining a context of the first pass result; determining the portion of the first grammar to be set to match the first part of the input speech signal based upon the determined context of the first pass result; and generating the first grammar with the portion set to match the first part of the input speech signal. - View Dependent Claims (10, 11, 12)
-
-
13. A computer program product for recognizing speech, the computer program product stored on a computer readable medium and adapted to perform a method comprising:
-
receiving an input speech signal; performing an initial recognition on the input speech signal to generate a first pass result; generating a first grammar based upon the first pass result, the first grammar having a portion set to match a first part of the input speech signal; and applying the first grammar to the input speech signal to generate a second pass result; generating a second grammar based upon the second pass result, the second grammar limiting the second part of the input speech signal to the second pass result and configured to recognize the first part of the input speech signal within the second pass result; and applying the second grammar to the input speech signal to generate a third pass result. - View Dependent Claims (14, 15, 16)
-
-
17. A speech recognition system using a multiple pass speech recognition method including at least a first pass and a second pass, the speech recognition system comprising
a speech recognition engine for performing an initial recognition on an input speech signal in the first pass to generate a first pass result and applying a first grammar to the input speech signal in the second pass to generate a second pass result; -
a grammar database for storing a plurality of grammar; and a dynamic grammar generator for generating the first grammar based upon the first pass result using the grammar stored in the grammar database, the first grammar having a portion set to match a first part of the input speech signal and configured to recognize a second part of the input speech signal, wherein the dynamic grammar generator determines a context of the first pass result and determines the portion of the first grammar to be set to match the first part of the input speech signal based upon the determined context of the first pass result. - View Dependent Claims (18, 19, 20, 21)
-
-
22. A speech recognition system using a multiple pass speech recognition method including at least a first pass and a second pass, the speech recognition system comprising:
-
a speech recognition engine for performing an initial recognition on an input speech signal in the first pass to generate a first pass result and applying a first grammar to the input speech signal in the second pass to generate a second pass result; a grammar database for storing a plurality of grammar; and a dynamic grammar generator for generating the first grammar based upon the first pass result using the grammar stored in the grammar database, the first grammar having a portion set to match a first part of the input speech signal and configured to recognize a second part of the input speech signal, wherein the multiple pass speech recognition method further comprises a third pass, the dynamic grammar generator generating a second grammar based upon the second pass result, the second grammar limiting the second part of the speech to the second pass result and configured to recognize the first part of the input speech signal within the second pass result; and the speech recognition engine applying the second grammar to the input speech signal to generate a third pass result. - View Dependent Claims (23, 24, 25)
-
-
26. A method of recognizing speech, the method comprising:
-
receiving an input speech signal; performing an initial recognition on the input speech signal to generate a first pass result; determining a level of the first pass result in a knowledge hierarchy; and generating a first grammar having a level higher in the knowledge hierarchy than the level of the first pass result, the second pass grammar having a portion set to match a first part of the input speech signal; and applying the first grammar to the input speech signal to generate a second pass result. - View Dependent Claims (27, 28, 29, 30, 31, 32, 33, 34)
-
-
35. A server for use in a networked speech recognition system using a multiple pass speech recognition method including at least a first pass and a second pass for recognition of an input speech signal, the server comprising:
-
a grammar database for storing a plurality of grammar; and a dynamic grammar generator for generating a first grammar based upon a result of the first pass using the grammar stored in the grammar database, the first grammar having a portion set to match a first part of the input speech signal and configured to recognize a second part of the input speech signal, wherein the dynamic grammar generator further generates a second grammar based upon a result of the second pass, the second grammar limiting the second part of the input speech signal to the result of the second pass and configured to recognize the first part of the input speech signal within the result of the second pass result.
-
Specification