Speech recoginition methods and apparatus
First Claim
1. A speech recognition method, comprising the steps of:
- processing a signal including an utterance to recognize a word included in the utterance, the processing step including the steps of;
generating a set of signal characteristic information from the utterance and;
scoring the set of signal characteristic information against a plurality of different speech models at least two of which were generated using different speech model generation techniques, the scoring against different speech models which were generated using different model generation techniques being performed differently.
7 Assignments
0 Petitions
Accused Products
Abstract
Methods and apparatus for transitioning from one speech recognition system to another and for reusing existing speech recognition data are described. In particular, various methods of converting speech recognition templates or models from a first format to a second format are described. Methods for improving the recognition rate achieved using converted templates or models are also described. These methods involve storing source and/or scoring information for templates or models so that converted models or templates can be scored differently than original models or templates to thereby reflect the effect the conversion process has on recognition scores. In order to enhance recognition results in one embodiment an available compressed voice recording is used in the conversion process. The methods and apparatus of the present invention can be applied to a wide variety of speech recognition template and model conversion applications. Methods and apparatus for generating garbage models are also described. In one embodiment a garbage model is generated dynamically at recognition time using a period of silence in the utterance upon which the recognition operation is to be performed as the source of the data required to generated the garbage model. In this manner a garbage model is generated to reflect the particular background noise conditions, associated with a particular utterance, upon which speech recognition is to be performed.
-
Citations
25 Claims
-
1. A speech recognition method, comprising the steps of:
processing a signal including an utterance to recognize a word included in the utterance, the processing step including the steps of; generating a set of signal characteristic information from the utterance and; scoring the set of signal characteristic information against a plurality of different speech models at least two of which were generated using different speech model generation techniques, the scoring against different speech models which were generated using different model generation techniques being performed differently. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
14. A method of performing speech recognition comprising the steps of:
-
storing speech recognition models from a first source; storing speech recognition models from a second source; receiving a signal including a segment of speech upon which a speech recognition task is to be performed; accessing said stored speech recognition models; scoring, the received segment of speech against the accessed speech recognition models, the scoring being performed in such a manner that the scoring applied to models from the first source is different than the scoring applied to models from the first source; and determining if the received segment of speech corresponds to one of the accessed speech recognition models as a function of a result of the scoring operation. - View Dependent Claims (15, 16, 17, 18, 19)
-
-
20. A method of processing a signal including speech, the method comprising the steps of:
-
receiving the signal; generating a silence model from a portion of the signal which precedes the speech; and using the generated silence model to perform a speech recognition operation on the speech included in the received signal. - View Dependent Claims (21, 22, 23, 24)
-
-
25. A speech recognition system, comprising:
-
means for receiving a signal including a segment of speech; means for generating a silence model from a portion of said signal which precedes the segment of speech; and means for performing a speech recognition operation on said segment of speech using the generated silence model and at least one additional speech recognition model.
-
Specification