Method and apparatus for transitioning from one voice recognition system to another
Abstract
Converting speech recognition templates or models from a first format to a second format improves the recognition rate achieved using the converted templates or models. Storing source and/or scoring information for templates or models so that converted models or templates can be scored differently than original models or templates reflects the effect the conversion process has on recognition scores. In one embodiment, to enhance recognition results, an available compressed voice recording is used in the conversion process. The conversion process of the present invention is described using the conversion of dynamic time warping templates into Hidden Markov Models. The generation of garbage models is also described. In one embodiment, a garbage model is generated dynamically at recognition time using a period of silence in the utterance upon which the recognition operation is to be performed as the source of the data required to generate the garbage model.
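The dynamic garbage model mentioned at the end of the abstract can be illustrated with a minimal sketch. It assumes, purely for illustration, that each frame is a feature vector whose first element is an energy value, that silence is detected with a fixed energy threshold, and that the garbage model is a single diagonal Gaussian. The names `longest_silence` and `build_garbage_model` are hypothetical and do not come from the patent.

```python
from statistics import fmean, pvariance


def longest_silence(frames, energy_threshold):
    """Return the longest run of consecutive low-energy frames.

    Assumption: frame[0] holds the frame energy.
    """
    best, current = [], []
    for frame in frames:
        if frame[0] < energy_threshold:
            current.append(frame)
            if len(current) > len(best):
                best = list(current)
        else:
            current = []
    return best


def build_garbage_model(silence_frames, variance_floor=1e-3):
    """Estimate a single diagonal Gaussian from the silence frames.

    Assumption: one Gaussian state stands in for the garbage model.
    """
    if not silence_frames:
        raise ValueError("no silence frames found in the utterance")
    dims = list(zip(*silence_frames))  # one tuple per feature dimension
    means = [fmean(d) for d in dims]
    # Floor the variances so a very flat silence segment stays usable.
    variances = [max(pvariance(d), variance_floor) for d in dims]
    return {"means": means, "variances": variances}
```

Because the model is rebuilt from the utterance being recognized, it would track the background conditions of that particular call rather than those present at training time.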
30 Claims
1. A method of using a first set of speech characteristic information including one of a speech recognition template and a speech recognition model which was previously generated for use by a first speech recognition system, to generate a second set of speech characteristic information for use by a second speech recognition system, the method comprising the steps of:
generating, from the one of the speech recognition template and model included in the first set of speech characteristic information, additional speech characteristic information not included in the first set of speech characteristic information; and
combining the generated additional speech characteristic information, with at least some information obtained from the first set of speech characteristic information, to generate the second set of speech characteristic information.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13
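The two steps of claim 1 can be sketched for the case the abstract names, converting a dynamic time warping template into a Hidden Markov Model. The details below are assumptions made for illustration only: the template is a list of feature vectors, state means are obtained by averaging equal-length template segments, the generated additional information consists of a default variance and duration-based transition probabilities, and the function name `template_to_hmm` is hypothetical.

```python
def template_to_hmm(template, num_states, default_variance=1.0):
    """Convert a DTW template (list of feature vectors) into a
    left-to-right HMM description.

    Assumptions: equal-length segmentation, a fixed default variance,
    and self-loop probabilities derived from segment duration.
    """
    n = len(template)
    if not 0 < num_states <= n:
        raise ValueError("num_states must be between 1 and len(template)")
    states = []
    for s in range(num_states):
        segment = template[s * n // num_states:(s + 1) * n // num_states]
        # Information obtained from the first set: the averaged features.
        mean = [sum(col) / len(segment) for col in zip(*segment)]
        # Generated additional information: variance and transitions.
        # A geometric duration with self-loop p has mean 1 / (1 - p).
        self_loop = 1.0 - 1.0 / len(segment)
        states.append({
            "mean": mean,
            "variance": [default_variance] * len(mean),
            "self_loop": self_loop,
            "advance": 1.0 - self_loop,
        })
    return states
```

The template carries no variance or transition data, which is why this sketch has to generate those fields rather than copy them.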
14. A method of using a first set of speech characteristic information which was previously generated for use by a first speech recognition system, to generate a second set of speech characteristic information for use by a second speech recognition system, the first set of speech characteristic information representing a segment of audible speech, the method comprising the steps of:
generating, from the first set of speech characteristic information, additional speech characteristic information not included in the first set of speech characteristic information;
combining the generated additional speech characteristic information, with at least some information obtained from the first set of speech characteristic information, to generate the second set of speech characteristic information;
decompressing a compressed voice recording;
generating a third set of speech recognition characteristic information from the decompressed voice recording; and
combining the second and third sets of speech characteristic information to generate a fourth set of speech characteristic information.
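The final combining step of claim 14 might look like the following sketch. It assumes that decompression and feature extraction have already produced per-state mean vectors for the recording, and that the two sets are combined by simple weighted interpolation. The claim does not specify how the sets are combined, so the interpolation, the `recording_weight` parameter, and the name `combine_state_means` are all hypothetical.

```python
def combine_state_means(converted_means, recording_means, recording_weight=0.5):
    """Interpolate per-state mean vectors from two sources.

    converted_means: means from the converted model (second set).
    recording_means: means derived from the decompressed recording (third set).
    Assumption: both inputs have the same number of states and dimensions.
    """
    if len(converted_means) != len(recording_means):
        raise ValueError("both sets must describe the same number of states")
    if not 0.0 <= recording_weight <= 1.0:
        raise ValueError("recording_weight must be within [0, 1]")
    w = recording_weight
    return [
        [(1.0 - w) * c + w * r for c, r in zip(conv, rec)]
        for conv, rec in zip(converted_means, recording_means)
    ]
```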
15. A method of using a first set of speech characteristic information which was previously generated for use by a first speech recognition system, to generate a second set of speech characteristic information and for using the second set of speech characteristic information, the first set of speech characteristic information representing a segment of audible speech, the method comprising the steps of:
generating, from the first set of speech characteristic information, additional speech characteristic information not included in the first set of speech characteristic information;
combining the generated additional speech characteristic information, with at least some information obtained from the first set of speech characteristic information, to generate the second set of speech characteristic information;
generating an indicator of the source of the second set of speech characteristic information;
storing the second set of speech characteristic information and the source indicator in a database;
using the second set of speech characteristic information and generated speaker dependent garbage model to perform a speech recognition operation on speech provided by a user against the seed.
Dependent claims: 16, 17
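The source indicator and storing steps of claim 15, together with the abstract's point that converted models can be scored differently than original ones, can be sketched as follows. The in-memory dictionary standing in for the database, the `StoredModel` record, and the offset values in `SCORE_OFFSET` are hypothetical choices for illustration, not values taken from the patent.

```python
from dataclasses import dataclass, field


@dataclass
class StoredModel:
    name: str
    states: list = field(default_factory=list)
    source: str = "original"  # source indicator: "original" or "converted"


# Hypothetical per-source score adjustments (illustrative values only).
SCORE_OFFSET = {"original": 0.0, "converted": -0.5}


def store(database, model):
    """Store the model together with its source indicator."""
    if model.source not in SCORE_OFFSET:
        raise ValueError(f"unknown source indicator: {model.source}")
    database[model.name] = model


def adjusted_score(database, name, raw_score):
    """Adjust a raw recognition score according to the model's source."""
    return raw_score + SCORE_OFFSET[database[name].source]
```

Keeping the indicator next to the model means the recognizer can decide at scoring time how much to trust a converted entry, without having to re-run the conversion.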
18. A method of converting a speech recognition template including a first set of speech characteristic data into a second speech recognition template including a second different set of speech characteristic data, the method comprising the steps of:
processing the first set of speech characteristic data included in the first template to produce therefrom a first generated set of speech characteristic data;
decompressing a compressed speech recording to generate decompressed speech;
processing the decompressed speech to generate a third set of speech characteristic data;
generating the second set of speech characteristic data included in the second template, from the first set of speech characteristic data, the generated first set of speech characteristic data, and the third set of speech characteristic data.
Dependent claims: 19, 20, 21, 22, 23, 24, 25
26. An apparatus comprising:
means for generating a second speaker dependent speech recognition template having a second format and second data content from a first speaker dependent speech template having a first format and a first data content, the first and second formats being different; and
means for storing the second speaker dependent speech recognition template in a database.
Dependent claims: 27, 28, 29, 30
Specification