Method for constructing a model of a new word for addition to a word model database of a speech recognition system
First Claim
1. A method for constructing a model of a new word for addition to a word model database of a speech recognition system which includes a plurality of pre-existing word models based on an inventory of models of sub-word units, the method comprising the steps of:
- receiving a plurality of utterances that each purportedly conform to the new word;
determining an average length of the plurality of utterances;
constructing a whole word model of the new word, using the plurality of utterances but without reference to the inventory of models of sub-word units, wherein the whole word model has a length equal to the average length of the plurality of utterances;
constructing a reference template represented by a sequence of averaged feature values of the plurality of utterances; and
,matching the sequence of averaged feature values to the models of sub-word units stored in the inventory of the speech recognition system in order to construct the model of the new word for addition to the word model database of the speech recognition system, whereby the thusly constructed model of the new word is based upon the inventory of models of sub-word units, thereby facilitating the recognition of subsequent utterances of the new word using the inventory of models of sub-word units.
1 Assignment
0 Petitions
Accused Products
Abstract
For speech recognition a new word is represented as based on a stored inventory of models of sub-word units. First a plurality of utterances are presented that all should conform to the word. For building a word model from the utterances, these are represented by a sequence of feature vectors. First, the utterances are used to train a whole-word model that is independent of the models of the sub-word units. The length of the whole-word model equals the average length of the utterances. Next, a sequence of Markov states and associated probability densities of acoustic events of the whole-word model is interpreted as a reference template represented by a string of averaged feature vectors. Finally, the string is recognized by matching to models in the inventory and storing a recognition result as a model of the utterances.
60 Citations
4 Claims
-
1. A method for constructing a model of a new word for addition to a word model database of a speech recognition system which includes a plurality of pre-existing word models based on an inventory of models of sub-word units, the method comprising the steps of:
-
receiving a plurality of utterances that each purportedly conform to the new word; determining an average length of the plurality of utterances; constructing a whole word model of the new word, using the plurality of utterances but without reference to the inventory of models of sub-word units, wherein the whole word model has a length equal to the average length of the plurality of utterances; constructing a reference template represented by a sequence of averaged feature values of the plurality of utterances; and
,matching the sequence of averaged feature values to the models of sub-word units stored in the inventory of the speech recognition system in order to construct the model of the new word for addition to the word model database of the speech recognition system, whereby the thusly constructed model of the new word is based upon the inventory of models of sub-word units, thereby facilitating the recognition of subsequent utterances of the new word using the inventory of models of sub-word units. - View Dependent Claims (2)
-
-
3. A speech recognition device, comprising:
a word model database which includes a plurality of pre-existing word models based on an inventory of models of sub-word units; means for receiving a plurality of utterances that each purportedly conform to a new word which does not correspond to any of the plurality of pre-existing word models included in the word model database; means for determining an average length of the plurality of utterances; means for constructing a whole word model of the new word, using the plurality of utterances, but without reference to the inventory of models of sub-word units, wherein the whole word model has a length equal to the average length of the plurality of utterances; means for constructing a reference template represented by a sequence of averaged feature values of the plurality of utterances; and
,means for matching the sequence of averaged feature values to the models of subword units stored in the inventory in order to construct the model of the new word for addition to the word model database, whereby the thusly constructed model of the new word is based upon the inventory of models of sub-word units, thereby facilitating the recognition of subsequent utterances of the new word using the inventory of models of sub-word units. - View Dependent Claims (4)
Specification