Automatic speech recognition system using seed templates
First Claim
1. A training subsystem for an automatic speech recognition system having a training capability for training a vocabulary of words to be recognized by said system, comprising:
a seed template set for maintaining templates of template parameters for a limited set of seed words which are preselected to be short, commonly-used, and easily recognized words;
a training input for providing a training speech sample for training each vocabulary word to be recognized by the automatic speech recognition system, wherein each training speech sample consists of a spoken phrase of a vocabulary word juxtaposed with at least one seed word included in said seed template set;
an extractor for extracting template parameters for each of the words of a training speech sample provided by said training input, wherein said extractor is enabled to extract the template parameters for the vocabulary word of the training speech sample by using the template maintained in said seed template set for the at least one seed word of the training speech sample;
a training control module for controlling said extractor to provide the extracted template parameters for each vocabulary word of the respective training speech samples and for generating corresponding vocabulary word templates;
a dictionary storage for storing the templates for the respective vocabulary words as extracted by said extractor and generated under control of said training control module; and
said training control module being operative for controlling said extractor to provide the extracted template parameters for the at least one seed word of the training speech sample and for updating the corresponding seed word template of said seed template set so that the updated seed word template can be used for subsequent training speech samples.
Abstract
An automatic speech recognition system has a multi-mode training capability using a set of previously stored templates of a limited number of predetermined seed words to train the templates for a vocabulary of words. Each training speech sample includes a vocabulary word juxtaposed with a seed word. An averager module maintains an active average template for each of the word units of the training speech samples, including the seed word units, and the active average templates are used to continuously update the seed template set as the seed words are used in the training speech samples. The preferred training procedure employs training phrases each having a vocabulary word embedded between two seed words, and two seed template sets are used in succession, the first composed of single-digit words and the second composed of carrier words.
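The averager described in the abstract can be pictured as a running mean kept per word unit. The following is only a minimal sketch under stated assumptions, not the patented implementation: the names `ActiveAverage` and `Averager` are hypothetical, each template is simplified to a fixed-length list of numbers that has already been time-normalized, and a stored seed template is counted as one prior observation.

```python
from dataclasses import dataclass, field


@dataclass
class ActiveAverage:
    """Running mean of fixed-length parameter vectors for one word unit (hypothetical)."""
    mean: list = field(default_factory=list)
    count: int = 0

    def update(self, params):
        """Fold one new observation into the mean and return the updated mean."""
        if self.count == 0:
            self.mean = list(params)
        elif len(params) != len(self.mean):
            raise ValueError("parameter vectors must have equal length")
        else:
            n = self.count + 1
            # Incremental mean: new_mean = old_mean + (x - old_mean) / n
            self.mean = [m + (p - m) / n for m, p in zip(self.mean, params)]
        self.count += 1
        return self.mean


class Averager:
    """Keeps one active average per word unit, seed words included (hypothetical)."""

    def __init__(self, seed_templates):
        # Assumption: each stored seed template counts as one prior observation.
        self.averages = {
            word: ActiveAverage(list(template), 1)
            for word, template in seed_templates.items()
        }

    def observe(self, word, params):
        """Update (or start) the active average for `word`."""
        return self.averages.setdefault(word, ActiveAverage()).update(params)


averager = Averager({"one": [1.0, 3.0]})
seed_mean = averager.observe("one", [3.0, 5.0])    # seed word template is refined
averager.observe("call", [2.0, 2.0])               # first sample of a vocabulary word
vocab_mean = averager.observe("call", [4.0, 0.0])  # second sample is averaged in
```

Because seed words recur in every training phrase, their averages accumulate observations faster than any single vocabulary word, which is consistent with the abstract's statement that the seed template set is continuously updated.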
14 Claims
8. A method for training a vocabulary of words to be recognized by an automatic speech recognition system, comprising the steps of:
maintaining a seed template set of templates of template parameters for a limited set of seed words which are preselected to be short, commonly-used, and easily recognized words;
providing a training speech sample for training each vocabulary word to be recognized by the automatic speech recognition system, wherein each training speech sample consists of a spoken phrase of a vocabulary word juxtaposed with at least one seed word included in said seed template set;
extracting template parameters for each of the words of a training speech sample, wherein extracting the template parameters for the vocabulary word of the training speech sample is enabled by using the template maintained in said seed template set for the at least one seed word of the training speech sample;
using the extracted template parameters for each vocabulary word of the respective training speech samples to generate corresponding vocabulary word templates;
storing the templates for the respective vocabulary words in a dictionary storage to be used by the automatic speech recognition system for recognizing vocabulary words; and
further using the extracted template parameters for the at least one seed word of the respective training speech samples to update the seed word templates of said seed template set so that the updated seed word templates can be used for subsequent training speech samples.
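The steps of the method claim can be illustrated with a toy sketch. Everything here is an assumption made for illustration: frames are single numbers rather than parameter vectors, the phrase is taken to be one seed word followed by one vocabulary word, the seed word is located by a brute-force search over cut points with a crude nearest-frame time normalization (the claim does not specify a matching technique), and the seed update is a plain 50/50 blend. The function names `resample`, `distance`, `split_phrase`, and `train_word` are hypothetical.

```python
def resample(frames, length):
    """Pick `length` frames spread evenly over `frames` (crude time normalization)."""
    if length == 1:
        return [frames[0]]
    last = len(frames) - 1
    return [frames[round(i * last / (length - 1))] for i in range(length)]


def distance(a, b):
    """Squared Euclidean distance between two equal-length sequences."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def split_phrase(frames, seed_template):
    """Split '<seed word><vocabulary word>' at the cut that best matches the seed."""
    if len(frames) < 2:
        raise ValueError("need at least one frame per word")
    size = len(seed_template)
    # Every candidate cut leaves at least one frame on each side.
    cut = min(
        range(1, len(frames)),
        key=lambda c: distance(resample(frames[:c], size), seed_template),
    )
    return frames[:cut], frames[cut:]


def train_word(frames, seed_word, vocab_word, seed_set, dictionary):
    """One training pass: extract both words, store the new one, refresh the seed."""
    seed_frames, vocab_frames = split_phrase(frames, seed_set[seed_word])
    # Store the vocabulary word template in the dictionary.
    dictionary[vocab_word] = list(vocab_frames)
    # Update the seed template so later phrases are matched against it.
    old = seed_set[seed_word]
    observed = resample(seed_frames, len(old))
    seed_set[seed_word] = [(o + s) / 2 for o, s in zip(old, observed)]


seed_set = {"one": [1.0, 1.0, 5.0, 5.0]}
dictionary = {}
train_word([2.0, 2.0, 6.0, 6.0, 9.0, 9.0, 9.0], "one", "call", seed_set, dictionary)
```

In this toy run the first four frames match the stored template for "one" most closely, so the remaining frames are taken as the template for "call", and the template for "one" moves halfway toward what was actually spoken.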
Specification