Speech recognition using polynomial expansion and hidden markov models
First Claim
Patent Images
1. A speech recognition system, comprising:
- a first section having an input for receiving a spoken command and providing a polynomial expansion of a feature vector generated for the spoken command in a non-training mode;
a second section that provides a polynomial expansion of a feature vector generated in a training mode; and
a third section having a correlator block that correlates the polynomial expansion of the feature vector from the first section with the polynomial expansion of the feature vector from the second section, wherein the third section performs a Hidden Markov Model statistical analysis of a correlated feature vector wherein the third section further includes;
a sequence vector block having an input for receiving a signal from the correlator block;
an HMM table having an output; and
a Viterbi block having a first input coupled to the sequence vector block, a second input coupled to the HMM table, and an output that provides a state sequence that maximizes a probability of identifying the spoken command.
19 Assignments
0 Petitions
Accused Products
Abstract
A speech recognition system (10) having a sampler block (12) and a feature extractor block (14) for extracting time domain and spectral domain parameters from a spoken input speech into a feature vector. A polynomial expansion block (16) generates polynomial coefficients from the feature vector. A correlator block (20), a sequence vector block (22), an HMM table (24) and a Viterbi block (26) perform the actual speech recognition based on the speech units stored in a speech unit table (18) and the HMM word models stored in the HMM table (24). The HMM word model that produces the highest probability is determined to be the word that was spoken.
-
Citations
4 Claims
-
1. A speech recognition system, comprising:
-
a first section having an input for receiving a spoken command and providing a polynomial expansion of a feature vector generated for the spoken command in a non-training mode;
a second section that provides a polynomial expansion of a feature vector generated in a training mode; and
a third section having a correlator block that correlates the polynomial expansion of the feature vector from the first section with the polynomial expansion of the feature vector from the second section, wherein the third section performs a Hidden Markov Model statistical analysis of a correlated feature vector wherein the third section further includes;
a sequence vector block having an input for receiving a signal from the correlator block;
an HMM table having an output; and
a Viterbi block having a first input coupled to the sequence vector block, a second input coupled to the HMM table, and an output that provides a state sequence that maximizes a probability of identifying the spoken command. - View Dependent Claims (2, 3)
-
-
4. A method of identifying a spoken command, the method comprising:
-
providing a training mode for sampling speech that includes, extracting a First set of feature vectors from the sampled speech, averaging consecutive polynomial expansions prior to generating a polynomial expansion of the first set of feature vectors, generating the polynomial expansion of the first set of feature vectors, and quantizing the polynomial expansion;
providing a non-training mode for a speech input that includes, extracting a second set of feature vectors from the speech input, and generating a polynomial expansion of the second set of feature vectors;
correlating the first higher-order vectors generated in the training mode with the second higher-order vectors generated from the spoken command in the non-training mode; and
providing a statistical analysis based on a Hidden Markov Model to identify the spoken command.
-
Specification