Speech recognition using polynomial expansion and hidden Markov models

US 6,928,409 B2
Filed: 05/31/2001
Issued: 08/09/2005
Est. Priority Date: 05/31/2001
Status: Expired due to Fees

Abstract

A speech recognition system (10) has a sampler block (12) and a feature extractor block (14) for extracting time-domain and spectral-domain parameters from spoken input into a feature vector. A polynomial expansion block (16) generates polynomial coefficients from the feature vector. A correlator block (20), a sequence vector block (22), an HMM table (24) and a Viterbi block (26) perform the actual speech recognition based on the speech units stored in a speech unit table (18) and the HMM word models stored in the HMM table (24). The HMM word model that produces the highest probability is determined to be the word that was spoken.
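
As an illustration of what the polynomial expansion block (16) might compute, the following minimal sketch expands a feature vector into all monomials of its components up to a chosen order. The function name `polynomial_expansion`, the default order of 2, and the ordering of the terms are assumptions made for this example; the text above does not specify them.

```python
from itertools import combinations_with_replacement


def polynomial_expansion(features, order=2):
    """Return all monomials of the feature values up to the given order.

    The leading 1.0 is the degree-0 term. The basis ordering is an
    illustrative choice, not something taken from the patent text.
    """
    expanded = [1.0]
    for degree in range(1, order + 1):
        # Each index tuple picks which features are multiplied together.
        for idx in combinations_with_replacement(range(len(features)), degree):
            term = 1.0
            for i in idx:
                term *= features[i]
            expanded.append(term)
    return expanded


# Hypothetical two-element feature vector [x1, x2] = [2.0, 3.0]:
# terms are 1, x1, x2, x1*x1, x1*x2, x2*x2
example = polynomial_expansion([2.0, 3.0], order=2)
print(example)  # [1.0, 2.0, 3.0, 4.0, 6.0, 9.0]
```

A higher order yields a longer expanded vector, so in practice the order would be a trade-off between discriminative power and storage.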

4 Claims

1. A speech recognition system, comprising:
- a first section having an input for receiving a spoken command and providing a polynomial expansion of a feature vector generated for the spoken command in a non-training mode;
  
  a second section that provides a polynomial expansion of a feature vector generated in a training mode; and
  
  a third section having a correlator block that correlates the polynomial expansion of the feature vector from the first section with the polynomial expansion of the feature vector from the second section, wherein the third section performs a Hidden Markov Model statistical analysis of a correlated feature vector, wherein the third section further includes:
  
  a sequence vector block having an input for receiving a signal from the correlator block;
  
  an HMM table having an output; and
  
  a Viterbi block having a first input coupled to the sequence vector block, a second input coupled to the HMM table, and an output that provides a state sequence that maximizes a probability of identifying the spoken command.
  - 2. The speech recognition system of claim 1, wherein the first section further includes:
    - a sampler block having an input for receiving the spoken command;
      
      a feature extractor having an input coupled to an output of the sampler block; and
      
      a polynomial expansion block having an input coupled to the feature extractor and an output that provides the polynomial expansion of the feature vector generated for the spoken command.
  - 3. The speech recognition system of claim 1, wherein the second section further includes:
    - a feature vector generator;
      
      a polynomial expansion block having an input coupled to the feature vector generator, a vector quantizer block having an input coupled to an output of the polynomial expansion block; and
      
      a processing block having an input coupled to an output of the vector quantizer block and an output that provides the polynomial expansion of the feature vector generated in the training mode.
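
Claim 1 recites a Viterbi block whose output is a state sequence that maximizes a probability. A minimal sketch of standard Viterbi decoding for a discrete HMM is shown here to make that step concrete. The function name `viterbi`, the list-of-lists table layout, and the toy two-state model are assumptions for this example and are not taken from the patent.

```python
import math


def safe_log(p):
    """Natural log that maps zero probability to negative infinity."""
    return math.log(p) if p > 0 else float("-inf")


def viterbi(observations, start_p, trans_p, emit_p):
    """Return (best_log_prob, best_state_path) for a discrete HMM.

    start_p[s]      probability of starting in state s
    trans_p[p][s]   probability of moving from state p to state s
    emit_p[s][o]    probability of state s emitting symbol o
    """
    n_states = len(start_p)
    # scores[s] = best log-probability of any path ending in state s
    scores = [safe_log(start_p[s]) + safe_log(emit_p[s][observations[0]])
              for s in range(n_states)]
    backpointers = []
    for obs in observations[1:]:
        new_scores, pointers = [], []
        for s in range(n_states):
            best_prev = max(range(n_states),
                            key=lambda p: scores[p] + safe_log(trans_p[p][s]))
            new_scores.append(scores[best_prev]
                              + safe_log(trans_p[best_prev][s])
                              + safe_log(emit_p[s][obs]))
            pointers.append(best_prev)
        scores = new_scores
        backpointers.append(pointers)
    # Trace back from the best final state.
    last = max(range(n_states), key=lambda s: scores[s])
    path = [last]
    for pointers in reversed(backpointers):
        path.append(pointers[path[-1]])
    path.reverse()
    return scores[last], path


# Hypothetical left-to-right word model with two states and two symbols.
start = [1.0, 0.0]
trans = [[0.5, 0.5], [0.0, 1.0]]
emit = [[0.9, 0.1], [0.2, 0.8]]
log_prob, path = viterbi([0, 0, 1, 1], start, trans, emit)
print(path)  # [0, 0, 1, 1]
```

To pick a word, one could run this once per stored word model and keep the model with the highest returned log-probability, which mirrors the selection rule stated in the abstract.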

4. A method of identifying a spoken command, the method comprising:
- providing a training mode for sampling speech that includes, extracting a first set of feature vectors from the sampled speech, averaging consecutive polynomial expansions prior to generating a polynomial expansion of the first set of feature vectors, generating the polynomial expansion of the first set of feature vectors, and quantizing the polynomial expansion;
  
  providing a non-training mode for a speech input that includes, extracting a second set of feature vectors from the speech input, and generating a polynomial expansion of the second set of feature vectors;
  
  correlating the first higher-order vectors generated in the training mode with the second higher-order vectors generated from the spoken command in the non-training mode; and
  
  providing a statistical analysis based on a Hidden Markov Model to identify the spoken command.
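
The correlating step of claim 4 can be pictured with a small sketch that scores each expanded input frame against stored training-mode vectors and records the best-matching index per frame, giving a symbol sequence that an HMM stage could consume. The inner product as the correlation measure, the names `correlate` and `sequence_vector`, and the toy three-element vectors are all assumptions for illustration; the claim does not fix any of them.

```python
def correlate(frame, unit_models):
    """Inner product of one expanded frame with each stored speech-unit vector."""
    return [sum(f * m for f, m in zip(frame, model)) for model in unit_models]


def sequence_vector(frames, unit_models):
    """Index of the best-correlating stored vector for each input frame."""
    seq = []
    for frame in frames:
        scores = correlate(frame, unit_models)
        seq.append(max(range(len(scores)), key=scores.__getitem__))
    return seq


# Hypothetical stand-ins for vectors produced in the training mode.
unit_models = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
# Hypothetical stand-ins for expanded vectors from the non-training mode.
frames = [[0.9, 0.1, 0.0], [0.2, 0.7, 0.1], [0.0, 0.3, 0.8]]

symbols = sequence_vector(frames, unit_models)
print(symbols)  # [0, 1, 2]
```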

Current Assignee: North Star Innovations Inc. (Wi-LAN Inc.)
Original Assignee: Freescale Semiconductor, Inc. (NXP Semiconductors NV)
Inventors: Barron, David L.; Yip, William Chunhung
Primary Examiner(s): Chawan, Vijay B.
Application Number: US09/871,063
Publication Number: US 20020184025A1
Time in Patent Office: 1,531 Days
Field of Search: 704/256, 704/242, 704/243, 704/231, 704/254, 704/232, 704/200, 706/20
US Class Current: 704/256
CPC Class Codes:
G10L 15/02   Feature extraction for spee...
G10L 15/142   Hidden Markov Models [HMMs]
G10L 25/27   characterised by the analys...