Method and apparatus for formant tracking using a residual model

US 7,424,423 B2
Filed: 04/01/2003
Issued: 09/09/2008
Est. Priority Date: 04/01/2003
Status: Expired due to Fees

First Claim

Patent Images

1. A method of tracking formants in a speech signal, the method comprising:

defining a formant search space comprising sets of formants;

identifying formants in a first frame of the speech signal using the entirety of the formant search space by utilizing a residual model that models a difference between an input feature vector representing a frame of the speech signal and a feature vector mapped from a set of formants, wherein utilizing the residual model comprises sequentially applying feature vectors mapped from each of the sets of formants in the formant search space to the residual model to identify a probability for each set of formants; and

identifying formants in a second frame of the speech signal using the entirety of the formant search space.

View all claims

2 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A method of tracking formants defines a formant search space comprising sets of formants to be searched. Formants are identified for a first frame in the speech utterance by searching the entirety of the formant search space using the codebook, and for the remaining frames by searching the same space using both the codebook and the continuity constraint across adjacent frames. Under one embodiment, the formants are identified by mapping sets of formants into feature vectors and applying the feature vectors to a model. Formants are also identified by applying dynamic programming to search for the best sequence that optimally satisfies the continuity constraint required by the model.

Citations

8 Claims

1. A method of tracking formants in a speech signal, the method comprising:
- defining a formant search space comprising sets of formants;
  
  identifying formants in a first frame of the speech signal using the entirety of the formant search space by utilizing a residual model that models a difference between an input feature vector representing a frame of the speech signal and a feature vector mapped from a set of formants, wherein utilizing the residual model comprises sequentially applying feature vectors mapped from each of the sets of formants in the formant search space to the residual model to identify a probability for each set of formants; and
  
  identifying formants in a second frame of the speech signal using the entirety of the formant search space.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
- - 2. The method of claim 1 wherein the residual model is trained using an Expectation Maximization algorithm.
  - 3. The method of claim 1 wherein identifying formants in the first frame comprises selecting the set of formants with the highest probability as the set of formants for the first frame.
  - 4. The method of claim 1 wherein identifying formants in the first frame comprises performing a minimum mean squared error calculation using each of the sets of formants in the formant search space and the probabilities for the sets of formants.
  - 5. The method of claim 1 wherein identifying a probability for a set of formants further comprises determining a probability for transitioning from a set of formants identified in a previous frame to the set of formants in the first frame.
  - 6. The method of claim 5 further comprising identifying a total probability for each of a plurality of sequences of sets of formants for a sequence of frames in the speech signal.
  - 7. The method of claim 6 wherein identifying a set of formants for the first frame comprises selecting the sequence of sets of formants with the highest total probability.
  - 8. The method of claim 6 wherein identifying a set of formants for the first frame comprises making a minimum mean square error calculation using the sets of formants aligned with the first frame in each of the sequences of sets of formants.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Microsoft Technology Licensing LLC (Microsoft Corporation)
Original Assignee
Microsoft Corporation
Inventors
Deng, Li, Acero, Alejandro, Bazzi, Issam
Primary Examiner(s)
Knepper; David D

Application Number

US10/404,411
Publication Number

US 20040199382A1
Time in Patent Office

1,988 Days
Field of Search

704205-220
US Class Current

704/209
CPC Class Codes

G10L 15/02 Feature extraction for spee...

G10L 25/15 the extracted parameters be...

Method and apparatus for formant tracking using a residual model

First Claim

2 Assignments

0 Petitions

Accused Products

Abstract

Citations

8 Claims

Specification

Solutions

Use Cases

Quick Links

Method and apparatus for formant tracking using a residual model

First Claim

2 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

8 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links