Method and apparatus for formant tracking using a residual model
First Claim
1. A method of tracking formants in a speech signal, the method comprising:
defining a formant search space comprising sets of formants;
identifying formants in a first frame of the speech signal using the entirety of the formant search space by utilizing a residual model that models a difference between an input feature vector representing a frame of the speech signal and a feature vector mapped from a set of formants, wherein utilizing the residual model comprises sequentially applying feature vectors mapped from each of the sets of formants in the formant search space to the residual model to identify a probability for each set of formants; and
identifying formants in a second frame of the speech signal using the entirety of the formant search space.
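The per-frame search recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation. The mapping function `map_formants_to_features`, the diagonal-Gaussian form of the residual model, and all numeric values are assumptions chosen only for illustration.

```python
import math

def map_formants_to_features(formants):
    """Hypothetical stand-in for the formant-to-feature mapping.
    Here it only rescales Hz to kHz; a real system would use a richer mapping."""
    return [f / 1000.0 for f in formants]

def residual_log_prob(observed, predicted, mean, var):
    """Log-likelihood of the residual (observed - predicted) under an
    assumed diagonal Gaussian residual model."""
    total = 0.0
    for o, p, m, v in zip(observed, predicted, mean, var):
        r = o - p
        total += -0.5 * (math.log(2 * math.pi * v) + (r - m) ** 2 / v)
    return total

def score_search_space(observed, search_space, mean, var):
    """Sequentially apply every formant set in the search space to the
    residual model and return (formant_set, log_prob) pairs."""
    return [
        (fs, residual_log_prob(observed, map_formants_to_features(fs), mean, var))
        for fs in search_space
    ]

# Illustrative search space of (F1, F2, F3) sets in Hz, and one observed frame.
search_space = [(500, 1500, 2500), (300, 2300, 3000), (700, 1100, 2600)]
observed = [0.31, 2.28, 3.02]

scores = score_search_space(observed, search_space, mean=[0.0] * 3, var=[0.01] * 3)
best_set, best_logp = max(scores, key=lambda s: s[1])
```

Every set in the search space receives a score for the frame, which mirrors the claim's requirement that the entirety of the search space be used rather than a pruned subset.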
Abstract
A method of tracking formants defines a formant search space comprising sets of formants to be searched. Formants are identified for a first frame in the speech utterance by searching the entirety of the formant search space using a codebook, and for the remaining frames by searching the same space using both the codebook and a continuity constraint across adjacent frames. Under one embodiment, the formants are identified by mapping sets of formants into feature vectors and applying the feature vectors to a model. Formants are also identified by applying dynamic programming to search for the sequence of formant sets that best satisfies the continuity constraint required by the model.
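The dynamic-programming search described in the abstract can be sketched as a Viterbi-style pass over the codebook. This is a minimal sketch under stated assumptions, not the method of the specification: the squared-difference transition penalty, the `penalty` weight, and the per-frame scores are all hypothetical.

```python
def viterbi_track(frame_scores, search_space, penalty):
    """Find the best sequence of formant sets across frames.

    frame_scores[t][i] is the log score of formant set i in frame t.
    The continuity constraint is modeled (as an assumption) by a penalty
    proportional to the squared formant change between adjacent frames.
    """
    n = len(search_space)

    def trans(i, j):
        return -penalty * sum(
            (a - b) ** 2 for a, b in zip(search_space[i], search_space[j])
        )

    best = list(frame_scores[0])  # first frame: codebook scores only
    back = []                     # back[t-1][j]: best predecessor of j at frame t
    for t in range(1, len(frame_scores)):
        new_best, ptr = [], []
        for j in range(n):
            i_star = max(range(n), key=lambda i: best[i] + trans(i, j))
            new_best.append(best[i_star] + trans(i_star, j) + frame_scores[t][j])
            ptr.append(i_star)
        best = new_best
        back.append(ptr)

    # Trace back from the best final state.
    j = max(range(n), key=lambda i: best[i])
    path = [j]
    for ptr in reversed(back):
        j = ptr[j]
        path.append(j)
    path.reverse()
    return [search_space[i] for i in path]

# Illustrative (F1, F2) codebook in Hz and made-up per-frame log scores.
search_space = [(500, 1500), (520, 1480), (900, 1100)]
frame_scores = [
    [-1.0, -2.0, -5.0],
    [-2.0, -1.0, -5.0],
    [-3.0, -3.0, -2.5],
]
track = viterbi_track(frame_scores, search_space, penalty=1e-5)
```

In this toy example the third frame, scored in isolation, would favor the set (900, 1100), but the continuity penalty keeps the track on (520, 1480) because the jump from the preceding frame would be too large.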
8 Claims
1. A method of tracking formants in a speech signal, the method comprising:
defining a formant search space comprising sets of formants;
identifying formants in a first frame of the speech signal using the entirety of the formant search space by utilizing a residual model that models a difference between an input feature vector representing a frame of the speech signal and a feature vector mapped from a set of formants, wherein utilizing the residual model comprises sequentially applying feature vectors mapped from each of the sets of formants in the formant search space to the residual model to identify a probability for each set of formants; and
identifying formants in a second frame of the speech signal using the entirety of the formant search space.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
Specification