Hidden trajectory modeling with differential cepstra for speech recognition
First Claim
1. A method comprising:
- receiving a speech input;
generating an interpretation of the speech input using hidden trajectory modeling with one or more observation vectors that are based at least in part on cepstra and on differential cepstra derived from the cepstra wherein using the hidden trajectory modeling comprises using prediction functions for the cepstra to estimate a mean for a distribution of target vocal tract resonances, and using prediction functions for the differential cepstra to estimate a vector-valued parameter related to the mean for the distribution of target vocal tract resonances, wherein the prediction functions are applied to time differences of sequential vocal tract resonance trajectories; and
providing an output based at least in part on the interpretation of the speech input.
2 Assignments
0 Petitions
Accused Products
Abstract
A novel system for speech recognition uses differential cepstra over time frames as acoustic features, together with the traditional static cepstral features, for hidden trajectory modeling, and provides greater accuracy and performance in automatic speech recognition. According to one illustrative embodiment, an automatic speech recognition method includes receiving a speech input, generating an interpretation of the speech, and providing an output based at least in part on the interpretation of the speech input. The interpretation of the speech uses hidden trajectory modeling with observation vectors that are based on cepstra and on differential cepstra derived from the cepstra. A method is developed that can automatically train the hidden trajectory model'"'"'s parameters that are corresponding to the components of the differential cepstra in the full acoustic feature vectors.
35 Citations
16 Claims
-
1. A method comprising:
-
receiving a speech input; generating an interpretation of the speech input using hidden trajectory modeling with one or more observation vectors that are based at least in part on cepstra and on differential cepstra derived from the cepstra wherein using the hidden trajectory modeling comprises using prediction functions for the cepstra to estimate a mean for a distribution of target vocal tract resonances, and using prediction functions for the differential cepstra to estimate a vector-valued parameter related to the mean for the distribution of target vocal tract resonances, wherein the prediction functions are applied to time differences of sequential vocal tract resonance trajectories; and providing an output based at least in part on the interpretation of the speech input. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
- 9. A computer-readable medium having computer-executable instructions which, when executed by a computing device with a processor, enable the computing device to receive, at the processor, a speech signal and use time-ordered differences in observation trajectory cepstra derived from the speech signal by the processor and time-ordered differences in vocal tract resonance trajectory cepstra derived from the speech signal by the processor to evaluate, with the processor, a probability of an observed value corresponding to a hidden trajectory valuer by evaluating a parameter-free, non-linear prediction function acting on the time-ordered differences in vocal tract resonance trajectory cepstra, and using the prediction function to evaluate a probability of the observed value corresponding to the hidden trajectory value and to output an interpretation of the speech signal based on the probability of the observed value.
- 15. A computer that runs an executable application for a speech recognition application, the speech recognition application comprising a hidden trajectory model with parameters trained using cepstra and differential cepstra, the computer running the speech recognition application to receive a speech signal, use the hidden trajectory model to recognize speech in the speech signal and output the recognized speech.
Specification