Use of periodicity and jitter for automatic speech recognition
First Claim
Patent Images
1. A method of speech recognition comprising the step of:
- a. starting with a standard feature vector;
b. including at least one voicing feature and at least one time derivative of at least one voicing feature with said standard feature vector; and
c. using said standard feature vector with said included features to recognize speech.
3 Assignments
0 Petitions
Accused Products
Abstract
A class of features related to voicing parameters that indicate whether the vocal chords are vibrating. Features describing voicing characteristics of speech signals are integrated with an existing 38-dimensional feature vector consisting of first and second order time derivatives of the frame energy and of the cepstral coefficients with their first and second derivatives. Hidden Markov Model (HMM)-based connected digit recognition experiments comparing the traditional and extended feature sets show that voicing features and spectral information are complementary and that improved speech recognition performance is obtained by combining the two sources of information.
18 Citations
18 Claims
-
1. A method of speech recognition comprising the step of:
-
a. starting with a standard feature vector; b. including at least one voicing feature and at least one time derivative of at least one voicing feature with said standard feature vector; and c. using said standard feature vector with said included features to recognize speech. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. An apparatus for speech recognition, comprising:
-
means for determining a standard feature vector; means for storing a standard feature vector after it has been determined; means for including at least one voicing feature and at least one time derivative of at least one voicing feature with said stored standard feature vector; and means for using said stored standard feature vector and said included voicing features to recognize speech; wherein an error rate for speech recognition is reduced because of a robustness resulting from including the at least one voicing feature and the at least one time derivative of at least one voicing feature. - View Dependent Claims (14, 15, 16, 17, 18)
-
Specification