Speech recognition apparatus and method
First Claim
Patent Images
1. Apparatus which receives input spoken vocabulary words during a training phase of operation and subsequently recognizes received input spoken command words, comprising:
- feature extraction means for processing received input words and generating digital feature signals dependent upon the features present in said input words;
means for forming, for each vocabulary word, a time dependent reference array having digital information at each array position representative of the presence or absence of a particular feature signal at a particular time slot during at least a predetermined fraction of a number of training utterances of said vocabulary word and also representative of the consistency of occurrence of said particular feature signal at said particular time slot during said number of training utterances of said vocabulary word;
means for forming a time dependent command word feature array having digital information at each array position representative of the presence or absence of a particular feature signal at a particular time slot during a command word candidate;
means for comparing, member-by-member, the command word feature array with the reference array for each vocabulary word; and
means for selecting the vocabulary word those reference array yields the highest correlation with said command word feature array.
1 Assignment
0 Petitions
Accused Products
Abstract
In this speech recognition system the array formed by a timewise sequence of speech signal feature vectors includes digital data at each time slot representing both presence/absence and consistency of occurence.
-
Citations
24 Claims
-
1. Apparatus which receives input spoken vocabulary words during a training phase of operation and subsequently recognizes received input spoken command words, comprising:
-
feature extraction means for processing received input words and generating digital feature signals dependent upon the features present in said input words; means for forming, for each vocabulary word, a time dependent reference array having digital information at each array position representative of the presence or absence of a particular feature signal at a particular time slot during at least a predetermined fraction of a number of training utterances of said vocabulary word and also representative of the consistency of occurrence of said particular feature signal at said particular time slot during said number of training utterances of said vocabulary word; means for forming a time dependent command word feature array having digital information at each array position representative of the presence or absence of a particular feature signal at a particular time slot during a command word candidate; means for comparing, member-by-member, the command word feature array with the reference array for each vocabulary word; and means for selecting the vocabulary word those reference array yields the highest correlation with said command word feature array. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A method for receiving input spoken vocabulary words during a training phase of operation and subsequently recognizing input spoken command words, comprising the steps of:
-
generating digital feature signals dependent upon the features present in said received input words; forming, for each vocabulary word, a time dependent reference array having digital information at each array position representative of the presence or absence of a particular feature signal at a particular time slot during at least a predetermined fraction of a number of training utterances of said vocabulary word and also representative of the consistency of occurrence of said particular feature signal at said particular time slot during said number of training utterances of said vocabulary word; forming a time dependent command word feature array having digital information at each array position representative of the presence or absence of a particular feature at a particular time slot during a command word candidate; comparing, member-by-member, the command word feature array with the reference array for each vocabulary word; and selecting the vocabulary word whose reference array yields the highest correlation with said command word feature array. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24)
-
Specification