Speaker-trained speech recognizer having the capability of detecting confusingly similar vocabulary words
Abstract
During a training sequence, a speaker-trained speech recognizer detects and signals the speaker when vocabulary word pairs are potentially confusing to the recognizer. Each vocabulary word is converted into feature signals and then parameters representing a predetermined reference model of that word. The feature signals of a subsequent potential vocabulary word are compared against the reference model of each vocabulary word previously stored in the recognizer memory. The speaker is signaled when the potential vocabulary word is confusingly similar to one of the existing vocabulary words.
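The conversion from feature signals to reference-model parameters, and the comparison of a new word against that model, can be illustrated with a minimal sketch. The abstract does not say which features, model, or distance measure the recognizer uses, so everything here is an assumption for illustration: feature signals are lists of per-frame numeric vectors, the "parameters" are simply the per-dimension mean of those frames, and the difference is a mean Euclidean distance. The names `build_reference_model` and `difference` are hypothetical.

```python
import math

def build_reference_model(frames):
    """Reduce per-frame feature vectors to one parameter vector.

    Assumption: the reference model is the per-dimension mean of the frames.
    """
    n = len(frames)
    return [sum(frame[i] for frame in frames) / n for i in range(len(frames[0]))]

def difference(frames, model):
    """Mean Euclidean distance between each feature frame and the stored parameters.

    Assumption: a smaller value means the new word is more similar to the stored word.
    """
    return sum(math.dist(frame, model) for frame in frames) / len(frames)

# Parameters of a previously stored vocabulary word (made-up numbers).
stored_model = build_reference_model([[1.0, 2.0], [3.0, 4.0]])

# Feature signals of two potential vocabulary words.
close = difference([[2.0, 3.0], [2.0, 3.0]], stored_model)
far = difference([[9.0, 9.0], [10.0, 10.0]], stored_model)
```

In this sketch `close` is much smaller than `far`, so the first candidate would be the one flagged as potentially confusing with the stored word.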
11 Claims
1. A speaker-trained speech recognizer for selecting words to be added to a memory of said recognizer, said recognizer comprising
means for extracting a plurality of feature signals from a received word utterance from a speaker,
means for generating a plurality of parameters derived from said plurality of feature signals,
means for comparing a plurality of feature signals of a first word utterance extracted by said extracting means against a plurality of stored parameters of a different previously received word utterance using a predetermined criteria, an output of said comparing means being determined by the difference between said plurality of feature signals of said first word utterance and the plurality of stored parameters of said previously received word utterance,
means for signaling said speaker to utter into said recognizer a second word utterance, which is different from said first word utterance, when the output of said comparing means is less than a predetermined value indicating a similarity between said feature signals of said first word utterance and said previously received word utterance parameters, and
means for storing a plurality of parameters of said first word utterance as another different word utterance in said memory when the output of said comparing means indicates that said difference is not less than said predetermined value.
11. A method of operating a speaker-trained speech recognizer comprising the steps of
extracting a plurality of feature signals from a received word utterance from a speaker,
generating a plurality of parameters derived from said plurality of feature signals,
comparing a plurality of feature signals of a first word utterance extracted by said extracting step against a plurality of stored parameters of a different previously received word utterance using a predetermined criteria, an output of said comparing being determined by the difference between said plurality of feature signals of said first word utterance and the plurality of stored parameters of said previously received different word utterance,
signaling said speaker to utter into said recognizer a second word utterance, which is different from said first word utterance, when the output of said comparing step is less than a predetermined value indicating a similarity between said feature signals of said first word utterance and said previously received word utterance parameters, and
storing a plurality of parameters of said first word utterance as another different word utterance in said memory when the output of said comparing step indicates that said difference is not less than said predetermined value.
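The signal-or-store decision recited in claim 11 can be sketched as a single function. This is only an illustration under stated assumptions, not the patented implementation: `try_add_word`, `abs_difference`, the dictionary-based `memory`, and the numeric values are all hypothetical, and the stored "parameters" are simply a copy of the feature values. The one element taken directly from the claim is the rule itself: a comparison output less than the predetermined value means the speaker is asked for a different word; otherwise the word is stored.

```python
def try_add_word(label, features, memory, threshold, difference):
    """Decide whether a candidate word may join the vocabulary.

    Returns ("say_another_word", conflicting_label) when the candidate is too
    close to a stored word, or ("stored", None) after adding it to memory.
    """
    for stored_label, stored_params in memory.items():
        # Output below the predetermined value indicates confusing similarity.
        if difference(features, stored_params) < threshold:
            return ("say_another_word", stored_label)
    # Assumption: the stored parameters are just a copy of the features.
    memory[label] = list(features)
    return ("stored", None)

def abs_difference(features, params):
    """Hypothetical comparison criterion: sum of absolute differences."""
    return sum(abs(f - p) for f, p in zip(features, params))

memory = {}
first = try_add_word("call", [1.0, 5.0], memory, 1.0, abs_difference)
second = try_add_word("tall", [1.2, 5.1], memory, 1.0, abs_difference)
third = try_add_word("dial", [4.0, 0.5], memory, 1.0, abs_difference)
```

With these made-up values, "call" is stored, "tall" is rejected because it scores below the threshold against "call", and "dial" is stored as a sufficiently different word.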
Specification