Word recognizer
First Claim
Patent Images
1. Apparatus for recognizing an input word as one of a set of reference words comprisingmeans for storing a plurality of reference word feature templates representative of said reference words;
- means responsive to said input word for generating an input word feature template;
means responsive to said reference word feature templates and said input word feature template for identifying said input word as one of said reference words, characterized in thatsaid input word identifying means (100) comprisesmeans (105,200) responsive to said reference word feature templates and said input word feature template for generating a set of word distance signals;
said reference word feature templates and said input word feature template each have a plurality of frames;
said means for generating a set of word distance signals comprises means responsive to said reference word feature templates and said input word feature template for generating a set of frame distance signals representative of the similarity between frames of said reference word feature templates and said input word feature template, and means for combining said frame distance signals to produce said word distance signals;
means (105, 200,
500) responsive to said reference word feature templates and said input word feature template for generating a set of weighted word distance signals; and
means (300,
400) responsive to said word distance signals and said weighted word distance signals for selecting the reference word which most corresponds to said input word.
1 Assignment
0 Petitions
Accused Products
Abstract
An input word is recognized as one of a set of reference words. A set of word distance signals representative of the correspondence of the input word to the reference words is generated. A set of weighted word distance signals is also generated. Responsive to the word distance signals and the weighted word distance signals, the reference word that most closely corresponds to the input word is selected.
51 Citations
16 Claims
-
1. Apparatus for recognizing an input word as one of a set of reference words comprising
means for storing a plurality of reference word feature templates representative of said reference words; -
means responsive to said input word for generating an input word feature template; means responsive to said reference word feature templates and said input word feature template for identifying said input word as one of said reference words, characterized in that said input word identifying means (100) comprises means (105,200) responsive to said reference word feature templates and said input word feature template for generating a set of word distance signals; said reference word feature templates and said input word feature template each have a plurality of frames; said means for generating a set of word distance signals comprises means responsive to said reference word feature templates and said input word feature template for generating a set of frame distance signals representative of the similarity between frames of said reference word feature templates and said input word feature template, and means for combining said frame distance signals to produce said word distance signals; means (105, 200,
500) responsive to said reference word feature templates and said input word feature template for generating a set of weighted word distance signals; andmeans (300,
400) responsive to said word distance signals and said weighted word distance signals for selecting the reference word which most corresponds to said input word. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A method for recognizing an input word as one of a set of reference words comprising the steps of
storing a plurality of reference word feature templates representative of said reference words; -
generating an input word feature template responsive to said input word; identifying said input word as one of said reference words responsive to said reference word feature templates and said input word feature template, characterized in that said input word identifying step comprises generating a set of word distance signals responsive to said reference word feature templates and said input word feature template; said reference word feature templates and said input word feature template each have a plurality of frames; said step for generating a set of word distance signals comprises generating a set of frame distance signals representative of the similarity between frames of said reference word feature templates and said input word feature template responsive to said reference word feature templates and said input word feature template, and combining said frame distance signals to produce said word distance signals; generating a set of weighted word distance signals responsive to said reference word feature templates and said input word feature template; and selecting the reference word which most corresponds to said input word responsive to said word distance signals and said weighted word distance signals. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. Apparatus for recognizing a spoken input word as one of a set of reference words comprising
means for storing a plurality of reference word feature templates representative of acoustic characteristics of said reference words; - means responsive to said input word for generating an input word feature template representative of acoustic characteristics of said input word;
means responsive to said reference word feature templates and said input word feature template for identifying said input word as one of said reference words;
said input word identifying means including means responsive to said reference word feature templates and said input word feature templates for generating a set of word distance signals;
said reference word feature templates and said input word feature template each have a plurality of frames;
said reference words for which feature templates are stored belong to a predetermined set of equivalence classes, each class being representative of reference words which are within a prescribed degree of similarity;said means for generating a set of word distance signals comprises means responsive to said reference word feature templates and said input word feature template for generating a set of frame distance signals representative of the similarity between frames of said reference word feature templates and said input word feature template, and means for summing said frame distance signals to product said word distance signals; means responsive to said reference word feature templates and said input word feature template for generating a set of weighted word distance signals;
said means for generating a set of weighted word distance signals comprises means for storing a plurality of weight templates representative of the expected similarity between frames of reference word feature templates for reference words which belong to the same equivalence class, means responsive to said frame distance signals and said weight templates for generating weighted frame distance signals, and means for summing said weighted frame distance signals to produce said weighted word distance signals; andmeans responsive to said word distance signals and said weighted word distance signals for selecting the reference word which most corresponds to said input word. - View Dependent Claims (14)
- means responsive to said input word for generating an input word feature template representative of acoustic characteristics of said input word;
-
15. A method for recognizing a spoken input word as one of a set of reference words comprising
storing a plurality of reference word feature templates representative of acoustic characteristics of said reference words; - generating an input word feature template representative of acoustic characteristics of said input word responsive to said input word;
identifying said input word as one of said reference words responsive to said reference word feature templates and said input word feature template;said input word identifying step comprises generating a set of word distance signals responsive to said reference word feature templates and said input word feature template;
said reference word feature templates and said input word feature template each have a plurality of frames;
said reference words for which feature templates are stored belong to a predetermined set of equivalence classes, each class being representative of reference words which are within a prescribed degree of similarity;said step for generating a set of word distance signals comprises generating a set of frame distance signals representative of the similarity between frames of said reference word feature templates and said input word feature template responsive to said reference word feature templates and said input word feature template, and summing said frame distance signals to produce said word distance signals; generating a set of weighted word distance signals responsive to said reference word feature templates and said input word feature template;
said step for generating a set of weighted word distance signals comprises storing a plurality of weight templates representative of the expected similarity between frames of reference word feature templates for reference words which belong to the same equivalence class, generating weighted frame distance signals responsive to said frame distance signals and said weight templates, and summing said weighted frame distance signals to produce said weighted word distance signals; andselecting the reference word which most corresponds to said input word responsive to said word distance signals and said weighted word distance signals. - View Dependent Claims (16)
- generating an input word feature template representative of acoustic characteristics of said input word responsive to said input word;
Specification