Word boundary detector for speech recognition equipment
First Claim
1. In an apparatus which receives acoustic input, said input including words spoken in isolation, and performs recognition functions on said words, said apparatus including means for generating feature signals indicative of feature characteristics in the received input and means for comparing the feature signals which occur during determined time boundaries with stored features corresponding to words in a vocabulary;
- an improved system for detecting word boundaries, comprising;
(a) means responsive to said input for generating a first feature signal indicative of the presence of speech-like sounds which meet a first selection criterion;
(b) means for storing the feature signals which occur during the presence of said first feature signal;
(c) means responsive to said input for generating a second feature signal indicative of the presence of speech-like sounds which meet a second selection criterion; and
(d) means for determining the substantially last occurrence of said second feature signal among the stored feature signals, the end boundary of an input spoken word being determined as a function of said last occurrence.
Abstract
The present invention pertains to an apparatus which receives acoustic input, the input including words spoken in isolation, finds the word boundary instants at which a word begins and ends, and performs recognition functions on the words. A feature of the invention is the compensation for breath noise after the true end of a word, using a variable backup of the estimated word end. The apparatus includes means for generating feature signals indicative of feature characteristics in the received input and further includes means for comparing the feature signals which occurred during determined time boundaries with stored features corresponding to words in a vocabulary. The invention is directed to an improved system for detecting word boundaries which includes a means responsive to the input for generating a first feature signal indicative of the substantially continuing presence of speech-like sounds which meet a first selection criterion. Means are provided for storing the feature signals which occur during the presence of this first feature signal. Further means, responsive to the input, are provided for generating a second feature signal indicative of the presence of speech-like sounds which meet a second selection criterion, this second selection criterion being more restrictive than the first selection criterion. Means are also provided for determining the last occurrence of the second feature signal among the stored feature signals. The end boundary of an input spoken word is determined as a function of this last occurrence.
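The end-boundary logic described in the abstract can be illustrated with a minimal sketch. It assumes the two feature signals are already available as per-frame booleans; the function name, the frame-index representation, and the choice to take the first stored frame as the word start are assumptions made for illustration, not details taken from the patent.

```python
from typing import List, Optional, Sequence, Tuple


def detect_word_boundaries(
    first_signal: Sequence[bool],
    second_signal: Sequence[bool],
) -> Optional[Tuple[int, int]]:
    """Return (start_frame, end_frame) for one isolated word, or None.

    first_signal[i]  -- frame i meets the first (looser) criterion
    second_signal[i] -- frame i meets the second (more restrictive) criterion
    """
    # Store frames for as long as the first feature signal is present.
    stored: List[int] = []
    for frame, present in enumerate(first_signal):
        if present:
            stored.append(frame)
        elif stored:
            break  # first signal has dropped: the candidate word is over

    # Back the end up to the last stored frame that also meets the
    # more restrictive criterion (trailing breath noise fails it).
    strict_frames = [f for f in stored if second_signal[f]]
    if not strict_frames:
        return None
    # Assumption: the word start is simply the first stored frame.
    return stored[0], strict_frames[-1]


# Frames 4-6 stand in for breath noise: they pass the loose criterion only.
first = [False, True, True, True, True, True, True, False]
second = [False, True, True, True, False, False, False, False]
print(detect_word_boundaries(first, second))  # (1, 3)
```

Because the end is taken from the last frame passing the stricter test rather than from the point where the looser signal drops, the amount of backup varies with how long the trailing noise lasts.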
11 Claims
1. In an apparatus which receives acoustic input, said input including words spoken in isolation, and performs recognition functions on said words, said apparatus including means for generating feature signals indicative of feature characteristics in the received input and means for comparing the feature signals which occur during determined time boundaries with stored features corresponding to words in a vocabulary;
- an improved system for detecting word boundaries, comprising;
(a) means responsive to said input for generating a first feature signal indicative of the presence of speech-like sounds which meet a first selection criterion;
(b) means for storing the feature signals which occur during the presence of said first feature signal;
(c) means responsive to said input for generating a second feature signal indicative of the presence of speech-like sounds which meet a second selection criterion; and
(d) means for determining the substantially last occurrence of said second feature signal among the stored feature signals, the end boundary of an input spoken word being determined as a function of said last occurrence.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
9. In an apparatus which receives acoustic input, said input including words spoken in isolation, and performs recognition functions on said words, said apparatus including means for generating feature signals indicative of feature characteristics in the received input, and means for comparing the feature signals which occur during determined time boundaries with stored features corresponding to words in a vocabulary;
- an improved system for detecting word boundaries, comprising;
(a) means responsive to said input for generating a first feature signal indicative of the presence of speech-like sounds which meet a predetermined energy threshold criterion;
(b) means for storing the feature signals which occur during the presence of said first feature signal;
(c) means responsive to said input for generating a third feature signal indicative of the presence of a voiced phonetic characteristic in said input;
(d) means responsive to said input for generating a fourth feature signal indicative of the presence of an unvoiced noise-like consonant in said input;
(e) means for generating a second feature signal as a function of said third and fourth feature signals; and
(f) means for determining the substantially last occurrence of said second feature signal among the stored feature signals, the end boundary of an input spoken word being a function of said last occurrence.
Dependent claims: 10
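As a rough illustration of how the signals in claim 9 relate, the sketch below derives the first feature signal from an energy threshold and forms the second feature signal from the voiced (third) and unvoiced noise-like consonant (fourth) signals. The claim only says the second signal is "a function of" the third and fourth; the logical OR used here, the function names, and the numeric values are assumptions chosen for illustration.

```python
from typing import List, Optional, Sequence


def first_feature_signal(energy: Sequence[float], threshold: float) -> List[bool]:
    """Frames whose energy meets the (assumed) predetermined threshold."""
    return [e >= threshold for e in energy]


def second_feature_signal(
    voiced: Sequence[bool], unvoiced_noise: Sequence[bool]
) -> List[bool]:
    """Assumed combination: a frame qualifies if it is voiced OR is an
    unvoiced noise-like consonant."""
    return [v or u for v, u in zip(voiced, unvoiced_noise)]


def last_occurrence(first: Sequence[bool], second: Sequence[bool]) -> Optional[int]:
    """Last frame, among those stored while `first` is present, where
    `second` also occurs; None if there is no such frame."""
    hits = [i for i, (f, s) in enumerate(zip(first, second)) if f and s]
    return hits[-1] if hits else None


# Hypothetical frames: voiced segment, then a noise-like consonant,
# then low-level breath noise that still clears the energy threshold.
energy = [0.1, 0.9, 0.8, 0.7, 0.5, 0.4, 0.1]
voiced = [False, True, True, False, False, False, False]
unvoiced = [False, False, False, True, False, False, False]

first = first_feature_signal(energy, threshold=0.3)
second = second_feature_signal(voiced, unvoiced)
print(last_occurrence(first, second))  # 3
```

With these inputs the energy criterion alone would place the word end at frame 5, while the combined signal backs it up to frame 3, the final consonant.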
11. In conjunction with an apparatus which receives acoustic input that includes spoken words in isolation and performs recognition functions on said words, the apparatus generating signals indicative of feature characteristics in the received input and comparing signals which occur during determined time boundaries with stored features corresponding to words in a vocabulary;
- a method for detecting word boundaries, comprising the steps of;
(a) generating a first feature signal indicative of the presence of speech-like sounds which meet a first selection criterion;
(b) storing the feature signals which occur during the presence of said first feature signal;
(c) generating a second feature signal indicative of the presence of speech-like sounds which meet a second more restrictive selection criterion; and
(d) determining the substantially last occurrence of the second feature signal among the stored feature signals, the end boundary of an input spoken word being a function of said last occurrence.
Specification