Speech recognition system allows new vocabulary words to be added without requiring spoken samples of the words
Abstract
A speech recognition method implemented in a computer system recognizes words without requiring prior creation of models for such words based on spoken entries. A key word is entered in nonspoken form, and a string of phonemes is defined by the speech recognizer to represent the new key word. A response signal is generated from each phoneme in the new key word model. These response signals are used to define a multidimensional validity field for the new key word. Upon receipt of a spoken word from a user, a string of phonemes is assigned to represent the spoken word. A response signal from each phoneme in the model used to represent the spoken word is compared with the validity field previously defined for the corresponding key word. The spoken word is determined to be valid or invalid according to whether the response signals representing it lie within the validity field.
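The enrollment idea in the abstract can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration and is not taken from the patent: the `LEXICON` text-to-phoneme table, the `NOMINAL_RESPONSE` values, the `margin` parameter, and the choice to model the multidimensional validity field as one interval per phoneme.

```python
from dataclasses import dataclass

# Hypothetical text-to-phoneme table (assumption, not from the patent).
LEXICON = {
    "stop": ["s", "t", "aa", "p"],
    "go": ["g", "ow"],
}

# Hypothetical nominal response of each phoneme model (assumption).
NOMINAL_RESPONSE = {
    "s": 0.8, "t": 0.6, "aa": 0.9, "p": 0.5, "g": 0.7, "ow": 0.85,
}


@dataclass
class KeyWordModel:
    word: str
    phonemes: list
    validity_field: list  # one (low, high) interval per phoneme


def add_key_word(word, margin=0.2):
    """Build a key word model from text alone; no spoken sample is used."""
    phonemes = LEXICON[word]
    # One dimension of the validity field per phoneme in the model.
    field = [
        (NOMINAL_RESPONSE[p] - margin, NOMINAL_RESPONSE[p] + margin)
        for p in phonemes
    ]
    return KeyWordModel(word, phonemes, field)


def is_valid(model, responses):
    """Return True if every per-phoneme response lies inside the field."""
    if len(responses) != len(model.phonemes):
        return False
    return all(
        low <= r <= high
        for r, (low, high) in zip(responses, model.validity_field)
    )
```

For example, `is_valid(add_key_word("stop"), [0.75, 0.65, 0.95, 0.45])` accepts the word, while a response vector whose second component falls to 0.1 is rejected. A real recognizer would derive the responses from acoustic phoneme models rather than from a fixed table.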
18 Claims
1. In a computer system, a speech recognition method comprising the steps of:
a) receiving a user spoken word (USW);
b) generating score parameters for each of a plurality of first phoneme strings by comparing output values of each against the USW;
c) selecting one of the first phoneme strings having a best correlation to the USW based on said score parameters, said one phoneme string corresponding to a first word in a stored database;
d) generating a decision field having a first region that contains a first set of response signals and a second region that contains a second set of response signals, said first set of response signals including response signals obtained by exciting said one phoneme string, said second set of response signals obtained by exciting a second string of phonemes that differs from said one phoneme string;
e) generating a third response signal based on exciting said one phoneme string with the USW;
f) determining whether said USW is a valid input of the first word based on a comparison of said third response signal to said decision field, said USW comprising a valid input of the first word if said third response signal is within said first region and an invalid input of the first word if said third response signal is within said second region.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
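Steps a) through f) of claim 1 can be sketched as a toy pipeline. This is an illustration under stated assumptions, not the patented implementation: the claim does not define how a phoneme string is "excited", so `excite`, the 2-D `PROTOTYPES`, the one-frame-per-phoneme simplification, the rotated rival string, and the nearest-centroid region test are all hypothetical choices.

```python
import math

# Hypothetical 2-D prototype feature per phoneme (assumption).
PROTOTYPES = {
    "s": (1.0, 0.0), "t": (0.0, 1.0), "aa": (1.0, 1.0), "p": (0.0, 0.0),
    "g": (2.0, 0.0), "ow": (2.0, 2.0),
}

# Stored database: word -> phoneme string (assumption).
DATABASE = {
    "stop": ["s", "t", "aa", "p"],
    "top": ["t", "aa", "p"],
    "go": ["g", "ow"],
}


def excite(phonemes, frames):
    """Hypothetical response signal: (mean, worst) per-phoneme similarity.

    Simplification: exactly one feature frame per phoneme.
    """
    if len(phonemes) != len(frames):
        return (0.0, 0.0)
    sims = [math.exp(-math.dist(PROTOTYPES[p], f))
            for p, f in zip(phonemes, frames)]
    return (sum(sims) / len(sims), min(sims))


def centroid(points):
    """Component-wise mean of a list of equal-length tuples."""
    return tuple(sum(c) / len(points) for c in zip(*points))


def recognize(frames):
    """a) `frames` stands in for the received user spoken word (USW)."""
    # b) score every stored phoneme string against the USW
    scores = {w: excite(ph, frames)[0] for w, ph in DATABASE.items()}
    # c) select the string with the best correlation
    best_word = max(scores, key=scores.get)
    best = DATABASE[best_word]
    # d) decision field: excite the selected string and a differing
    #    second string with slightly perturbed reference frames
    offsets = [(-0.1, 0.0), (0.1, 0.0), (0.0, 0.1)]
    refs = [[(PROTOTYPES[p][0] + dx, PROTOTYPES[p][1] + dy) for p in best]
            for dx, dy in offsets]
    second = best[1:] + best[:1]  # rotated string; assumed to differ
    region1 = centroid([excite(best, r) for r in refs])
    region2 = centroid([excite(second, r) for r in refs])
    # e) third response signal: selected string excited with the USW
    third = excite(best, frames)
    # f) valid if the third response falls nearer the first region
    valid = math.dist(third, region1) < math.dist(third, region2)
    return best_word, valid
```

With these toy values, frames that match the prototypes of "go" are selected as "go" and accepted, while two frames far from every prototype are still assigned to "go" in step c) but rejected in step f). That separation between selection and verification is the point the claim makes: the best-scoring word is not automatically a valid one.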
10. A speech recognition system comprising:
a) means for receiving a user spoken word (USW);
b) means for generating score parameters for each of a plurality of first phoneme strings by comparing output values of each against the USW;
c) means for selecting one of the first phoneme strings having a best correlation to the USW based on said score parameters, said one phoneme string corresponding to a first word in a stored database;
d) means for generating a decision field having a first region that contains a first set of response signals and a second region that contains a second set of response signals, said first set of response signals including response signals obtained by exciting said one phoneme string, said second set of response signals obtained by exciting a second string of phonemes that differs from said one phoneme string;
e) means for generating a third response signal based on exciting said one phoneme string with the USW;
f) means for determining whether said USW is a valid input of the first word based on a comparison of said third response signal to said decision field, said USW comprising a valid input of the first word if said third response signal is within said first region and an invalid input of the first word if said third response signal is within said second region.
Dependent claims: 11, 12, 13, 14, 15, 16, 17, 18
Specification