Method for speech recognition using partitioned vocabulary
First Claim
Patent Images
1. A method for recognizing a spoken input using a predefinable vocabulary, comprising:
- organizing the predefinable vocabulary, prior to receiving the spoken input, based on distance measures of phonetic similarity between pairs of words in the predefinable vocabulary byobtaining ranking values of each pair of words in the predefinable vocabulary as possible matches for test utterances independent of the pair of words,averaging, for each pair of words in the predefinable vocabulary, differences between the ranking values for all of the test utterances, to obtain the distance measures;
storing, on a storage medium, the predefinable vocabulary subdivided into sections of phonetically similar words based on the distance measures and using a vector quantization algorithm; and
characterizing each of the sections of the phonetically similar words by a representative entry;
assigning the spoken input to one of the sections for which the representative entry is most similar to the spoken input; and
identifying a closest match for the spoken input among the phonetically similar words in the one of the sections that has been assigned to the spoken input.
1 Assignment
0 Petitions
Accused Products
Abstract
A is recognized using a predefinable vocabulary that is partitioned in sections of phonetically similar words. In a recognition process, first oral input is associated with one of the sections, then the oral input is determined from the vocabulary of the associated section.
13 Citations
6 Claims
-
1. A method for recognizing a spoken input using a predefinable vocabulary, comprising:
-
organizing the predefinable vocabulary, prior to receiving the spoken input, based on distance measures of phonetic similarity between pairs of words in the predefinable vocabulary by obtaining ranking values of each pair of words in the predefinable vocabulary as possible matches for test utterances independent of the pair of words, averaging, for each pair of words in the predefinable vocabulary, differences between the ranking values for all of the test utterances, to obtain the distance measures; storing, on a storage medium, the predefinable vocabulary subdivided into sections of phonetically similar words based on the distance measures and using a vector quantization algorithm; and characterizing each of the sections of the phonetically similar words by a representative entry; assigning the spoken input to one of the sections for which the representative entry is most similar to the spoken input; and identifying a closest match for the spoken input among the phonetically similar words in the one of the sections that has been assigned to the spoken input. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A computer readable medium encoding a computer program which when executed by a processor causes the processor to perform a method comprising:
-
organizing the predefinable vocabulary, prior to receiving the spoken input, based on a distance measure for phonetic similarity between pairs of words in the predefinable vocabulary by obtaining ranking values of each pair of words in the predefinable vocabulary as possible matches for test utterances independent of the pair of words, and averaging, for each pair of words in the predefinable vocabulary, differences between the ranking values of the pairs of words for the test utterances to obtain the distance measures; storing, on a storage medium, the predefinable vocabulary subdivided into sections of phonetically similar words based on the distance measures and using a vector quantization algorithm; and characterizing each of the sections of the phonetically similar words by a representative entry; assigning the spoken input to one of the sections for which the representative entry is most similar to the spoken input; and identifying a closest match for the spoken input among the phonetically similar words in the one of the sections that has been assigned to the spoken input.
-
Specification