Orthogonal classification of words in multichannel speech recognizers
First Claim
1. A computerized method for distribution among a plurality of dictionaries of a target vocabulary including a plurality of words for use in a speech recognition application installed in a computer system, wherein each word of said target vocabulary is found in only one of the dictionaries, wherein the vocabulary and the dictionaries are stored in memory operatively attached to a computer system, the method comprising the steps of:
- (a) first categorizing the words based on phonetic length, thereby distributing the words into a plurality of first groups each of equal phonetic length(b) second categorizing said first groups based on combinations of vowel sounds, thereby placing the words of said first groups into a plurality of second groups each of identical vowel sounds;
(c) third categorizing the words of said second groups based on the consonants of the words of said second groups and placement of said consonants relative to said vowel sounds, thereby distributing the words into a plurality of third groups,(c) comparing pairwise phonetic distance between the words within each of said third groups thereby placing the words of said third groups into fourth groups of minimal phonetic distance; and
(d) distributing the words of said fourth groups into the multiple dictionaries.
1 Assignment
0 Petitions
Accused Products
Abstract
A computerized method for distribution among a multiple dictionaries of a target vocabulary. The vocabulary includes words for use in a speech recognition application installed in a computer system. Each word of the target vocabulary is found in only one of the dictionaries. The words are first categorized based on phonetic length, and distributed into multiple groups each of equal phonetic length. The first groups are secondly categorized based on combinations of vowel sounds. The words of the first groups are placed into second groups accordingly based on having identical vowel sounds. The second groups are thirdly categorized into third groups based on the consonants of the words of the second groups and placement of the consonants relative to the vowel sounds. The words within each of the third groups are compared in pairs for phonetic distance and the words of minimal pairwise phonetic distance between them are placed in fourth groups. The words of each of the fourth groups are distributed into the multiple dictionaries, preferably with no more than one member per fourth group distributed into each of the dictionaries. The multiple dictionaries are preferably mutually orthogonal, that is each of the dictionaries includes words of maximal phonetic distance from each other.
-
Citations
14 Claims
-
1. A computerized method for distribution among a plurality of dictionaries of a target vocabulary including a plurality of words for use in a speech recognition application installed in a computer system, wherein each word of said target vocabulary is found in only one of the dictionaries, wherein the vocabulary and the dictionaries are stored in memory operatively attached to a computer system, the method comprising the steps of:
-
(a) first categorizing the words based on phonetic length, thereby distributing the words into a plurality of first groups each of equal phonetic length (b) second categorizing said first groups based on combinations of vowel sounds, thereby placing the words of said first groups into a plurality of second groups each of identical vowel sounds; (c) third categorizing the words of said second groups based on the consonants of the words of said second groups and placement of said consonants relative to said vowel sounds, thereby distributing the words into a plurality of third groups, (c) comparing pairwise phonetic distance between the words within each of said third groups thereby placing the words of said third groups into fourth groups of minimal phonetic distance; and (d) distributing the words of said fourth groups into the multiple dictionaries. - View Dependent Claims (2, 3, 4, 5, 14)
-
-
6. A computerized method for distribution among a plurality of dictionaries of a target vocabulary including a plurality of words for use in a speech recognition application installed in a computer system, wherein each word of said target vocabulary is found in only one of the dictionaries, wherein the vocabulary and the dictionaries are stored in memory operatively attached to a computer system, the method comprising the steps of:
-
(a) comparing pairwise phonetic distance between the words thereby placing the words of into groups of minimal phonetic distance;
wherein said pairwise comparing is performed by at least one of the steps consisting of;
(i) comparing pairwise formants of the vowel sounds of the words, (ii) comparing an anatomical part most responsible for forming respective sounds; and
(iii) comparing empirically substitution of the words using a speech recognition engine;(b) distributing the words of said groups into the multiple dictionaries; and (c) processing an audio signal using multiple speech recognition engines, each engine referring to one of the dictionaries. - View Dependent Claims (7, 8, 9, 10, 11, 12)
-
-
13. A computer readable medium readable by a machine, tangibly embodying a program of instructions executable by the machine to perform a computerized method for distribution among a plurality of dictionaries of a target vocabulary including a plurality of words for use in a speech recognition application installed in a computer system, wherein each word of said target vocabulary is found in only one of the dictionaries, the method comprising the steps of:
-
(a) first categorizing the words based on phonetic length, thereby distributing the words into a plurality of first groups each of equal phonetic length (b) second categorizing said first groups based on combinations of vowel sounds, thereby placing the words of said first groups into a plurality of second groups each of identical vowel sounds; (c) third categorizing the words of said second groups based on the consonants of the words of said second groups and placement of said consonants relative to said vowel sounds, thereby distributing the words into a plurality of third groups, (c) comparing pairwise phonetic distance between the words within each of said third groups thereby placing the words of said third groups into fourth groups of minimal phonetic distance; and (d) distributing the words of said fourth groups into the multiple dictionaries.
-
Specification