Apparatus and methods for identifying homophones among words in a speech recognition system
Abstract
A method of identifying homophones of a word uttered by a user from at least a portion of existing words of a vocabulary of a speech recognition engine comprises the steps of: a user uttering the word; decoding the uttered word; computing respective measures between the decoded word and at least a portion of the other existing vocabulary words, the respective measures indicative of acoustic similarity between the word and the at least a portion of other existing words; if at least one measure is within a threshold range, indicating, to the user, results associated with the at least one measure, the results preferably including the decoded word and the other existing vocabulary word associated with the at least one measure; and the user preferably making a selection depending on the word the user intended to utter.
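The flow summarized in the abstract (score the decoded word against other vocabulary words, keep those whose measure falls within a threshold range, and offer them to the user) can be sketched as follows. This is only an illustration under stated assumptions: the phone-tuple pronunciations, the `toy_distance` function, and the threshold value 0.34 are hypothetical stand-ins and are not taken from the patent, which does not fix a particular measure at this point.

```python
from typing import Callable, Dict, List, Tuple

Pronunciation = Tuple[str, ...]


def toy_distance(a: Pronunciation, b: Pronunciation) -> float:
    """Hypothetical stand-in for an acoustic measure: the fraction of
    positions that differ, with any length difference counted as mismatches."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 0.0
    mismatches = sum(x != y for x, y in zip(a, b)) + abs(len(a) - len(b))
    return mismatches / longest


def find_homophones(
    decoded_word: str,
    vocabulary: Dict[str, Pronunciation],
    distance: Callable[[Pronunciation, Pronunciation], float] = toy_distance,
    threshold: float = 0.34,  # assumed upper bound of the threshold range
) -> List[Tuple[str, float]]:
    """Return the other vocabulary words whose measure lies in [0, threshold],
    closest first, so that a user could pick the intended word."""
    target = vocabulary[decoded_word]
    scored = [
        (word, distance(target, pron))
        for word, pron in vocabulary.items()
        if word != decoded_word
    ]
    hits = [(word, d) for word, d in scored if 0.0 <= d <= threshold]
    return sorted(hits, key=lambda item: item[1])


# Toy vocabulary with made-up phone strings.
VOCAB: Dict[str, Pronunciation] = {
    "write": ("R", "AY", "T"),
    "right": ("R", "AY", "T"),
    "rite": ("R", "AY", "T"),
    "ride": ("R", "AY", "D"),
    "road": ("R", "OW", "D"),
}

candidates = find_homophones("write", VOCAB)
for word, d in candidates:
    print(f"{word}: {d:.2f}")
```

With this toy data, "right", "rite" and "ride" fall inside the assumed range and "road" does not. In a real system the measure would come from the recognizer's acoustic models rather than from symbol mismatches.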
102 Citations
39 Claims
1. A method of identifying homophones of a word uttered by a user from at least a portion of existing words of a vocabulary of a speech recognition engine, the method comprising the steps of:
decoding the uttered word using the speech recognition engine to yield a decoded word;
computing respective measures between the decoded word and at least a portion of the other existing vocabulary words, the respective measures indicative of acoustic similarity between the word and the other existing words;
identifying the other existing words, associated with measures which correspond to a threshold range, as homophones of the uttered word; and
outputting the identified homophones, wherein the user can select an identified homophone that corresponds to the word uttered by the user.
(Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
(a) comparing a leaf sequence associated with the decoded word to respective leaf sequences associated with the at least a portion of other existing words; and
(b) generating respective distance measures in response to the comparisons, the respective distance measures indicative of acoustic distances between the compared leaf sequences.
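Steps (a) and (b) can be illustrated with a small sketch. It assumes, purely for illustration, that each leaf is identified by an integer, that a table of pairwise leaf distances is already available, and that the compared sequences have equal length. The names `leaf_distance`, `sequence_distance`, `distances_to_vocabulary` and all numeric values are hypothetical and do not come from the claims.

```python
from typing import Dict, Sequence, Tuple

LeafId = int
LeafTable = Dict[Tuple[LeafId, LeafId], float]


def leaf_distance(a: LeafId, b: LeafId, table: LeafTable) -> float:
    """Look up the distance between two leaves; identical leaves are 0 apart.
    The table stores each unordered pair once, keyed as (smaller, larger)."""
    if a == b:
        return 0.0
    return table[(min(a, b), max(a, b))]


def sequence_distance(
    seq_a: Sequence[LeafId], seq_b: Sequence[LeafId], table: LeafTable
) -> float:
    """Step (a) and (b) for one pair: compare position by position and
    accumulate the per-leaf distances into a single measure."""
    if len(seq_a) != len(seq_b):
        raise ValueError("this sketch only handles equal-length sequences")
    return sum(leaf_distance(x, y, table) for x, y in zip(seq_a, seq_b))


def distances_to_vocabulary(
    decoded_leaves: Sequence[LeafId],
    vocab_leaves: Dict[str, Sequence[LeafId]],
    table: LeafTable,
) -> Dict[str, float]:
    """Compare the decoded word's leaf sequence with every other word's."""
    return {
        word: sequence_distance(decoded_leaves, leaves, table)
        for word, leaves in vocab_leaves.items()
    }


# Made-up leaf distances and leaf sequences.
TABLE: LeafTable = {(1, 2): 0.5, (1, 3): 2.0, (2, 3): 1.5}
OTHER_WORDS = {"a": [1, 2, 3], "b": [2, 2, 3], "c": [3, 2, 1]}

print(distances_to_vocabulary([1, 2, 3], OTHER_WORDS, TABLE))
```

A precomputed table keeps the per-word comparison cheap, which matters when the decoded word is scored against a large portion of the vocabulary.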
13. The method of claim 12, wherein the leaf sequence comparison step further comprises performing a best match alignment process between leaf sequences of unequal phonetic length.
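The claim does not say how the best match alignment is computed. One common way to align sequences of unequal length is dynamic programming in the style of edit distance, sketched below. The gap penalty, the `pair_cost` function, and the phone symbols standing in for leaves are assumptions made for this example, not details from the patent.

```python
from typing import Callable, Sequence


def aligned_distance(
    seq_a: Sequence[str],
    seq_b: Sequence[str],
    pair_cost: Callable[[str, str], float],
    gap_cost: float = 1.0,  # assumed penalty for leaving an element unmatched
) -> float:
    """Minimum total cost over all monotonic alignments of the two sequences.
    Each step either pairs two elements or skips one element at gap_cost."""
    rows, cols = len(seq_a) + 1, len(seq_b) + 1
    cost = [[0.0] * cols for _ in range(rows)]
    for i in range(1, rows):
        cost[i][0] = i * gap_cost
    for j in range(1, cols):
        cost[0][j] = j * gap_cost
    for i in range(1, rows):
        for j in range(1, cols):
            cost[i][j] = min(
                cost[i - 1][j - 1] + pair_cost(seq_a[i - 1], seq_b[j - 1]),
                cost[i - 1][j] + gap_cost,  # skip an element of seq_a
                cost[i][j - 1] + gap_cost,  # skip an element of seq_b
            )
    return cost[-1][-1]


def toy_pair_cost(a: str, b: str) -> float:
    """Hypothetical per-element cost: 0 for a match, 1.5 for a mismatch."""
    return 0.0 if a == b else 1.5


print(aligned_distance(["R", "AY", "T"], ["R", "AY", "T", "S"], toy_pair_cost))
print(aligned_distance(["R", "AY", "T"], ["R", "AY", "D"], toy_pair_cost))
```

In the first call the shorter sequence aligns fully and the extra element costs one gap. In the second, substituting the final element is cheaper than two gaps. A real system would replace `toy_pair_cost` with an acoustic distance between leaves.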
14. Computer-based apparatus for identifying homophones of a word uttered by a user from at least a portion of a vocabulary associated with a speech recognition system, the speech recognition system including a speech input processor for receiving the uttered word and a speech recognition engine for decoding the uttered word to generate a decoded word, the apparatus comprising:
a processor, operatively coupled to the speech recognition engine, for computing respective measures between the decoded word output from the speech recognition engine and the at least a portion of other existing vocabulary words, wherein the respective measures are indicative of acoustic similarity between the decoded word and the at least a portion of other existing vocabulary words, and wherein the processor identifies the other existing words, associated with measures which correspond to a threshold range, as homophones of the uttered word; and
an output device for presenting the homophones identified by the processor to the user, wherein the user can select an identified homophone that corresponds to the word uttered by the user.
(Dependent claims: 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26)
(a) comparing a leaf sequence associated with the decoded word to respective leaf sequences associated with the at least a portion of other existing words; and
(b) generating respective distance measures in response to the comparisons, the respective distance measures indicative of acoustic distances between the compared leaf sequences.
24. The apparatus of claim 23, wherein the processor further performs a best match alignment process between leaf sequences of unequal phonetic length.
25. The apparatus of claim 17, wherein the processor calculates the respective distance measures via a Kullback-Leibler distance metric.
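The claim names the Kullback-Leibler metric but not the models it is applied to. As a hedged illustration, the sketch below assumes each compared unit is modeled by a single diagonal-covariance Gaussian and uses the closed-form divergence for that case. The symmetrized sum is a further assumption, chosen so that the value does not depend on argument order. The function names are hypothetical.

```python
import math
from typing import Sequence


def kl_diag_gaussians(
    mean_p: Sequence[float],
    var_p: Sequence[float],
    mean_q: Sequence[float],
    var_q: Sequence[float],
) -> float:
    """Closed-form KL(p || q) for Gaussians with diagonal covariance:
    0.5 * sum(log(vq / vp) + (vp + (mp - mq)^2) / vq - 1)."""
    total = 0.0
    for mp, vp, mq, vq in zip(mean_p, var_p, mean_q, var_q):
        total += math.log(vq / vp) + (vp + (mp - mq) ** 2) / vq - 1.0
    return 0.5 * total


def symmetric_kl(
    mean_p: Sequence[float],
    var_p: Sequence[float],
    mean_q: Sequence[float],
    var_q: Sequence[float],
) -> float:
    """Assumed symmetrization: KL(p || q) + KL(q || p)."""
    return kl_diag_gaussians(mean_p, var_p, mean_q, var_q) + kl_diag_gaussians(
        mean_q, var_q, mean_p, var_p
    )


# Two one-dimensional unit-variance Gaussians whose means differ by 1.
print(kl_diag_gaussians([0.0], [1.0], [1.0], [1.0]))
print(symmetric_kl([0.0], [1.0], [1.0], [1.0]))
```

The plain divergence is not symmetric when the variances differ, which is why a symmetrized form is often preferred when the value is used as a distance between two models.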
26. The apparatus of claim 17, wherein, if at least one measure is not within the threshold range, the user confirms, via the input device, that the decoded word is the uttered word.
27. A program storage device, readable by a machine, tangibly embodying a program of instructions executable by the machine to perform method steps for identifying homophones of a word uttered by a user from at least a portion of existing words of a vocabulary of a speech recognition engine, the method steps comprising:
decoding the uttered word using the speech recognition engine to yield a decoded word;
computing respective measures between the decoded word and at least a portion of the other existing vocabulary words, the respective measures indicative of acoustic similarity between the word and the other existing words;
identifying the other existing words, associated with measures which correspond to a threshold range, as homophones of the uttered word; and
outputting the identified homophones, wherein the user can select an identified homophone that corresponds to the word uttered by the user.
(Dependent claims: 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39)
(a) comparing a leaf sequence associated with the decoded word to respective leaf sequences associated with the at least a portion of other existing words; and
(b) generating respective distance measures in response to the comparisons, the respective distance measures indicative of acoustic distances between the compared leaf sequences.
37. The program storage device of claim 36, wherein the instructions for the leaf sequence comparison step further comprise instructions for performing a best match alignment process between leaf sequences of unequal phonetic length.
38. The program storage device of claim 27, wherein the respective distance measures are calculated via a Kullback-Leibler distance metric.
39. The program storage device of claim 27, further comprising instructions for providing the capability for the user to confirm that the decoded word is the uttered word if at least one measure is not within the threshold range.
Specification