Method, system, and computer readable medium for comparing phonetic similarity of return words to resolve ambiguities during voice recognition
First Claim
1. A method for a speech recognition system to select a return value corresponding to a spoken input, the method comprising:
- (a) generating a dictionary comprising return values preexisting in the speech recognition system;
(b) generating a grammar for each return value in the dictionary;
(c) for each return value in the dictionary, analyzing the grammar to determine a subset of return values from the dictionary that are likely alternatives for the return value, comprising for each string in the grammar for the return value, comparing the string with every other string in the dictionary that is not in the grammar for that return value; and
if said comparison indicates that the strings are related based on one of a phonetic similarity threshold and a synonym relationship then adding the return value associated with the other string to the subset;
(d) selecting a first return value corresponding to the spoken input based on the grammar;
(e) if the first return value is not confirmed by a user, then presenting the return values in the subset for the first return value at once to the user for selection, wherein the user is notified of strings that have a high likelihood of being confused so that the user can make changes to the grammar.
8 Assignments
0 Petitions
Accused Products
Abstract
In one embodiment, the invention provides a method for a speech recognition system to select a return value corresponding to a spoken input. The method comprises generating a dictionary comprising return values associated with data provisioned in the speech recognition system; generating a grammar for each return value in the dictionary; analyzing the grammar to determine a subset of return values from the dictionary that are likely alternatives for each return value in the dictionary, based on the grammar; selecting a return value corresponding to the spoken input based on the grammar; and if the selected return value is not confirmed by a user, then presenting the likely alternative for the selected return value to the user.
29 Citations
14 Claims
-
1. A method for a speech recognition system to select a return value corresponding to a spoken input, the method comprising:
-
(a) generating a dictionary comprising return values preexisting in the speech recognition system; (b) generating a grammar for each return value in the dictionary; (c) for each return value in the dictionary, analyzing the grammar to determine a subset of return values from the dictionary that are likely alternatives for the return value, comprising for each string in the grammar for the return value, comparing the string with every other string in the dictionary that is not in the grammar for that return value; and
if said comparison indicates that the strings are related based on one of a phonetic similarity threshold and a synonym relationship then adding the return value associated with the other string to the subset;(d) selecting a first return value corresponding to the spoken input based on the grammar; (e) if the first return value is not confirmed by a user, then presenting the return values in the subset for the first return value at once to the user for selection, wherein the user is notified of strings that have a high likelihood of being confused so that the user can make changes to the grammar. - View Dependent Claims (2, 3, 4)
-
-
5. A speech recognition system, comprising:
-
a memory configured to store logic instructions and a processor configured to execute the logic instructions that when executed cause the; logic instructions to generate a dictionary comprising return values preexisting in the speech recognition system; logic instructions to generate a grammar for each return value in the dictionary; logic instructions to select a return value corresponding to a spoken input; logic instructions to confirm the selected return value with a user; logic instructions to generate a subset of alternative return values for the spoken input, wherein each alternative return value is related to the selected return value based on one of a synonym relationship and a phonetic similarity threshold between grammars for the return value and the alternative return value; and logic instructions to present the alternative return values to the user for selection, wherein the user is notified of at least a string that has a high likelihood of being confused so that the user can make changes to the grammar. - View Dependent Claims (6, 7, 8, 9)
-
-
10. A non-transitory computer-readable medium, having stored thereon a sequence of instructions, which when executed by a computer processor, cause the computer processor to perform a speech recognition algorithm to select a preexisting return value corresponding to a spoken input, the computer processor being further configured to perform:
-
generating a dictionary comprising return values preexisting in the speech recognition system; generating a grammar for each return value in the dictionary; for each return value in the dictionary, analyzing the grammar to determine a subset of return values from the dictionary that are likely alternatives for the return value, comprising for each string in the grammar for the return value, comparing the string with every other string in the dictionary that is not in the grammar for that return value; and
if said comparison indicates that the strings are related based on one of a phonetic similarity threshold and a synonym relationship then adding the return value associated with the other string to the subset;selecting a first return value corresponding to the spoken input based on the grammar; and if the first return value is not confirmed by a user, then presenting the return values in the subset for the first return value at once to the user for selection, wherein the user is notified of strings that have a high likelihood of being confused so that the user can make changes to the grammar.
-
-
11. A speech recognition system, comprising:
-
a processor; and a memory coupled to the processor, the memory storing instructions which when executed by the processor, cause the system to perform a method for selecting a preexisting return value corresponding to a spoken input, the method comprising; generating a dictionary comprising return values preexisting in the speech recognition system; generating a grammar for each return value in the dictionary; for each return value in the dictionary, analyzing the grammar to determine a subset of return values from the dictionary that are likely alternatives for the return value, comprising for each string in the grammar for the return value, comparing the string with every other string in the dictionary that is not in the grammar for that return value; and
if said comparison indicates that the strings are related based on one of a phonetic similarity threshold and a synonym relationship then adding the return value associated with the other string to the subset;selecting a first return value corresponding to the spoken input based on the grammar; if the first return value is not confirmed by a user, then presenting the return values in the subset for the first return value at once to the user for selection, wherein the user is notified of strings that have a high likelihood of being confused so that the user can make changes to the grammar.
-
-
12. A non-transitory computer-readable storage medium, having stored thereon, a sequence of instructions which when executed by a computer processor, cause the computer processor to perform:
-
generating a dictionary comprising return values preexisting in the speech recognition system; generating a grammar for each return value in the dictionary; selecting a return value corresponding to a spoken input; confirming the selected return value with a user; generating a subset of alternative return values for the spoken input, wherein each alternative return value is related to the selected return value based on one of a synonym relationship and a phonetic similarity threshold return value between the grammars for the return value and the alternative return value; and presenting the alternative return values to the user for selection, wherein the user is notified of at least a string that has a high likelihood of being confused so that the user can make changes to the grammar.
-
-
13. A method for a speech recognition system, comprising:
-
generating a dictionary comprising return values preexisting in the speech recognition system; generating a grammar for each return value in the dictionary; selecting a return value corresponding to a spoken input; confirming the selected return value with a user; generating a subset of alternative return values for the spoken input, wherein each alternative return value is related to the selected return value based on one of a synonym relationship and a phonetic similarity threshold between the grammars for the return value and the alternative return value; and presenting the alternative return values to the user for selection, wherein the user is notified of at least a string that has a high likelihood of being confused so that the user can make changes to the grammar. - View Dependent Claims (14)
-
Specification