Homonym processing in the context of voice-activated command systems
Abstract
A method is disclosed for constructing a grammar. The grammar is configured to be processed by a speech recognition engine in the context of a voice-activated command system. The method includes receiving a database containing a plurality of terms. From the plurality of terms, first and second terms are identified. The first and second terms are spelled differently but have a first pronunciation in common. One of the first and second terms also has a second pronunciation that is not inherent to the other of the first and second terms. The first and second pronunciations are placed within the grammar.
35 Claims
1-20. (canceled)
21. A method for constructing a grammar to be processed by a speech recognition engine in the context of a voice-activated command system, the method comprising:
receiving a database containing a plurality of terms;
identifying from said plurality a first term and a second term that are spelled differently but have a first pronunciation in common, wherein one of the first and second terms also has a second pronunciation that is not inherent to the other of the first and second terms; and
placing the first and second pronunciations within the grammar.
Dependent claims: 22, 23, 24, 25, 26
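The steps of claim 21 can be illustrated with a minimal Python sketch. It is not the patented implementation. The lexicon `LEXICON`, its ARPAbet-like phoneme strings, and the helper names `find_homonym_pairs` and `build_grammar` are all hypothetical and chosen only for illustration. The sketch receives a list of terms, identifies pairs that are spelled differently, share one pronunciation, and differ in at least one other, and then places every pronunciation in a grammar that maps each pronunciation to the terms it can denote.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical pronunciation lexicon: term -> set of phoneme strings.
# The entries are illustrative assumptions, not data from the patent.
LEXICON = {
    "read": {"r iy d", "r eh d"},
    "reed": {"r iy d"},
    "smith": {"s m ih th"},
}


def find_homonym_pairs(terms, lexicon):
    """Return (term_a, term_b, shared, extra) for each qualifying pair.

    A pair qualifies when the two spellings share at least one
    pronunciation and at least one pronunciation belongs to only one
    of the two terms.
    """
    pairs = []
    for a, b in combinations(sorted(set(terms)), 2):
        prons_a = lexicon.get(a, set())
        prons_b = lexicon.get(b, set())
        shared = prons_a & prons_b   # pronunciations in common
        extra = prons_a ^ prons_b    # pronunciations unique to one term
        if shared and extra:
            pairs.append((a, b, shared, extra))
    return pairs


def build_grammar(terms, lexicon):
    """Map each pronunciation to the set of terms it may refer to."""
    grammar = defaultdict(set)
    for term in terms:
        for pron in lexicon.get(term, ()):
            grammar[pron].add(term)
    return dict(grammar)


terms = ["read", "reed", "smith"]
pairs = find_homonym_pairs(terms, LEXICON)
grammar = build_grammar(terms, LEXICON)
```

In this toy data the shared pronunciation maps to both "read" and "reed", while the second pronunciation maps to "read" alone, which is the distinction the claim relies on.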
27. A computer-implemented method for accomplishing disambiguation in the context of a voice-dialing system, the method comprising:
providing an input to a speech recognition engine for processing relative to a grammar that corresponds to a database containing a plurality of terms;
including in the grammar a pair of pronunciations that correspond to a pair of terms from said plurality that are spelled differently, one of said pronunciations being a pronunciation shared by the pair of terms and the other being unique to one of the pair of terms;
receiving from the speech recognition engine an output corresponding to one of said pair of pronunciations;
utilizing, based at least in part on the output, either both or one of the pair of terms as a basis for disambiguation.
Dependent claims: 28, 29, 30, 31, 32, 33, 34
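The disambiguation step of claim 27 might be sketched as follows. This is an illustration under stated assumptions rather than the claimed system: the table `GRAMMAR`, the function `resolve`, and the action labels it returns are hypothetical, and the speech recognition engine is reduced to a string naming the pronunciation it matched.

```python
# Hypothetical grammar: pronunciation -> directory terms it may denote.
# "r iy d" is shared by both spellings; "r eh d" is unique to "Read".
GRAMMAR = {
    "r iy d": {"Read", "Reed"},
    "r eh d": {"Read"},
}


def resolve(recognized_pron, grammar):
    """Decide how to act on the pronunciation the recognizer returned.

    Returns an (action, candidates) tuple:
      "no_match"     - the pronunciation is not in the grammar
      "direct"       - exactly one term matches, no prompt is needed
      "disambiguate" - several terms match, the caller must choose
    """
    candidates = sorted(grammar.get(recognized_pron, ()))
    if not candidates:
        return ("no_match", [])
    if len(candidates) == 1:
        return ("direct", candidates)
    return ("disambiguate", candidates)


shared_result = resolve("r iy d", GRAMMAR)
unique_result = resolve("r eh d", GRAMMAR)
```

When the output corresponds to the shared pronunciation, both terms are used as the basis for disambiguation. When it corresponds to the unique pronunciation, only one term is used and the system can skip the prompt.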
35. A context free grammar configured to be processed by a speech recognition engine in the context of a voice-activated command system, comprising:
a representation of a plurality of database terms including a pair of pronunciations that correspond to a pair of terms from said plurality that are spelled differently, one of said pronunciations being a pronunciation shared by the pair of terms and the other being unique to one of the pair of terms.
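To make the grammar of claim 35 concrete, the following sketch serializes a pronunciation table as a single context-free rule. The BNF-like notation, the function `render_cfg`, and the sample data are assumptions invented for this example. They do not reproduce the format of any particular speech recognition engine or the format used in the patent.

```python
def render_cfg(rule_name, pron_to_terms):
    """Render one rule whose alternatives are pronunciations.

    Each alternative is written as /phonemes/ followed by a tag in
    braces listing the database terms that pronunciation may denote.
    """
    alternatives = []
    for pron in sorted(pron_to_terms):
        tag = "|".join(sorted(pron_to_terms[pron]))
        alternatives.append(f"/{pron}/ {{{tag}}}")
    return f"<{rule_name}> ::= " + " | ".join(alternatives)


# Hypothetical pair of terms: one shared and one unique pronunciation.
rule = render_cfg("name", {"r iy d": {"Read", "Reed"}, "r eh d": {"Read"}})
print(rule)
# prints: <name> ::= /r eh d/ {Read} | /r iy d/ {Read|Reed}
```

The tag on each alternative carries the information needed downstream: an alternative tagged with two terms signals a shared pronunciation, and an alternative tagged with one term signals a pronunciation unique to that term.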
Specification