Homonym processing in the context of voice-activated command systems
First Claim
1. A method for constructing a grammar to be processed by a speech recognition engine in the context of a voice-activated command system, the method comprising:
- receiving a database containing a plurality of terms;
identifying from said plurality a first term and a second term that are spelled differently but have a first pronunciation in common;
whereinidentifying that one of the first and second terms also has a second pronunciation that is not inherent to the other of the first and second terms; and
placing the first and second pronunciations within the grammar.
1 Assignment
0 Petitions
Accused Products
Abstract
A method is disclosed from constructing a grammar. The grammar is configured to be processed by a speech recognition engine in the context of a voice-activated command system. The method includes receiving a database containing a plurality of terms. From the plurality of terms, first and second terms are identified. The first and second terms are spelled differently but have a first pronunciation in common. One of the first and second terms also has a second pronunciation that is not inherent to the other of the first and second terms. The first and second pronunciations are placed within the grammar.
62 Citations
15 Claims
-
1. A method for constructing a grammar to be processed by a speech recognition engine in the context of a voice-activated command system, the method comprising:
-
receiving a database containing a plurality of terms; identifying from said plurality a first term and a second term that are spelled differently but have a first pronunciation in common;
whereinidentifying that one of the first and second terms also has a second pronunciation that is not inherent to the other of the first and second terms; and placing the first and second pronunciations within the grammar. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A computer-implemented method for accomplishing disambiguation in the context of a voice-dialing system, the method comprising:
-
providing an input to a speech recognition engine for processing relative to a grammar that corresponds to a database containing a plurality of terms; including in the grammar a pair of pronunciations that correspond to a pair of terms from said plurality that are spelled differently, one of said pronunciations being a pronunciation shared by the pair of terms and the other being unique to one term in the the pair of terms; receiving from the speech recognition engine an output corresponding to one pronunciation from the pair of pronunciations; determining, based at least in part on the output, whether or not to use both in the pair of terms as a basis for disambiguation; and utilizing both or one of the pair of terms as a basis for disambiguation. - View Dependent Claims (8, 9, 10, 11, 12, 13, 14)
-
-
15. A speech recognition system comprising:
-
a context free grammar that includes a representation of a plurality of database terms including a pair of pronunciations that correspond to a pair of terms from said plurality that are spelled differently, one of said pronunciations being a pronunciation shared by the pair of terms and the other being unique to one of the pair of terms; and a speech recognition engine that utilizes the context free grammar as a basis for identifying a voice-activated command.
-
Specification