Method and apparatus for reducing latency in speech-based applications
First Claim
1. A computer-readable medium having computer-executable instructions for performing steps of:
- receiving at least one instruction to add a new phrase and semantic information associated with the new phrase to a grammar;
combining the new phrase with at least one other phrase in the grammar to form a single grammar structure;
associating the semantic information with a single recognition value in the new phrase by selecting the first possible recognition value in the new phrase that can be associated with the semantic information without introducing semantic ambiguity into the grammar structure.
2 Assignments
0 Petitions
Accused Products
Abstract
A speech recognition interface and computer-readable medium build a grammar for speech recognition that reduces latency in speech-based applications. The interface and medium receive instructions to add a new phrase and semantic information to a grammar. The new phrase is combined with at least one other phrase in the grammar to form a composite grammar structure. The semantic information is then associated with a single word or transition in the grammar structure by selecting the first possible word or transition that semantically differentiates the new phrase from all other phrases in the grammar structure. By placing the semantic information in this position, the semantic information is placed as far forward in the grammar as possible without introducing semantic ambiguity into the grammar structure.
25 Citations
24 Claims
-
1. A computer-readable medium having computer-executable instructions for performing steps of:
-
receiving at least one instruction to add a new phrase and semantic information associated with the new phrase to a grammar;
combining the new phrase with at least one other phrase in the grammar to form a single grammar structure;
associating the semantic information with a single recognition value in the new phrase by selecting the first possible recognition value in the new phrase that can be associated with the semantic information without introducing semantic ambiguity into the grammar structure. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 14, 15, 16, 17, 18, 19, 20)
-
-
13. A speech recognition interface for a speech recognition engine, the interface comprising:
-
a grammar structure building component that receives instructions to add transitions to a grammar structure and in response adds transitions to a grammar structure; and
a semantic placement component that places semantic information associated with a current sequence of transitions on a first transition in the grammar structure that differentiates the current sequence of transitions from all other sequences of transitions in the grammar structure.
-
-
21. A computer-readable medium having computer-executable instructions representing a speech-enabled application capable of performing steps of:
-
receiving semantic information associated with a hypothesis word identified by a speech recognition engine before an entire utterance has been recognized;
identifying a possible action to take based on the semantic information; and
determining whether to take the action before the entire utterance has been recognized. - View Dependent Claims (22, 23, 24)
-
Specification