Recognizing multiple semantic items from single utterance
First Claim
Patent Images
1. A method to be executed at least in part in a computing device for recognizing multiple semantic items from a single utterance, the method comprising:
- receiving a single utterance including at least two semantically distinct items from a user;
performing a speech recognition operation on the single utterance to recognize a first item of the at least two semantically distinct items;
determining a constraint based on the recognition of the first item; and
performing another speech recognition operation on the single utterance to recognize a second item of the at least two semantically distinct items based on the determined constraint.
2 Assignments
0 Petitions
Accused Products
Abstract
Semantically distinct items are extracted from a single utterance by repeatedly recognizing the same utterance using constraints provided by semantic items already recognized. User feedback for selection or correction of partially recognized utterance may be used in a hierarchical, multi-modal, or single step manner. An accuracy of recognition is preserved while the less structured and more natural single utterance recognition form is allowed to be used.
61 Citations
20 Claims
-
1. A method to be executed at least in part in a computing device for recognizing multiple semantic items from a single utterance, the method comprising:
-
receiving a single utterance including at least two semantically distinct items from a user; performing a speech recognition operation on the single utterance to recognize a first item of the at least two semantically distinct items; determining a constraint based on the recognition of the first item; and performing another speech recognition operation on the single utterance to recognize a second item of the at least two semantically distinct items based on the determined constraint. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A computing device for recognizing multiple semantic items from a single utterance, the computing device comprising:
-
a memory; a processor coupled to the memory, the processor capable of executing a first application for speech recognition and a second application for consuming results of the speech recognition, wherein the first application is configured to; receive a single utterance including at least two semantically distinct items from a user, the semantically distinct items comprising at least one from a set of;
words, phrases, and fragments;process the single utterance to recognize a first item of the at least two semantically distinct items; provide the recognized first item to the user for one of confirmation and correction; receive one of the user correction and confirmation for the first item; determine a specific language model based on the first item; and process the single utterance again to recognize a second item of the at least two semantically distinct items applying the specific language model; and wherein the second application is configured to; in response to consuming the first item, provide input to the first application for the specific language model; and in response to consuming the second item, provide feedback to the user based on a combination of the first and second items. - View Dependent Claims (13, 14, 15, 16, 17)
-
-
18. A computer-readable storage medium with instructions stored thereon for recognizing multiple semantic items from a single utterance, the instructions comprising:
-
receiving a single utterance including a plurality of semantically distinct items from a user; performing a plurality of speech recognition operations on the single utterance to recognize one of the plurality of semantically distinct items during each operation, wherein the semantically distinct items are hierarchically related, and wherein a statistical language model for recognizing one of the plurality of semantically distinct items is determined based on a prior recognition during each operation; providing each recognized item to the user for one of confirmation and correction between recognition operations; upon receiving one of user correction and confirmation, providing the recognized plurality of semantically distinct items to a web-based search application; receiving input from the web-based search application; providing the received input to the user. - View Dependent Claims (19, 20)
-
Specification