SYSTEM AND METHOD OF SUPPORTING ADAPTIVE MISRECOGNITION IN CONVERSATIONAL SPEECH
Abstract
A system and method are provided for receiving speech and/or non-speech communications of natural language questions and/or commands and executing the questions and/or commands. The invention provides a conversational human-machine interface that includes a conversational speech analyzer, a general cognitive model, an environmental model, and a personalized cognitive model to determine context and domain knowledge and to invoke prior information to interpret a spoken utterance or a received non-spoken message. The system and method create, store, and use extensive personal profile information for each user, thereby improving the reliability of determining the context of the speech or non-speech communication and presenting the expected results for a particular question or command.
44 Claims
1. A system for processing natural language utterances, comprising:
a multimodal device configured to receive a natural language utterance;
a speech recognition engine configured to recognize one or more words from the natural language utterance;
a parser configured to generate an interpretation of the natural language utterance from the one or more recognized words, and further configured to generate a request based on the interpretation of the natural language utterance;
a domain agent configured to process the generated request; and
an adaptive misrecognition engine configured to monitor one or more actions associated with the domain agent processing the request and determine whether the interpretation of the natural language utterance is correct or incorrect based on the one or more monitored actions.
Dependent claims: 2-25
26. A method for processing natural language utterances, comprising:
receiving a natural language utterance at a multimodal device;
recognizing one or more words from the natural language utterance using a speech recognition engine coupled to the multimodal device;
generating an interpretation of the natural language utterance from the one or more recognized words using a parser coupled to the multimodal device, wherein the parser generates a request based on the interpretation of the natural language utterance;
invoking a domain agent configured to process the generated request;
monitoring one or more actions associated with the domain agent processing the request using an adaptive misrecognition engine; and
determining, at the adaptive misrecognition engine, whether the interpretation of the natural language utterance is correct or incorrect based on the one or more monitored actions.
Dependent claims: 27-44
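The steps recited in claim 26 can be illustrated with a minimal Python sketch. It is not the patented implementation. Every name in it (Request, DomainAgent, recognize, parse, NEGATIVE_FOLLOWUPS, interpretation_is_correct, handle) is hypothetical, the "speech recognition" is a plain text split standing in for a real recognizer, and the rule that a cancel, repeat, or correction signals a wrong interpretation is an assumption chosen only to make the monitoring step concrete.

```python
from dataclasses import dataclass, field


@dataclass
class Request:
    interpretation: str  # the parser's reading of the utterance
    domain: str          # which domain agent should handle the request


@dataclass
class DomainAgent:
    name: str
    actions: list = field(default_factory=list)  # action log seen by the monitor

    def process(self, request, user_followup=None):
        # Record what the agent did and, if present, how the user reacted.
        self.actions.append("executed:" + request.interpretation)
        if user_followup is not None:
            self.actions.append("followup:" + user_followup)
        return self.actions


def recognize(utterance):
    """Stand-in for a speech recognition engine: text in, words out."""
    return utterance.lower().split()


def parse(words):
    """Toy parser: a keyword lookup yields an interpretation and a request."""
    if "weather" in words:
        return Request("get_weather", "weather")
    if "play" in words:
        return Request("play_music", "music")
    return Request("unknown", "general")


# Assumed follow-up actions treated as evidence of a wrong interpretation.
NEGATIVE_FOLLOWUPS = {"cancel", "repeat", "correction"}


def interpretation_is_correct(actions):
    """Judge the interpretation from the monitored actions alone."""
    for action in actions:
        kind, _, detail = action.partition(":")
        if kind == "followup" and detail in NEGATIVE_FOLLOWUPS:
            return False
    return True


def handle(utterance, user_followup=None):
    """Run one utterance through recognize -> parse -> agent -> monitor."""
    words = recognize(utterance)
    request = parse(words)
    agent = DomainAgent(request.domain)
    actions = agent.process(request, user_followup)
    return request, interpretation_is_correct(actions)
```

In this sketch, calling handle("What is the weather today") routes the request to a weather agent and judges the interpretation correct, while handle("Play some jazz", user_followup="cancel") judges it incorrect because the monitored action log contains a cancel. A real system would presumably draw on richer signals than a single follow-up label.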
Specification