SYSTEM AND METHOD OF SUPPORTING ADAPTIVE MISRECOGNITION IN CONVERSATIONAL SPEECH
First Claim
1. A system for processing natural language utterances, comprising:
- a multimodal device configured to receive a natural language utterance;
a speech recognition engine configured to recognize one or more words from the natural language utterance;
a parser configured to generate an interpretation of the natural language utterance from the one or more recognized words, and further configured to generate a request based on the interpretation of the natural language utterance;
a domain agent configured to process the generated request; and
an adaptive misrecognition engine configured to;
monitor one or more actions associated with the domain agent processing the request; and
determine whether the interpretation of the natural language utterance is correct or incorrect based on the one or more monitored actions.
5 Assignments
0 Petitions
Accused Products
Abstract
A system and method are provided for receiving speech and/or non-speech communications of natural language questions and/or commands and executing the questions and/or commands. The invention provides a conversational human-machine interface that includes a conversational speech analyzer, a general cognitive model, an environmental model, and a personalized cognitive model to determine context, domain knowledge, and invoke prior information to interpret a spoken utterance or a received non-spoken message. The system and method creates, stores and uses extensive personal profile information for each user, thereby improving the reliability of determining the context of the speech or non-speech communication and presenting the expected results for a particular question or command.
-
Citations
50 Claims
-
1. A system for processing natural language utterances, comprising:
-
a multimodal device configured to receive a natural language utterance; a speech recognition engine configured to recognize one or more words from the natural language utterance; a parser configured to generate an interpretation of the natural language utterance from the one or more recognized words, and further configured to generate a request based on the interpretation of the natural language utterance; a domain agent configured to process the generated request; and an adaptive misrecognition engine configured to; monitor one or more actions associated with the domain agent processing the request; and determine whether the interpretation of the natural language utterance is correct or incorrect based on the one or more monitored actions. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28)
-
-
29. A method for processing natural language utterances, comprising:
-
receiving a natural language utterance at a multimodal device; recognizing one or more words from the natural language utterance using a speech recognition engine coupled to the multimodal device; generating an interpretation of the natural language utterance from the one or more recognized words using a parser coupled to the multimodal device, wherein the parser further generates a request based on the interpretation of the natural language utterance; invoking a domain agent configured to process the generated request; monitoring one or more actions associated with the domain agent processing the request using an adaptive misrecognition engine; and determining whether the interpretation of the natural language utterance is correct or incorrect based on the actions monitored using the adaptive misrecognition engine. - View Dependent Claims (30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50)
-
Specification