Device, Method, and Program for Performing Interaction Between User and Machine
First Claim
1. A device for performing interaction between a user and a machine, the device having a plurality of domains corresponding to multiple phases of the interaction, each domain including speech understanding means for understanding content of a speech of the user and outputting a speech understanding result, the device comprising:
- means for recognizing the speech of the user from a signal detected by a microphone;
means for delivering the speech of the user to the respective speech understanding means, receiving the speech understanding result from each of the speech understanding means and selecting, as a relevant domain, the domain having the speech understanding means that produced an optimum speech understanding result;
means for referring to a task knowledge of the relevant domain out of task knowledges included in the respective multiple domains, and extracting a task associated with the speech understanding result;
means for referring to a subtask knowledge including a plurality of subtasks associated with the kind of task, and determining a sequence of subtasks associated with the extracted task;
means for determining a first subtask of the sequence of subtasks as a relevant subtask and updating, as a relevant domain, the domain to which the relevant subtask belongs;
means for referring to an action knowledge of the relevant domain out of action knowledges included in the respective multiple domains, and extracting a subtask completion flag or an action associated with the speech understanding result and the subtask; and
means for causing the machine to execute the extracted action.
1 Assignment
0 Petitions
Accused Products
Abstract
There is provided a device for performing interaction between a user and a machine. The device includes a plurality of domains corresponding to a plurality of stages in the interaction. Each of the domains has voice comprehension means which understands the content of the user'"'"'s voice. The device includes: means for recognizing the user'"'"'s voice; means for selecting a domain enabling the best voice comprehension result as the domain; means for referencing task knowledge of the domain and extracting a task
-
Citations
15 Claims
-
1. A device for performing interaction between a user and a machine, the device having a plurality of domains corresponding to multiple phases of the interaction, each domain including speech understanding means for understanding content of a speech of the user and outputting a speech understanding result, the device comprising:
-
means for recognizing the speech of the user from a signal detected by a microphone; means for delivering the speech of the user to the respective speech understanding means, receiving the speech understanding result from each of the speech understanding means and selecting, as a relevant domain, the domain having the speech understanding means that produced an optimum speech understanding result; means for referring to a task knowledge of the relevant domain out of task knowledges included in the respective multiple domains, and extracting a task associated with the speech understanding result; means for referring to a subtask knowledge including a plurality of subtasks associated with the kind of task, and determining a sequence of subtasks associated with the extracted task; means for determining a first subtask of the sequence of subtasks as a relevant subtask and updating, as a relevant domain, the domain to which the relevant subtask belongs; means for referring to an action knowledge of the relevant domain out of action knowledges included in the respective multiple domains, and extracting a subtask completion flag or an action associated with the speech understanding result and the subtask; and means for causing the machine to execute the extracted action. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A method for performing interaction between a user and a machine, comprising the steps of:
-
recognizing a speech of the user from a signal detected by a microphone; delivering the speech of the user to a plurality of domains corresponding to multiple phases of the interaction; understanding content of the speech in each of the plurality of domains and outputting a speech understanding result; receiving the speech understanding results respectively from the plurality of domains; selecting, as a relevant domain, the domain that produced an optimum speech understanding result out of the plurality of speech understanding results; referring to a task knowledge of the relevant domain out of task knowledges included in the respective multiple domains, and extracting a task associated with the speech understanding result; referring to a subtask knowledge including a plurality of subtasks associated with the kind of task, and determining a sequence of subtasks associated with the extracted task; determining a first subtask of the sequence of subtasks as a relevant subtask and updating, as a relevant domain, the domain to which the relevant subtask belongs; referring to an action knowledge of the relevant domain out of action knowledges included in the respective multiple domains, and extracting a subtask completion flag or an action associated with the speech understanding result and the subtask; and causing the machine to execute the extracted action. - View Dependent Claims (7, 8, 9, 10)
-
-
11. A computer readable recording media storing a computer program for performing interaction between a user and a machine, the program causing a computer to perform the functions of:
-
recognizing a speech of the user from a signal detected by a microphone; delivering the speech of the user to a plurality of domains corresponding to multiple steps of the interaction with the user; understanding content of the speech in each of the plurality of domains and outputting a speech understanding result; receiving the speech understanding results respectively from the plurality of domains; selecting, as a relevant domain, the domain having an optimum speech understanding result out of the plurality of speech understanding results; referring to a task knowledge of the relevant domain out of task knowledges included in the respective multiple domains, and extracting a task associated with the speech understanding result; referring to a subtask knowledge including a plurality of subtasks associated with the kind of task, and determining a sequence of subtasks associated with the extracted task; determining a first subtask of the sequence of subtasks as a relevant subtask and updating, as a relevant domain, the domain to which the relevant subtask belongs; referring to an action knowledge of the relevant domain out of action knowledges included in the respective multiple domains, and extracting a subtask completion flag or an action associated with the speech understanding result and the subtask; and causing the machine to execute the extracted action, the program being recorded on a recording medium readable by the computer. - View Dependent Claims (12, 13, 14, 15)
-
Specification