Method and system for preventing error amplification in natural language dialogues
Abstract
A method and system for allowing a user to interface with an interactive voice response system via natural language commands. The method and apparatus transition users to a known state upon poor recognition results, transition users to a lateral state upon entry of a command corresponding to a lateral state, and provide intermediate barge-in states to account for user speech patterns.
45 Claims
1. A method of allowing a user to barge-in over prompts, the method comprising the steps performed by a machine of:
playing a prompt;
detecting an initial user speech from the user;
determining whether the user speech restarts within a predetermined amount of time;
upon a determination that the user speech restarts, using the user speech after the restart for recognition purposes; and
upon a determination that the user speech does not restart, using the initial user speech.
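The restart test in claim 1 can be illustrated with a minimal Python sketch. The `Segment` type, the `select_speech` function, and the choice to measure the predetermined time from the end of the initial speech are assumptions made for illustration; the claim does not specify them.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class Segment:
    """One detected stretch of user speech; times are seconds since the prompt began."""
    start: float
    end: float
    audio: str  # stand-in for the audio buffer handed to a recognizer


def select_speech(segments: Sequence[Segment], restart_window: float) -> Optional[Segment]:
    """Return the segment to use for recognition, or None if the user never spoke."""
    if not segments:
        return None
    initial = segments[0]
    if len(segments) > 1:
        restart = segments[1]
        # Assumption: the window runs from the end of the initial speech.
        if restart.start - initial.end <= restart_window:
            return restart  # speech restarted in time: use the speech after the restart
    return initial  # no timely restart: use the initial speech
```

Under these assumptions, a false start such as "uh" followed half a second later by a full command would yield the second segment, while an isolated utterance would be used as is.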
2. A method of allowing a user to barge-in over prompts, the method comprising the steps performed by a machine of:
playing a prompt;
determining whether the user has begun speaking, resulting in an initial user speech;
upon a determination that the user has begun speaking, generating a first recognition result based upon the initial user speech;
determining whether the user speech restarts within a predetermined amount of time, resulting in a second user speech;
upon a determination that the user speech restarts, performing the substeps of:
generating a second recognition result based upon the second user speech; and
reporting the second recognition result; and
upon a determination that the user speech does not restart, reporting the first recognition result.
4. A method of allowing a user to barge-in over prompts, the method comprising the steps performed by a machine of:
playing a prompt;
determining whether the user has begun speaking, resulting in an initial user speech;
upon a determination that the user has begun speaking, performing the substeps of:
halting the playing of the prompt; and
generating a first recognition result based upon the initial user speech;
determining whether the user speech restarts within a predetermined amount of time, resulting in a second user speech;
upon a determination that the user speech restarts, performing the substeps of:
generating a second recognition result based upon the second user speech;
determining whether the first recognition result or the second recognition result is to be used;
upon a determination that the first recognition result is to be used, reporting the first recognition result; and
upon a determination that the second recognition result is to be used, reporting the second recognition result; and
upon a determination that the user speech does not restart, using the first recognition result.
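The flow of claim 4 can be sketched in Python as follows. The `Result` type, the `barge_in` function, and the injected `recognize` and `halt_prompt` callables are hypothetical names. Claim 4 does not say how the choice between the two results is made, so the higher-confidence rule used here is purely an assumption.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass(frozen=True)
class Result:
    """A recognition result with a confidence score in [0, 1]."""
    text: str
    confidence: float


def barge_in(
    initial_audio: str,
    restarted_audio: Optional[str],
    recognize: Callable[[str], Result],
    halt_prompt: Callable[[], None],
) -> Result:
    """Halt the prompt, recognize the initial speech, and pick a result to report."""
    halt_prompt()  # the user has begun speaking
    first = recognize(initial_audio)
    if restarted_audio is None:
        return first  # speech did not restart: use the first result
    second = recognize(restarted_audio)
    # Assumption: prefer the more confident result; ties go to the restart.
    return second if second.confidence >= first.confidence else first
```

Passing the recognizer and prompt control in as callables keeps the sketch independent of any particular speech engine.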
8. A method of providing a voice interface between a machine and a user, the method comprising the steps performed by the machine of:
playing a prompt;
receiving a voice command from the user indicating a preferred option;
performing a voice recognition algorithm on the voice command, the voice recognition algorithm returning a recognized command and a confidence value;
determining whether the recognized command corresponds to a valid function and whether the confidence value is acceptable;
upon a determination that the recognized command corresponds to a valid function and that the confidence value is acceptable, performing the actions corresponding to the recognized command;
upon a determination that the recognized command does not correspond to a valid function or that the confidence value is unacceptable, playing an alternative prompt for providing an alternative method for the user to enter a valid command;
determining whether the voice recognition algorithm failed to return a valid recognized command and an acceptable confidence value after a plurality of consecutive attempts; and
upon a determination that the voice recognition algorithm failed to return a valid recognized command and an acceptable confidence value after a plurality of consecutive attempts, transitioning into a known state.
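As an illustration of claim 8, the Python sketch below tracks consecutive recognition failures and falls back to a known state. The function name `run_dialogue`, the threshold values, and the action labels are assumptions, not values taken from the patent; recognition itself is replaced by precomputed `(command, confidence)` pairs.

```python
from typing import Iterable, List, Set, Tuple


def run_dialogue(
    attempts: Iterable[Tuple[str, float]],
    valid_commands: Set[str],
    min_confidence: float = 0.6,  # hypothetical acceptance threshold
    max_failures: int = 3,        # hypothetical "plurality of consecutive attempts"
) -> List[str]:
    """Return the action taken for each (recognized command, confidence) pair."""
    actions: List[str] = []
    failures = 0
    for command, confidence in attempts:
        if command in valid_commands and confidence >= min_confidence:
            actions.append(f"perform:{command}")
            failures = 0  # a success breaks the run of consecutive failures
            continue
        failures += 1
        if failures >= max_failures:
            actions.append("goto:known_state")
            failures = 0
        else:
            actions.append("play:alternative_prompt")
    return actions
```

In this sketch the final failed attempt triggers the transition directly instead of another alternative prompt; an implementation could equally play the prompt first and then transition.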
9. A method of providing a voice interface between a machine and a user, the method comprising the steps performed by the machine of:
playing a prompt;
receiving a voice command from the user indicating a preferred option;
determining if the voice command is essentially a single-token command;
upon a determination that the voice command is essentially a single-token command, playing a multi-token prompt having the single-token command; and
upon a determination that the voice command is not essentially a single-token command, performing the voice command.
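One way to read "essentially a single-token command" in claim 9 is that a single content word remains once filler words are dropped. The Python sketch below takes that reading as an assumption; the filler list, the prompt wording, and the function names are hypothetical.

```python
from typing import List, Tuple

# Hypothetical filler words that do not count as command tokens.
FILLERS = frozenset({"uh", "um", "please", "the", "my"})


def content_tokens(utterance: str) -> List[str]:
    """Lower-case the utterance and drop filler words."""
    return [t for t in utterance.lower().split() if t not in FILLERS]


def respond(utterance: str) -> Tuple[str, str]:
    """Return ("prompt", text) for single-token commands, else ("perform", command)."""
    tokens = content_tokens(utterance)
    if not tokens:
        raise ValueError("no command tokens in utterance")
    if len(tokens) == 1:
        token = tokens[0]
        # The multi-token prompt contains the single-token command itself.
        return ("prompt", f'Please say a full phrase, for example "check {token}".')
    return ("perform", " ".join(tokens))
```

Under these assumptions, "um balance please" would produce a prompt suggesting a longer phrase, while "check my balance" would be performed directly.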
11. A method of determining a grammar of an interaction state of an interactive voice response system, the method comprising the steps of:
initializing the grammar with a first set of models corresponding to valid actions of the interaction state;
adding to the grammar a second set of models corresponding to valid actions of a lateral state; and
adding to the grammar a third set of models corresponding to valid actions of an entry state.
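The grammar construction of claim 11 can be sketched in Python by treating each "model" as a plain phrase mapped to the state that handles it. That simplification, the `build_grammar` name, and the rule that the current state wins when a phrase is shared are all assumptions for illustration.

```python
from typing import Dict, List


def build_grammar(
    state: str,
    lateral: str,
    entry: str,
    actions: Dict[str, List[str]],
) -> Dict[str, str]:
    """Map each accepted phrase to the state whose action it triggers."""
    # First set: valid actions of the interaction state itself.
    grammar = {phrase: state for phrase in actions[state]}
    # Second and third sets: valid actions of the lateral and entry states.
    for other in (lateral, entry):
        for phrase in actions[other]:
            # Assumption: a phrase already in the grammar keeps its earlier owner.
            grammar.setdefault(phrase, other)
    return grammar
```

With such a grammar, a user in one state could speak a command belonging to a lateral or entry state and be routed there without first navigating back.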
15. A method of determining a grammar of an interaction state of an interactive voice response system, the method comprising the steps of:
initializing the grammar with a first set of models corresponding to valid actions of the interaction state;
adding to the grammar a second set of models comprising a subset of valid actions of a lateral state, the subset of valid actions being resistant to out-of-grammar errors; and
adding to the grammar a third set of models comprising a subset of valid actions of an entry state, the subset of valid actions of the entry state being resistant to out-of-grammar errors corresponding to valid actions of the entry state.
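Claim 15 restricts the lateral and entry additions to actions that are resistant to out-of-grammar errors, without stating how resistance is measured. The Python sketch below assumes a hypothetical per-phrase false-accept rate and a cutoff; both are illustrative and do not come from the patent.

```python
from typing import Dict, Iterable, Set


def robust_subset(false_accept_rate: Dict[str, float], max_rate: float) -> Set[str]:
    """Keep only phrases whose (hypothetical) false-accept rate is at or below max_rate."""
    return {phrase for phrase, rate in false_accept_rate.items() if rate <= max_rate}


def build_robust_grammar(
    own_actions: Iterable[str],
    lateral_rates: Dict[str, float],
    entry_rates: Dict[str, float],
    max_rate: float = 0.02,  # hypothetical cutoff
) -> Set[str]:
    """Combine the state's own actions with the robust lateral and entry subsets."""
    return (
        set(own_actions)
        | robust_subset(lateral_rates, max_rate)
        | robust_subset(entry_rates, max_rate)
    )
```

Short or easily confused phrases would tend to be filtered out under this kind of rule, which limits how often out-of-grammar speech is mistaken for a state change.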
16. An apparatus for allowing a user to barge-in over prompts, the apparatus comprising:
means for playing a prompt;
means for detecting an initial user speech from the user;
means for determining whether the user speech restarts within a predetermined amount of time;
means for, upon a determination that the user speech restarts, using the user speech after the restart for recognition purposes; and
means for, upon a determination that the user speech does not restart, using the initial user speech.
17. An apparatus for allowing a user to barge-in over prompts, the apparatus comprising:
means for playing a prompt;
means for determining whether the user has begun speaking, resulting in an initial user speech;
means for, upon a determination that the user has begun speaking, generating a first recognition result based upon the initial user speech;
means for determining whether the user speech restarts within a predetermined amount of time, resulting in a second user speech;
means for, upon a determination that the user speech restarts, generating a second recognition result based upon the second user speech, and reporting the second recognition result; and
means for, upon a determination that the user speech does not restart, reporting the first recognition result.
19. An apparatus for allowing a user to barge-in over prompts, the apparatus comprising:
means for playing a prompt;
means for determining whether the user has begun speaking, resulting in an initial user speech;
means for, upon a determination that the user has begun speaking, halting the playing of the prompt, and generating a first recognition result based upon the initial user speech;
means for determining whether the user speech restarts within a predetermined amount of time, resulting in a second user speech;
means for, upon a determination that the user speech restarts, generating a second recognition result based upon the second user speech, determining whether the first recognition result or the second recognition result is to be used, reporting the first recognition result upon a determination that the first recognition result is to be used, and reporting the second recognition result upon a determination that the second recognition result is to be used; and
means for, upon a determination that the user speech does not restart, using the first recognition result.
23. An apparatus for providing a voice interface between a machine and a user, the apparatus comprising:
means for playing a prompt;
means for receiving a voice command from the user indicating a preferred option;
means for performing a voice recognition algorithm on the voice command, the voice recognition algorithm returning a recognized command and a confidence value;
means for determining whether the recognized command corresponds to a valid function and whether the confidence value is acceptable;
means for, upon a determination that the recognized command corresponds to a valid function and that the confidence value is acceptable, performing the actions corresponding to the recognized command;
means for, upon a determination that the recognized command does not correspond to a valid function or that the confidence value is unacceptable, playing an alternative prompt for providing an alternative method for the user to enter a valid command;
means for determining whether the voice recognition algorithm failed to return a valid recognized command and an acceptable confidence value after a plurality of consecutive attempts; and
means for, upon a determination that the voice recognition algorithm failed to return a valid recognized command and an acceptable confidence value after a plurality of consecutive attempts, transitioning into a known state.
24. An apparatus for providing a voice interface between a machine and a user, the apparatus comprising:
means for playing a prompt;
means for receiving a voice command from the user indicating a preferred option;
means for determining if the voice command is essentially a single-token command;
means for, upon a determination that the voice command is essentially a single-token command, playing a multi-token prompt having the single-token command; and
means for, upon a determination that the voice command is not essentially a single-token command, performing the voice command.
26. An apparatus for determining a grammar of an interaction state of an interactive voice response system, the apparatus comprising:
means for initializing the grammar with a first set of models corresponding to valid actions of the interaction state;
means for adding to the grammar a second set of models corresponding to valid actions of a lateral state; and
means for adding to the grammar a third set of models corresponding to valid actions of an entry state.
30. An apparatus for determining a grammar of an interaction state of an interactive voice response system, the apparatus comprising:
means for initializing the grammar with a first set of models corresponding to valid actions of the interaction state;
means for adding to the grammar a second set of models comprising a subset of valid actions of a lateral state, the subset of valid actions being resistant to out-of-grammar errors; and
means for adding to the grammar a third set of models comprising a subset of valid actions of an entry state, the subset of valid actions of the entry state being resistant to out-of-grammar errors corresponding to valid actions of the entry state.
31. A computer program product for allowing a user to barge-in over prompts, the computer program product having a medium with a computer program embodied thereon, the computer program comprising:
computer program code for playing a prompt;
computer program code for detecting an initial user speech from the user;
computer program code for determining whether the user speech restarts within a predetermined amount of time;
computer program code for, upon a determination that the user speech restarts, using the user speech after the restart for recognition purposes; and
computer program code for, upon a determination that the user speech does not restart, using the initial user speech.
32. A computer program product for allowing a user to barge-in over prompts, the computer program product having a medium with a computer program embodied thereon, the computer program comprising:
computer program code for playing a prompt;
computer program code for determining whether the user has begun speaking, resulting in an initial user speech;
computer program code for, upon a determination that the user has begun speaking, generating a first recognition result based upon the initial user speech;
computer program code for determining whether the user speech restarts within a predetermined amount of time, resulting in a second user speech;
computer program code for, upon a determination that the user speech restarts, generating a second recognition result based upon the second user speech, and reporting the second recognition result; and
computer program code for, upon a determination that the user speech does not restart, reporting the first recognition result.
34. A computer program product for allowing a user to barge-in over prompts, the computer program product having a medium with a computer program embodied thereon, the computer program comprising:
computer program code for playing a prompt;
computer program code for determining whether the user has begun speaking, resulting in an initial user speech;
computer program code for, upon a determination that the user has begun speaking, halting the playing of the prompt, and generating a first recognition result based upon the initial user speech;
computer program code for determining whether the user speech restarts within a predetermined amount of time, resulting in a second user speech;
computer program code for, upon a determination that the user speech restarts, generating a second recognition result based upon the second user speech, determining whether the first recognition result or the second recognition result is to be used, reporting the first recognition result upon a determination that the first recognition result is to be used, and reporting the second recognition result upon a determination that the second recognition result is to be used; and
computer program code for, upon a determination that the user speech does not restart, using the first recognition result.
38. A computer program product for providing a voice interface between a machine and a user, the computer program product having a medium with a computer program embodied thereon, the computer program comprising:
computer program code for playing a prompt;
computer program code for receiving a voice command from the user indicating a preferred option;
computer program code for performing a voice recognition algorithm on the voice command, the voice recognition algorithm returning a recognized command and a confidence value;
computer program code for determining whether the recognized command corresponds to a valid function and whether the confidence value is acceptable;
computer program code for, upon a determination that the recognized command corresponds to a valid function and that the confidence value is acceptable, performing the actions corresponding to the recognized command;
computer program code for, upon a determination that the recognized command does not correspond to a valid function or that the confidence value is unacceptable, playing an alternative prompt for providing an alternative method for the user to enter a valid command;
computer program code for determining whether the voice recognition algorithm failed to return a valid recognized command and an acceptable confidence value after a plurality of consecutive attempts; and
computer program code for, upon a determination that the voice recognition algorithm failed to return a valid recognized command and an acceptable confidence value after a plurality of consecutive attempts, transitioning into a known state.
39. A computer program product for providing a voice interface between a machine and a user, the computer program product having a medium with a computer program embodied thereon, the computer program comprising:
computer program code for playing a prompt;
computer program code for receiving a voice command from the user indicating a preferred option;
computer program code for determining if the voice command is essentially a single-token command;
computer program code for, upon a determination that the voice command is essentially a single-token command, playing a multi-token prompt having the single-token command; and
computer program code for, upon a determination that the voice command is not essentially a single-token command, performing the voice command.
40. The computer program product of claim 39, further comprising computer program code for, upon a determination that the voice command is not essentially a single-token command, playing a success prompt.
41. A computer program product for determining a grammar of an interaction state of an interactive voice response system, the computer program product having a medium with a computer program embodied thereon, the computer program comprising:
computer program code for initializing the grammar with a first set of models corresponding to valid actions of the interaction state;
computer program code for adding to the grammar a second set of models corresponding to valid actions of a lateral state; and
computer program code for adding to the grammar a third set of models corresponding to valid actions of an entry state.
45. A computer program product for determining a grammar of an interaction state of an interactive voice response system, the computer program product having a medium with a computer program embodied thereon, the computer program comprising:
computer program code for initializing the grammar with a first set of models corresponding to valid actions of the interaction state;
computer program code for adding to the grammar a second set of models comprising a subset of valid actions of a lateral state, the subset of valid actions being resistant to out-of-grammar errors; and
computer program code for adding to the grammar a third set of models comprising a subset of valid actions of an entry state, the subset of valid actions of the entry state being resistant to out-of-grammar errors.
Specification