Non-target barge-in detection
First Claim
1. A speech recognition system comprising:
- an output device configured to play a speech prompt to a user of the system, the speech prompt conveying information to the user or prompting the user for speech input;
a detector configured to receive a sound signal indicative of a possible target sound and to output a signal representative of the sound signal and a signal indicating receipt of the sound signal, so as to permit interruption of further output of the prompt if the detector receives the sound signal while the output device is playing the prompt;
a recognizer coupled to the detector, the recognizer being configured to receive the signal representative of the sound signal and to output a recognizer signal indicative of whether the sound signal is a target sound; and
a control unit coupled to the output device, the detector, and the recognizer, the control unit being configured to receive the recognizer signal and, if the detector received the sound signal while the output device was playing a prompt and the recognizer signal indicates that the sound signal is other than a target sound, to cause the output device to play at least a portion of the prompt being played by the output device when the sound signal was received by the detector.
4 Assignments
0 Petitions
Accused Products
Abstract
A speech recognition system plays prompts to a user in order to obtain information from the user. If the user begins to speak, the prompt should stop. However, the system may receive sounds other than speech from the user while playing a prompt, in which case the prompt should continue. The system temporarily stops a prompt when it detects a sound or when it preliminarily determines that a detected sound may be a target sound (such as words from the user). The system then determines whether the received sound is a target sound or some other sound (such as coughing or a door shutting). If the received sound is not determined to be a target sound, then the prompt is resumed. The prompt can be resumed at any appropriate point, such as the point where it was stopped, a prior phrase boundary, or the beginning of the prompt.
43 Citations
22 Claims
-
1. A speech recognition system comprising:
-
an output device configured to play a speech prompt to a user of the system, the speech prompt conveying information to the user or prompting the user for speech input;
a detector configured to receive a sound signal indicative of a possible target sound and to output a signal representative of the sound signal and a signal indicating receipt of the sound signal, so as to permit interruption of further output of the prompt if the detector receives the sound signal while the output device is playing the prompt;
a recognizer coupled to the detector, the recognizer being configured to receive the signal representative of the sound signal and to output a recognizer signal indicative of whether the sound signal is a target sound; and
a control unit coupled to the output device, the detector, and the recognizer, the control unit being configured to receive the recognizer signal and, if the detector received the sound signal while the output device was playing a prompt and the recognizer signal indicates that the sound signal is other than a target sound, to cause the output device to play at least a portion of the prompt being played by the output device when the sound signal was received by the detector. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A computer program product, residing on a computer-readable medium, for use in a speech recognition application configured to play prompts to a user of the application and to receive signals indicative of sounds, the speech recognition application being configured to interrupt at an interruption point a prompt being played when a sound is received by the application, the computer program product comprising instructions for causing a computer to:
-
determine whether a signal indicative of a sound received by the speech recognition application while a prompt is being played is indicative of a target sound; and
cause the speech recognition application to play at least a portion of the prompt being played by the output device when the sound that was received by the speech recognition application is determined to be indicative of a non-target sound. - View Dependent Claims (12, 13, 14, 15)
-
-
16. An interactive speech method comprising:
-
playing a prompt to a user;
detecting a signal indicative of a sound;
in response to detecting the signal indicative of a sound, interrupting the playing of the prompt at an interruption point of the prompt;
determining whether the detected signal is indicative of a target sound; and
resuming play of at least a portion of the prompt in response to determining that the detected signal is indicative of a non-target sound. - View Dependent Claims (17, 18, 19, 20)
-
-
21. A speech recognition system comprising:
-
an output device configured to play speech prompts to a user of the system, the speech prompts conveying information to the user or prompting the user for speech input;
a detector configured to receive a signal indicative of a possible target sound and to output a signal representative of the sound; and
a control unit, coupled to the output device and the detector, programmed to stop the output device from playing the prompt if the detector receives the signal indicative of a possible target sound while the output device is playing the prompt, and to cause the output device to resume playing the prompt if the sound is determined to be a non-target sound. - View Dependent Claims (22)
-
Specification