User dedicated automatic speech recognition
First Claim
Patent Images
1. A device for automatic speech recognition (ASR) comprising:
- a multi-mode voice-controlled user interface employing at least one hardware implemented computer processor, wherein the user interface is adapted to conduct a speech dialog with one or more possible speakers and includes;
a broad listening mode which accepts speech inputs from the possible speakers without spatial filtering and has an associated limited broad mode recognition vocabulary; and
a selective listening mode which limits speech inputs to a specific speaker using spatial filtering and has an associated selective mode recognition vocabulary that is larger than the limited broad mode recognition vocabulary,wherein the user interface is adapted to;
switch from the broad listening mode to the selective listening mode in response to one or more switching cues,in the selective listening mode, engage the specific speaker in a dialog using the selective mode recognition vocabulary, andthe user interface is adapted to remain in the selective listening mode so long as a location of the specific speaker is known.
1 Assignment
0 Petitions
Accused Products
Abstract
A multi-mode voice controlled user interface is described. The user interface is adapted to conduct a speech dialog with one or more possible speakers and includes a broad listening mode which accepts speech inputs from the possible speakers without spatial filtering, and a selective listening mode which limits speech inputs to a specific speaker using spatial filtering. The user interface switches listening modes in response to one or more switching cues.
26 Citations
20 Claims
-
1. A device for automatic speech recognition (ASR) comprising:
-
a multi-mode voice-controlled user interface employing at least one hardware implemented computer processor, wherein the user interface is adapted to conduct a speech dialog with one or more possible speakers and includes; a broad listening mode which accepts speech inputs from the possible speakers without spatial filtering and has an associated limited broad mode recognition vocabulary; and a selective listening mode which limits speech inputs to a specific speaker using spatial filtering and has an associated selective mode recognition vocabulary that is larger than the limited broad mode recognition vocabulary, wherein the user interface is adapted to; switch from the broad listening mode to the selective listening mode in response to one or more switching cues, in the selective listening mode, engage the specific speaker in a dialog using the selective mode recognition vocabulary, and the user interface is adapted to remain in the selective listening mode so long as a location of the specific speaker is known. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A computer program product encoded in a non-transitory computer-readable medium for operating an automatic speech recognition (ASR) system, the product comprising:
-
program code executable to conduct a speech dialog with one or more possible speakers via a multi-mode voice-controlled user interface adapted to; accept speech inputs from the possible speakers in a broad listening mode without spatial filtering, the broad listening mode having an associated limited broad mode recognition vocabulary; and limit speech inputs to a specific speaker in a selective listening mode using spatial filtering, the selective listening mode having an associated selective mode recognition vocabulary that is larger than the limited broad mode recognition vocabulary, wherein the program code is executable to cause the user interface to; switch from the broad listening mode to the selective listening mode in response to one or more switching cues, in the selective listening mode, engage the specific speaker in a dialog using the selective mode recognition vocabulary, and the program code is executable to cause the user interface to remain in the selective listening mode so long as a location of the specific speaker is known. - View Dependent Claims (11)
-
-
12. A method for automatic speech recognition (ASR) comprising:
-
employing a multi-mode voice-controlled user interface having a computer processor to conduct a speech dialog with one or more possible speakers by; employing a broad listening mode which accepts speech inputs from the possible speakers without spatial filtering and has an associated limited broad mode recognition vocabulary; and employing a selective listening mode which limits speech inputs to a specific speaker using spatial filtering and has an associated selective mode recognition vocabulary that is larger than the limited broad mode recognition vocabulary, the user interface; switching from the broad listening mode to the selective listening mode in response to one or more switching cues, in the selective listening mode, engaging the specific speaker in a dialog using the selective mode recognition vocabulary, and remaining in the selective listening mode so long as a location of the specific speaker is known. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20)
-
Specification