Automatic speech recognition with a selection list
First Claim
1. A method of automatic speech recognition (‘
- ASR’
), the method implemented with a speech recognition grammar of a multimodal application, with the multimodal application operating on a multimodal device supporting multiple modes of user interaction with the multimodal application, the modes of user interaction including a voice mode and a visual mode, the multimodal application operatively coupled to a grammar interpreter and configured to enable a user of the multimodal application to select or deselect multiple items in a selection list using a single utterance, the method comprising;
accepting, by the multimodal application, speech input corresponding to the single utterance for selecting or deselecting one or more items in the selection list;
providing, from the multimodal application to the grammar interpreter, the speech input and a speech recognition grammar associated with the selection list;
receiving, by the multimodal application from the grammar interpreter, interpretation results, the interpretation results including at least one matched word from the grammar that identifies at least one item in the selection list and a separate indication of whether to select or deselect the at least one item in the selection list, wherein the separate indication is based, at least in part, on the speech input; and
selecting or deselecting based, at least in part, on the separate indication, the at least one item in the selection list that corresponds to the at least one matched word.
3 Assignments
0 Petitions
Accused Products
Abstract
Methods, apparatus, and computer program products are described for automatic speech recognition (‘ASR’) that include accepting by the multimodal application speech input and visual input for selecting or deselecting items in a selection list, the speech input enabled by a speech recognition grammar; providing, from the multimodal application to the grammar interpreter, the speech input and the speech recognition grammar; receiving, by the multimodal application from the grammar interpreter, interpretation results including matched words from the grammar that correspond to items in the selection list and a semantic interpretation token that specifies whether to select or deselect items in the selection list; and determining, by the multimodal application in dependence upon the value of the semantic interpretation token, whether to select or deselect items in the selection list that correspond to the matched words.
-
Citations
18 Claims
-
1. A method of automatic speech recognition (‘
- ASR’
), the method implemented with a speech recognition grammar of a multimodal application, with the multimodal application operating on a multimodal device supporting multiple modes of user interaction with the multimodal application, the modes of user interaction including a voice mode and a visual mode, the multimodal application operatively coupled to a grammar interpreter and configured to enable a user of the multimodal application to select or deselect multiple items in a selection list using a single utterance, the method comprising;accepting, by the multimodal application, speech input corresponding to the single utterance for selecting or deselecting one or more items in the selection list; providing, from the multimodal application to the grammar interpreter, the speech input and a speech recognition grammar associated with the selection list; receiving, by the multimodal application from the grammar interpreter, interpretation results, the interpretation results including at least one matched word from the grammar that identifies at least one item in the selection list and a separate indication of whether to select or deselect the at least one item in the selection list, wherein the separate indication is based, at least in part, on the speech input; and selecting or deselecting based, at least in part, on the separate indication, the at least one item in the selection list that corresponds to the at least one matched word. - View Dependent Claims (2, 3, 4, 5, 6)
- ASR’
-
7. Apparatus for automatic speech recognition (‘
- ASR’
) for use with a speech recognition grammar of a multimodal application, with the multimodal application operating on a multimodal device supporting multiple modes of user interaction with the multimodal application, the modes of user interaction including a voice mode and a visual mode, the multimodal application operatively coupled to a grammar interpreter and configured to enable a user of the multimodal application to select or deselect multiple items in a selection list using a single utterance, the apparatus comprising;a computer processor; and a computer memory operatively coupled to the computer processor, the computer memory storing a computer program that, when executed by the computer processor, performs a method comprising; accepting by the multimodal application speech input corresponding to the single utterance for selecting or deselecting one or more items in the selection list; providing, from the multimodal application to the grammar interpreter, the speech input and a speech recognition grammar associated with the selection list; receiving, by the multimodal application from the grammar interpreter, interpretation results, the interpretation results including at least one matched word from the grammar that identifies at least one item in the selection list and a separate indication of whether to select or deselect the at least one item in the selection list, wherein the separate indication is based, at least in part, on the speech input; and selecting or deselecting, based, at least in part, on the separate indication, the at least one item in the selection list that corresponds to the at least one matched word. - View Dependent Claims (8, 9, 10, 11, 12)
- ASR’
-
13. A computer-readable recordable medium encoded with a plurality of instructions that, when executed by a computer, perform a method comprising:
-
accepting, by a multimodal application, speech input for incrementally selecting or deselecting at least one item in a selection list, wherein the speech input includes an indication of whether to select or deselect the at least one item; providing, from the multimodal application to a grammar interpreter, the speech input and a speech recognition grammar associated with the selection list; receiving, by the multimodal application from the grammar interpreter, interpretation results, the interpretation results including at least one matched word from the grammar that identifies at least one item in the selection list and a separate indication of whether to select or deselect the at least one item in the selection list, wherein the separate indication is based, at least in part, on the indication in the speech input of whether to select or deselect the at least one item; and selecting or deselecting based, at least in part, on the separate indication, the at least one item in the selection list that corresponds to the at least one matched word without first deselecting all previously selected items in the selection list. - View Dependent Claims (14, 15, 16, 17, 18)
-
Specification