Disambiguating a speech recognition grammar in a multimodal application
Abstract
Disambiguating a speech recognition grammar in a multimodal application, the multimodal application including voice activated hyperlinks, the voice activated hyperlinks voice enabled by a speech recognition grammar characterized by ambiguous terminal grammar elements, including maintaining by the multimodal browser a record of visibility of each voice activated hyperlink, the record of visibility including current visibility and past visibility on a display of the multimodal device of each voice activated hyperlink, the record of visibility further including an ordinal indication, for each voice activated hyperlink scrolled off display, of the sequence in which each such voice activated hyperlink was scrolled off display; recognizing by the multimodal browser speech from a user matching an ambiguous terminal element of the speech recognition grammar; selecting by the multimodal browser a voice activated hyperlink for activation, the selecting carried out in dependence upon the recognized speech and the record of visibility.
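The three steps summarized above (maintaining a record of visibility, recognizing speech that matches an ambiguous terminal element, and selecting a hyperlink) can be illustrated with a minimal Python sketch. It is not the patented implementation. The names `VisibilityEntry`, `VisibilityRecord`, `update`, and `select_link` are hypothetical, and the selection policy shown (prefer a currently visible link, otherwise the link most recently scrolled off display) is an assumption made for illustration; the text above requires only that selection depend on the recognized speech and the record of visibility.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class VisibilityEntry:
    """Visibility state for one voice activated hyperlink (hypothetical layout)."""
    currently_visible: bool = False
    previously_visible: bool = False
    # Ordinal position in the scroll-off sequence; None until the link scrolls off.
    scrolled_off_order: Optional[int] = None


@dataclass
class VisibilityRecord:
    """Record of current visibility, past visibility, and scroll-off order."""
    entries: Dict[str, VisibilityEntry] = field(default_factory=dict)
    _next_order: int = 1

    def update(self, link_id: str, visible: bool) -> None:
        """Called by the browser whenever a link enters or leaves the display."""
        entry = self.entries.setdefault(link_id, VisibilityEntry())
        if visible:
            entry.currently_visible = True
            entry.previously_visible = True
        elif entry.currently_visible:
            # The link just left the display: stamp it with the next ordinal.
            entry.currently_visible = False
            entry.scrolled_off_order = self._next_order
            self._next_order += 1


def select_link(candidates: List[str], record: VisibilityRecord) -> Optional[str]:
    """Pick one link among those matched by the ambiguous recognized speech.

    Assumed policy: a currently visible candidate wins; otherwise the
    candidate with the highest scroll-off ordinal (most recently off display).
    """
    known = [c for c in candidates if c in record.entries]
    visible = [c for c in known if record.entries[c].currently_visible]
    if visible:
        return visible[0]
    scrolled = [c for c in known
                if record.entries[c].scrolled_off_order is not None]
    if scrolled:
        return max(scrolled, key=lambda c: record.entries[c].scrolled_off_order)
    return None


# Two links share the same spoken phrase; both are shown, then scrolled away.
record = VisibilityRecord()
record.update("link-a", True)
record.update("link-b", True)
record.update("link-a", False)   # scrolled off first
record.update("link-b", False)   # scrolled off second
print(select_link(["link-a", "link-b"], record))  # prints link-b

record.update("link-a", True)    # link-a scrolls back into view
print(select_link(["link-a", "link-b"], record))  # prints link-a
```

In this sketch the ordinal is a simple counter that increases each time a link leaves the display, so comparing ordinals is enough to recover the sequence in which links were scrolled off.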
18 Claims
1. A method of disambiguating a speech recognition grammar in a multimodal application, the multimodal application including voice activated hyperlinks, the voice activated hyperlinks being voice enabled by a speech recognition grammar comprising ambiguous terminal grammar elements, the multimodal application being operable in a multimodal browser on a multimodal device supporting multiple modes of user interaction with the multimodal device, the modes of user interaction including a voice mode and a visual mode, the multimodal browser being operatively coupled to a grammar interpreter, the method comprising:
maintaining by the multimodal browser a record of visibility of each voice activated hyperlink, the record of visibility including current visibility and past visibility on a display of the multimodal device of each voice activated hyperlink, the record of visibility further including an ordinal indication, for each voice activated hyperlink scrolled off display, of the sequence in which each such voice activated hyperlink was scrolled off display;
recognizing by the multimodal browser speech from a user matching an ambiguous terminal element of the speech recognition grammar; and
selecting by the multimodal browser a voice activated hyperlink for activation, the selecting being carried out in dependence upon the recognized speech and the record of visibility.
Dependent claims: 2, 3, 4, 5, 6
7. Apparatus for disambiguating a speech recognition grammar in a multimodal application, the multimodal application including voice activated hyperlinks, the voice activated hyperlinks being voice enabled by a speech recognition grammar comprising ambiguous terminal grammar elements, the multimodal application being operable in a multimodal browser on a multimodal device supporting multiple modes of user interaction with the multimodal device, the modes of user interaction including a voice mode and a visual mode, the multimodal browser being operatively coupled to a grammar interpreter, the apparatus comprising a computer processor and a computer memory operatively coupled to the computer processor, the computer memory having disposed within it computer program instructions capable of:
maintaining by the multimodal browser a record of visibility of each voice activated hyperlink, the record of visibility including current visibility and past visibility on a display of the multimodal device of each voice activated hyperlink, the record of visibility further including an ordinal indication, for each voice activated hyperlink scrolled off display, of the sequence in which each such voice activated hyperlink was scrolled off display;
recognizing by the multimodal browser speech from a user matching an ambiguous terminal element of the speech recognition grammar; and
selecting by the multimodal browser a voice activated hyperlink for activation, the selecting being carried out in dependence upon the recognized speech and the record of visibility.
Dependent claims: 8, 9, 10, 11, 12
13. A computer program product for disambiguating a speech recognition grammar in a multimodal application, the multimodal application including voice activated hyperlinks, the voice activated hyperlinks being voice enabled by a speech recognition grammar comprising ambiguous terminal grammar elements, the multimodal application being operable in a multimodal browser on a multimodal device supporting multiple modes of user interaction with the multimodal device, the modes of user interaction including a voice mode and a visual mode, the multimodal browser being operatively coupled to a grammar interpreter, the computer program product disposed upon at least one recordable computer-readable medium, the computer program product comprising computer program instructions capable of:
maintaining by the multimodal browser a record of visibility of each voice activated hyperlink, the record of visibility including current visibility and past visibility on a display of the multimodal device of each voice activated hyperlink, the record of visibility further including an ordinal indication, for each voice activated hyperlink scrolled off display, of the sequence in which each such voice activated hyperlink was scrolled off display;
recognizing by the multimodal browser speech from a user matching an ambiguous terminal element of the speech recognition grammar; and
selecting by the multimodal browser a voice activated hyperlink for activation, the selecting being carried out in dependence upon the recognized speech and the record of visibility.
Dependent claims: 14, 15, 16, 17, 18
Specification