Speech-enabled web content searching using a multimodal browser
First Claim
1. A method of speech-enabled searching of web content using a multimodal browser, the method implemented with one or more grammars in an automatic speech recognition (‘ASR’) engine, with the multimodal browser operating on a multimodal device supporting multiple modes of interaction including a voice mode and one or more non-voice modes, the multimodal browser operatively coupled to the ASR engine, the method comprising:
rendering, by the multimodal browser, web content;
searching, by the multimodal browser, the rendered web content for a search phrase, including matching the search phrase to at least one portion of the rendered web content, yielding a matched search result, the search phrase specified by a first voice utterance received from a user and a search grammar; and
in response to a second voice utterance received from the user:
using an action grammar comprising one or more entries to recognize the second voice utterance as corresponding to a first entry of the one or more entries, the action grammar specifying,
for the first entry of the one or more entries, an associated first action to be taken in dependence upon the matched search result, and
for a second entry of the one or more entries, an associated second action to be taken in dependence upon the same matched search result, the second action being different from the first action, and
performing, by the multimodal browser, the first action in dependence upon the matched search result associated with the first entry.
3 Assignments
0 Petitions
Abstract
Speech-enabled web content searching using a multimodal browser implemented with one or more grammars in an automatic speech recognition (‘ASR’) engine, with the multimodal browser operating on a multimodal device supporting multiple modes of interaction including a voice mode and one or more non-voice modes, the multimodal browser operatively coupled to the ASR engine, includes: rendering, by the multimodal browser, web content; searching, by the multimodal browser, the web content for a search phrase, including yielding a matched search result, the search phrase specified by a first voice utterance received from a user and a search grammar; and performing, by the multimodal browser, an action in dependence upon the matched search result, the action specified by a second voice utterance received from the user and an action grammar.
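The flow summarized above (search the rendered content for a spoken phrase, then act on the matched result in response to a second utterance) can be illustrated with a minimal sketch. It is not the patented implementation: utterances are assumed to arrive as already-recognized text, the "search for" carrier phrase stands in for a search grammar, and the names `Match`, `parse_search_utterance`, `search_rendered_text`, `ACTION_GRAMMAR`, and `perform_action` are hypothetical.

```python
import re
from dataclasses import dataclass
from typing import Callable, Dict, Optional


@dataclass
class Match:
    """A matched search result: where the phrase occurs in the rendered text."""
    start: int
    end: int


# Hypothetical stand-in for a search grammar: a fixed carrier phrase
# followed by the words to look for.
SEARCH_PREFIX = "search for "


def parse_search_utterance(utterance: str) -> Optional[str]:
    """Extract the search phrase from a recognized first utterance, if any."""
    text = utterance.strip().lower()
    if text.startswith(SEARCH_PREFIX):
        phrase = text[len(SEARCH_PREFIX):].strip()
        return phrase or None
    return None


def search_rendered_text(rendered_text: str, phrase: str) -> Optional[Match]:
    """Match the phrase against the rendered content (case-insensitive)."""
    found = re.search(re.escape(phrase), rendered_text, flags=re.IGNORECASE)
    if found is None:
        return None
    return Match(start=found.start(), end=found.end())


def read_aloud(rendered_text: str, match: Match) -> str:
    """First action (assumed): describe speaking the matched text."""
    return "speak: " + rendered_text[match.start:match.end]


def highlight(rendered_text: str, match: Match) -> str:
    """Second action (assumed): mark the matched text in the content."""
    return (rendered_text[:match.start] + "[["
            + rendered_text[match.start:match.end] + "]]"
            + rendered_text[match.end:])


# Hypothetical action grammar: each entry names a different action that
# operates on the same matched search result.
ACTION_GRAMMAR: Dict[str, Callable[[str, Match], str]] = {
    "read it": read_aloud,
    "highlight it": highlight,
}


def perform_action(utterance: str, rendered_text: str,
                   match: Match) -> Optional[str]:
    """Recognize the second utterance against the action grammar and act."""
    action = ACTION_GRAMMAR.get(utterance.strip().lower())
    return action(rendered_text, match) if action else None
```

In this sketch, a first utterance such as "search for voice" yields the phrase and a `Match`, and a second utterance such as "highlight it" or "read it" selects which action is applied to that same match. A real system would delegate recognition to the ASR engine and its grammars rather than to string comparison.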
347 Citations
20 Claims
1. A method of speech-enabled searching of web content using a multimodal browser, the method implemented with one or more grammars in an automatic speech recognition (‘ASR’) engine, with the multimodal browser operating on a multimodal device supporting multiple modes of interaction including a voice mode and one or more non-voice modes, the multimodal browser operatively coupled to the ASR engine, the method comprising:
rendering, by the multimodal browser, web content;
searching, by the multimodal browser, the rendered web content for a search phrase, including matching the search phrase to at least one portion of the rendered web content, yielding a matched search result, the search phrase specified by a first voice utterance received from a user and a search grammar; and
in response to a second voice utterance received from the user:
using an action grammar comprising one or more entries to recognize the second voice utterance as corresponding to a first entry of the one or more entries, the action grammar specifying,
for the first entry of the one or more entries, an associated first action to be taken in dependence upon the matched search result, and
for a second entry of the one or more entries, an associated second action to be taken in dependence upon the same matched search result, the second action being different from the first action, and
performing, by the multimodal browser, the first action in dependence upon the matched search result associated with the first entry.
Dependent claims: 2, 3, 4, 5, 6, 7
8. Apparatus for speech-enabled searching of web content using a multimodal browser operating on a multimodal device supporting multiple modes of interaction including a voice mode and one or more non-voice modes, the multimodal browser operatively coupled to an automatic speech recognition (‘ASR’) engine, the apparatus comprising:
a computer processor; and
a computer memory operatively coupled to the computer processor, the computer memory having stored thereon computer program instructions that, when executed by the computer processor, perform a method comprising acts of:
rendering, by the multimodal browser, web content;
searching, by the multimodal browser, the rendered web content for a search phrase, including matching the search phrase to at least one portion of the rendered web content, yielding a matched search result, the search phrase specified by a first voice utterance received from a user and a search grammar; and
in response to a second voice utterance received from the user:
using an action grammar comprising one or more entries to recognize the second voice utterance as corresponding to a first entry of the one or more entries, the action grammar specifying,
for the first entry of the one or more entries, an associated first action to be taken in dependence upon the matched search result, and
for a second entry of the one or more entries, an associated second action to be taken in dependence upon the same matched search result, the second action being different from the first action, and
performing, by the multimodal browser, the first action in dependence upon the matched search result associated with the first entry.
Dependent claims: 9, 10, 11, 12, 13
14. A computer-readable recordable medium encoded with instructions that, when executed, perform a method for speech-enabled searching of web content using a multimodal browser operating on a multimodal device supporting multiple modes of interaction including a voice mode and one or more non-voice modes, the multimodal browser operatively coupled to an automatic speech recognition (‘ASR’) engine, the method comprising acts of:
rendering, by the multimodal browser, web content;
searching, by the multimodal browser, the rendered web content for a search phrase, including matching the search phrase to at least one portion of the rendered web content, yielding a matched search result, the search phrase specified by a first voice utterance received from a user and a search grammar; and
in response to a second voice utterance received from the user:
using an action grammar comprising one or more entries to recognize the second voice utterance as corresponding to a first entry of the one or more entries, the action grammar specifying,
for the first entry of the one or more entries, an associated first action to be taken in dependence upon the matched search result, and
for a second entry of the one or more entries, an associated second action to be taken in dependence upon the same matched search result, the second action being different from the first action, and
performing, by the multimodal browser, the first action in dependence upon the matched search result associated with the first entry.
Dependent claims: 15, 16, 17, 18, 19, 20
Specification