Speech-Enabled Content Navigation And Control Of A Distributed Multimodal Browser
First Claim
1. A computer-implemented method of speech-enabled content navigation and control of a distributed multimodal browser, the distributed multimodal browser providing an execution environment for a multimodal application, the distributed multimodal browser including a graphical user agent and a voice user agent operatively coupled to the graphical user agent, the graphical user agent operating on a multimodal device supporting multiple modes of user interaction including a voice mode and one or more non-voice modes, the voice user agent operating on a voice server, the method comprising:
transmitting, by the graphical user agent, a link message to the voice user agent, the link message specifying voice commands that control the distributed multimodal browser, the link message also specifying an event corresponding to each voice command;
receiving, by the graphical user agent, a voice utterance from a user, the voice utterance specifying a particular voice command;
transmitting, by the graphical user agent, the voice utterance to the voice user agent for speech recognition by the voice user agent;
receiving, by the graphical user agent, an event message from the voice user agent, the event message specifying a particular event corresponding to the particular voice command specified by the voice utterance; and
controlling, by the graphical user agent, the distributed multimodal browser in dependence upon the particular event.
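The five steps of the claim can be sketched from the graphical user agent's side. This is only an illustration under stated assumptions, not the patented implementation: the class names, the dictionary-shaped link message, the `scroll_down` and `go_back` event names, and the two callables standing in for the network transport to the voice server are all hypothetical and do not come from the claim text.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class LinkMessage:
    # Hypothetical shape: voice command phrase -> event name.
    commands: Dict[str, str]


@dataclass
class EventMessage:
    # Hypothetical shape: the single event chosen by the voice user agent.
    event: str


@dataclass
class GraphicalUserAgent:
    # Stand-ins for the transport to the voice server (assumed, not from the claim).
    send_link: Callable[[LinkMessage], None]
    send_utterance: Callable[[bytes], EventMessage]
    # Event name -> browser action run on the multimodal device.
    actions: Dict[str, Callable[[], None]] = field(default_factory=dict)

    def start(self, commands: Dict[str, str]) -> None:
        # Step 1: transmit the link message (commands and their events).
        self.send_link(LinkMessage(commands))

    def on_utterance(self, audio: bytes) -> str:
        # Steps 2-4: receive audio from the user, forward it for recognition,
        # and receive the event message that comes back.
        message = self.send_utterance(audio)
        # Step 5: control the browser in dependence upon the event.
        self.actions[message.event]()
        return message.event


# Usage with a fake voice user agent that always answers "scroll_down".
history: List[str] = []
received_links: List[LinkMessage] = []
gua = GraphicalUserAgent(
    send_link=received_links.append,
    send_utterance=lambda audio: EventMessage("scroll_down"),
    actions={
        "scroll_down": lambda: history.append("scrolled down"),
        "go_back": lambda: history.append("went back"),
    },
)
gua.start({"scroll down": "scroll_down", "go back": "go_back"})
handled = gua.on_utterance(b"\x00\x01")  # placeholder audio bytes
```

In this sketch the graphical user agent never interprets the audio itself. It forwards the raw bytes and acts only on the event named in the reply, which mirrors the division of labor the claim describes.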
Abstract
Speech-enabled content navigation and control of a distributed multimodal browser is disclosed. The browser provides an execution environment for a multimodal application and includes a graphical user agent (‘GUA’) operating on a multimodal device and a voice user agent (‘VUA’) operating on a voice server. The disclosed method includes: transmitting, by the GUA, a link message to the VUA, the link message specifying voice commands that control the browser and an event corresponding to each voice command; receiving, by the GUA, a voice utterance from a user, the voice utterance specifying a particular voice command; transmitting, by the GUA, the voice utterance to the VUA for speech recognition by the VUA; receiving, by the GUA, an event message from the VUA, the event message specifying a particular event corresponding to the particular voice command; and controlling, by the GUA, the browser in dependence upon the particular event.
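The voice server's role in the exchange can be illustrated with a small stub. It is a sketch under a loud assumption: real speech recognition operates on audio, whereas this stub receives the utterance as already-transcribed text and matches it exactly against the linked commands. The `VoiceUserAgent` class, its method names, and the `{"event": ...}` message shape are hypothetical.

```python
from typing import Dict, Optional


class VoiceUserAgent:
    """Hypothetical voice-server side of the exchange."""

    def __init__(self) -> None:
        self.commands: Dict[str, str] = {}

    def on_link_message(self, commands: Dict[str, str]) -> None:
        # Store the active commands, normalized for case-insensitive matching.
        self.commands = {
            phrase.lower().strip(): event for phrase, event in commands.items()
        }

    def recognize(self, utterance: str) -> Optional[str]:
        # ASSUMPTION: the utterance arrives as text, not audio, and is
        # matched exactly; a real recognizer would use a speech grammar.
        return self.commands.get(utterance.lower().strip())

    def on_utterance(self, utterance: str) -> Optional[Dict[str, str]]:
        event = self.recognize(utterance)
        # Build an event message only when a linked command was recognized.
        return {"event": event} if event is not None else None


vua = VoiceUserAgent()
vua.on_link_message({"Scroll down": "scroll_down", "Go back": "go_back"})
matched = vua.on_utterance("  go BACK ")
unmatched = vua.on_utterance("open the pod bay doors")
```

Returning `None` for an unrecognized utterance is a design choice of this sketch; the source does not say how a failed recognition is reported.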
187 Citations
20 Claims
1. A computer-implemented method of speech-enabled content navigation and control of a distributed multimodal browser, the distributed multimodal browser providing an execution environment for a multimodal application, the distributed multimodal browser including a graphical user agent and a voice user agent operatively coupled to the graphical user agent, the graphical user agent operating on a multimodal device supporting multiple modes of user interaction including a voice mode and one or more non-voice modes, the voice user agent operating on a voice server, the method comprising:
transmitting, by the graphical user agent, a link message to the voice user agent, the link message specifying voice commands that control the distributed multimodal browser, the link message also specifying an event corresponding to each voice command;
receiving, by the graphical user agent, a voice utterance from a user, the voice utterance specifying a particular voice command;
transmitting, by the graphical user agent, the voice utterance to the voice user agent for speech recognition by the voice user agent;
receiving, by the graphical user agent, an event message from the voice user agent, the event message specifying a particular event corresponding to the particular voice command specified by the voice utterance; and
controlling, by the graphical user agent, the distributed multimodal browser in dependence upon the particular event.
Dependent claims: 2, 3, 4, 5, 6, 7
8. A system for speech-enabled content navigation and control of a distributed multimodal browser, the distributed multimodal browser providing an execution environment for a multimodal application, the distributed multimodal browser including a graphical user agent and a voice user agent operatively coupled to the graphical user agent, the graphical user agent operating on a multimodal device supporting multiple modes of user interaction including a voice mode and one or more non-voice modes, the voice user agent operating on a voice server, the system comprising one or more computer processors and computer memory operatively coupled to the computer processors, the computer memory having disposed within it computer program instructions capable of:
transmitting, by the graphical user agent, a link message to the voice user agent, the link message specifying voice commands that control the distributed multimodal browser, the link message also specifying an event corresponding to each voice command;
receiving, by the graphical user agent, a voice utterance from a user, the voice utterance specifying a particular voice command;
transmitting, by the graphical user agent, the voice utterance to the voice user agent for speech recognition by the voice user agent;
receiving, by the graphical user agent, an event message from the voice user agent, the event message specifying a particular event corresponding to the particular voice command specified by the voice utterance; and
controlling, by the graphical user agent, the distributed multimodal browser in dependence upon the particular event.
Dependent claims: 9, 10, 11, 12, 13, 14
15. A computer program product for speech-enabled content navigation and control of a distributed multimodal browser, the distributed multimodal browser providing an execution environment for a multimodal application, the distributed multimodal browser including a graphical user agent and a voice user agent operatively coupled to the graphical user agent, the graphical user agent operating on a multimodal device supporting multiple modes of user interaction including a voice mode and one or more non-voice modes, the voice user agent operating on a voice server, the computer program product disposed upon a computer-readable medium, the computer program product comprising computer program instructions capable of:
transmitting, by the graphical user agent, a link message to the voice user agent, the link message specifying voice commands that control the distributed multimodal browser, the link message also specifying an event corresponding to each voice command;
receiving, by the graphical user agent, a voice utterance from a user, the voice utterance specifying a particular voice command;
transmitting, by the graphical user agent, the voice utterance to the voice user agent for speech recognition by the voice user agent;
receiving, by the graphical user agent, an event message from the voice user agent, the event message specifying a particular event corresponding to the particular voice command specified by the voice utterance; and
controlling, by the graphical user agent, the distributed multimodal browser in dependence upon the particular event.
Dependent claims: 16, 17, 18, 19, 20
Specification