Explicitly registering markup based on verbal commands and exploiting audio context
Abstract
A generic way of encoding the information an application needs to register voice commands and enable a speech engine is used to tell a browser what to present to the user and what options the user has for interacting with the application. This is accomplished through enhancements to a markup language that register the voice commands an application needs with the speech engine and enable them, and that provide an audio context for page-scope commands through an added context option, which makes the page much more flexible and usable. With the context option, the action of the application can be altered based on the current audio context. The application remains independent of the browser and separate from interaction with the speech engine. The application can accommodate both verbal and visual interactions by registering the verbal commands and identifying what those commands will translate to.
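The registration idea in the abstract (an application declares its verbal commands and what each one translates to) can be illustrated with a minimal sketch. This is not the patent's markup or API: `CommandRegistry`, its method names, and the example phrases and URLs are all hypothetical, chosen only to show the mapping from a spoken phrase to its translation.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class CommandRegistry:
    """Hypothetical registry mapping a spoken phrase to what it translates to."""

    _commands: Dict[str, str] = field(default_factory=dict)

    def register(self, phrase: str, translates_to: str) -> None:
        # Normalize case and whitespace so recognition results match reliably.
        self._commands[phrase.strip().lower()] = translates_to

    def enabled_phrases(self) -> List[str]:
        # The phrases a speech engine would be asked to listen for.
        return sorted(self._commands)

    def translate(self, recognized: str) -> Optional[str]:
        # Returns None for a phrase the application never registered.
        return self._commands.get(recognized.strip().lower())


# Assumed example data; the phrases and targets are illustrative only.
registry = CommandRegistry()
registry.register("Next Page", "/catalog?page=2")
registry.register("Help", "/help")
```

Under these assumptions the application only supplies phrase and translation pairs; matching recognized speech against them stays on the browser side, which is the separation the abstract describes.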
19 Claims
1. A system for providing context based verbal commands to a multi-modal browser, comprising:
a context-based audio queue ordered based on contents of a page being audibly read by the multi-modal browser to a user;
a store for storing a current context of the audio queue; and
a speech recognition engine for recognizing and registering voice commands, wherein said speech recognition engine compares a current audio context with the context associated with a voice command and causes the browser to perform an action based on the comparison, wherein when a first tag is used to designate the audio context, recognized voice commands associated with the audio context are ignored unless an audio context has been established, and wherein if a context has been established, a Uniform Resource Locator (URL) is followed after appending the current context.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
11. A computer implemented method for providing context based verbal commands to a multi-modal browser, comprising the steps of:
building a context based audio queue based on the contents of markup language page being audibly read by the multi-modal browser to a user;
storing a current context of the audio queue; and
recognizing and registering voice commands, wherein the current audio context is compared with a voice command, thereby causing the multi-modal browser to perform an action based on the comparison, wherein when a first tag is used to designate the audio context, recognized voice commands associated with the audio context are ignored unless an audio context has been established, and wherein if a context has been established, a Uniform Resource Locator (URL) is followed after appending the current context.
Dependent claims: 12, 13, 14, 15, 16, 17, 18, 19
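The steps recited in the independent claims (queue the page contents, store the current context as items are read, ignore a context-dependent command until a context exists, then follow the URL with the context appended) can be sketched as follows. This is a simplified illustration, not the claimed implementation: every name is hypothetical, the comparison step is reduced to checking whether any context has been established, and appending the context as a `context` query parameter is an assumption, since the claims do not say how the context is appended.

```python
from collections import deque
from typing import Deque, Dict, List, Optional, Tuple
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


class ContextStore:
    """Holds the context of the item most recently read aloud (None at first)."""

    def __init__(self) -> None:
        self.current: Optional[str] = None


def build_audio_queue(page_items: List[Tuple[str, str]]) -> Deque[Tuple[str, str]]:
    # page_items are (context, text) pairs in the order they appear on the page.
    return deque(page_items)


def read_next(queue: Deque[Tuple[str, str]], store: ContextStore) -> str:
    # "Reading" an item establishes its context as the current audio context.
    context, text = queue.popleft()
    store.current = context
    return text


def append_context(url: str, context: str) -> str:
    # Assumption: the context is appended as a query parameter named "context".
    parts = urlsplit(url)
    query = parse_qsl(parts.query, keep_blank_values=True)
    query.append(("context", context))
    return urlunsplit(parts._replace(query=urlencode(query)))


def handle_command(recognized: str, commands: Dict[str, str],
                   store: ContextStore) -> Optional[str]:
    # Unregistered commands, and commands spoken before any context has been
    # established, are ignored (None means no URL is followed).
    url = commands.get(recognized)
    if url is None or store.current is None:
        return None
    return append_context(url, store.current)


# Assumed example data; contexts, text, and URL are illustrative only.
queue = build_audio_queue([("item-17", "Blue widget, nine dollars"),
                           ("item-18", "Red widget, twelve dollars")])
store = ContextStore()
commands = {"buy": "http://example.com/order"}

before = handle_command("buy", commands, store)   # spoken before anything is read
read_next(queue, store)                            # first item is read aloud
after = handle_command("buy", commands, store)    # spoken while item-17 is current
```

In this sketch `before` is `None` because no context exists yet, while `after` is `http://example.com/order?context=item-17`, so the same spoken command resolves differently depending on what the browser was reading when it was heard.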
Specification