Automatic multimodal enabling of existing web content
First Claim
1. A method of providing multimodality for an existing web content, comprising:
- loading a web page by a browser in a user device;
generating grammar for the loaded web page by a software agent;
displaying the loaded web page to a user;
recognizing at least one user input; and
navigating the browser based on the recognized user input, wherein when the at least one user input is voice input, recognizing the voice input based on the generated grammar and navigating the browser based on the recognized user input and the generated grammar.
1 Assignment
0 Petitions
Accused Products
Abstract
A system and a method for enabling existing web content to become multimodal. The system has a browser providing a user with markup language web pages. In addition, the system has an agent for creating dynamic grammar for a web page loaded by the browser. The dynamic grammar has one or more commands and one or more corresponding labels. A command is a markup language tag or a markup object used to navigate the browser and a label is content text that corresponds to the command. The system also includes a speech recognition engine, which receives user voice input and compares the received input to the labels in the dynamic grammar. When the speech recognition engine finds a match, the speech recognition engine transmits the corresponding command to the agent and the agent navigates the browser using the command.
41 Citations
40 Claims
-
1. A method of providing multimodality for an existing web content, comprising:
-
loading a web page by a browser in a user device;
generating grammar for the loaded web page by a software agent;
displaying the loaded web page to a user;
recognizing at least one user input; and
navigating the browser based on the recognized user input, wherein when the at least one user input is voice input, recognizing the voice input based on the generated grammar and navigating the browser based on the recognized user input and the generated grammar. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. A system for enabling existing web content to become multimodal, comprising:
-
a browser providing a user with a markup language web pages;
an agent creating dynamic grammar for a web page loaded by the browser, the dynamic grammar comprises at least one command and at least one corresponding label; and
a speech recognition engine receiving user voice input and comparing the received input to the at least one label in the dynamically generated grammar, wherein when the speech recognition engine finds a match, the speech recognition engine transmits the corresponding command to the agent and wherein the agent navigates the browser using the command. - View Dependent Claims (22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40)
-
Specification