Automatic multimodal enabling of existing web content

US 20050273487A1
Filed: 07/30/2004
Published: 12/08/2005
Est. Priority Date: 06/04/2004
Status: Abandoned Application

First Claim

Patent Images

1. A method of providing multimodality for an existing web content, comprising:

loading a web page by a browser in a user device;

generating grammar for the loaded web page by a software agent;

displaying the loaded web page to a user;

recognizing at least one user input; and

navigating the browser based on the recognized user input, wherein when the at least one user input is voice input, recognizing the voice input based on the generated grammar and navigating the browser based on the recognized user input and the generated grammar.

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A system and a method for enabling existing web content to become multimodal. The system has a browser providing a user with markup language web pages. In addition, the system has an agent for creating dynamic grammar for a web page loaded by the browser. The dynamic grammar has one or more commands and one or more corresponding labels. A command is a markup language tag or a markup object used to navigate the browser and a label is content text that corresponds to the command. The system also includes a speech recognition engine, which receives user voice input and compares the received input to the labels in the dynamic grammar. When the speech recognition engine finds a match, the speech recognition engine transmits the corresponding command to the agent and the agent navigates the browser using the command.

41 Citations

View as Search Results

40 Claims

1. A method of providing multimodality for an existing web content, comprising:
- loading a web page by a browser in a user device;
  
  generating grammar for the loaded web page by a software agent;
  
  displaying the loaded web page to a user;
  
  recognizing at least one user input; and
  
  navigating the browser based on the recognized user input, wherein when the at least one user input is voice input, recognizing the voice input based on the generated grammar and navigating the browser based on the recognized user input and the generated grammar.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20)
- - 2. The method according to claim 1, wherein when the recognized user input is received via mouse, stylus or keyboard, navigating the browser using graphic user interface.
  - 3. The method according to claim 2, further comprising parsing the loaded web page for commands, and generating the grammar based on the extracted commands and corresponding labels.
  - 4. The method according to claim 3, wherein when the user voice input is the label, the extracted command is transmitted to the browser, and based on the command the browser is navigated.
  - 5. The method according to claim 4, wherein the extracted command is a markup language tag.
  - 6. The method according to claim 5, wherein the loaded web page is written in a markup language.
  - 7. The method according to claim 6, wherein the loaded web page is written in at least one of a hyper text markup language, a wireless markup language and an extensible markup language.
  - 8. The method according to claim 2, wherein the wireless device is a mobile phone.
  - 9. The method according to claim 2, wherein the wireless device is one of a pocketPC, a Bluetooth enabled device, a WiFi enabled device or a GPRS terminal.
  - 10. The method according to claim 2, wherein for each loaded web page new grammar is generated.
  - 11. The method according to claim 2, wherein the grammar is generated at run time when the browser requests a new web page, the grammar is generated for recognizing the user voice input and wherein the web page has dynamic content.
  - 12. The method according to claim 2, wherein said software agent comprises a client agent and a server agent.
  - 13. The method according to claim 12, wherein the client agent informs the server agent of the loading web page and wherein the server agent generates the grammar.
  - 14. The method according to claim 13, wherein the client agent sends an address of the web page being loaded to the server agent.
  - 15. The method according to claim 14, wherein the server agent parses the loaded web page for commands, and generates the grammar based on the extracted commands, and wherein the server agent transmits the generated grammar to a speech recognition engine.
  - 16. The method according to claim 15, wherein the speech recognition engine recognizes the user voice input based on the grammar from the server agent, and transmits a command based on the recognized input to the browser, and the browser is navigated based on the command.
  - 17. The method according to claim 2, wherein said loaded web page is a page from a web application.
  - 18. The method according to claim 2, wherein said loaded web page is a dynamic web page and said grammar is generated when the web page is being loaded.
  - 19. The method according to claim 2, wherein said loaded web page is a dynamic web page displayed to a user, and wherein the grammar is generated after the web page is loaded.
  - 20. The method according to claim 19, wherein when the grammar is generated, an indicating means indicates that the web page is voice enabled.

21. A system for enabling existing web content to become multimodal, comprising:
- a browser providing a user with a markup language web pages;
  
  an agent creating dynamic grammar for a web page loaded by the browser, the dynamic grammar comprises at least one command and at least one corresponding label; and
  
  a speech recognition engine receiving user voice input and comparing the received input to the at least one label in the dynamically generated grammar, wherein when the speech recognition engine finds a match, the speech recognition engine transmits the corresponding command to the agent and wherein the agent navigates the browser using the command.
- View Dependent Claims (22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40)
- - 22. The system according to claim 21, wherein the user is enabled to navigate the browser via a mouse, a keyboard, a stylus and voice.
  - 23. The system according to claim 22, wherein the browser is in a wireless device and wherein for each loaded web page new grammar is generated.
  - 24. The system according to claim 22, wherein the extracted command is a markup language tag.
  - 25. The system according to claim 22, wherein the loaded web page is written in a text markup language.
  - 26. The system according to claim 25, wherein the loaded web page is written in at least one of a hyper text markup language, a wireless markup language and an extensible markup language.
  - 27. The system according to claim 22, wherein the wireless device is a mobile phone.
  - 28. The system according to claim 22, wherein the wireless device is one of a pocketPC, a Bluetooth enabled device, a WiFi enabled device and a GPRS terminal.
  - 29. The system according to claim 22, wherein the grammar is dynamic grammar generated at run time and wherein the web page has dynamic contents.
  - 30. The system according to claim 29, wherein the dynamic contents is time sensitive information.
  - 31. The system according to claim 30, wherein the time sensitive information comprises news stories, weather information, financial news, and sports scores.
  - 32. The system according to claim 29, wherein the dynamic grammar is generated at runtime for an email application.
  - 33. The system according to claim 22, wherein the agent is a software agent comprising a client agent in the wireless device and a server agent.
  - 34. The system according to claim 33, wherein the client agent informs the server agent when the web page is loaded by the browser, and in response the server agent requests the web page from a web server or an application specific server via an IP network, and when the page is received by the server agent, the server agent parses the page creating the dynamic grammar.
  - 35. The system according to claim 34, wherein the server agent passes the dynamic grammar to the speech recognition engine.
  - 36. The system according to claim 34, wherein the server agent and the speech recognition engine is in the same server, and wherein the client agent and the browser are in the same wireless device remote from the server.
  - 37. The system according to claim 34, wherein the speech recognition engine transmits a command that corresponds to the label spoken by the user and recognized by the speech recognition engine, the command is transmitted to the client agent and wherein the client agent navigates the browser using the command.
  - 38. The system according to claim 34, wherein the speech recognition engine receives the user voice input from the wireless device, and the dynamic grammar from the service agent located in a remote server.
  - 39. The system according to claim 38, wherein the web page comprises at least one of:
    - an HTML page, a WML page, and an XML page.
  - 40. The system according to claim 21, wherein the web page is an HTML encrypted page.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Comverse Limited (Mavenir Group Holdings Ltd.)
Original Assignee
Comverse Limited (Mavenir Group Holdings Ltd.)
Inventors
Mayblum, Amir, Cogan, Michael

Application Number

US10/902,063
Publication Number

US 20050273487A1
Time in Patent Office

Days
Field of Search
US Class Current

709/202
CPC Class Codes

H04L 65/1101   Session protocols

H04L 67/02   based on web technology, e....

H04L 67/75   Indicating network or usage...

H04L 69/329   in the application layer [O...

Automatic multimodal enabling of existing web content

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

41 Citations

40 Claims

Specification

Solutions

Use Cases

Quick Links

Automatic multimodal enabling of existing web content

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

41 Citations

40 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links