Methods and apparatus for voiced-enabling a web application
First Claim
1. A method of enabling voice interaction for invoking at least one capability of a web application including at least one web page rendered by a web browser, the method comprising:
- detecting, by an agent associated with the web browser, a first document object model event;
analyzing, in response to detecting the first document object model event, a document object model of the at least one web page to identify one or more items in the document object model at a first point in time;
determining based, at least in part, on the identified one or more items, that the at least one web page comprises the at least one capability at the first point in time;
enabling voice input to invoke the at least one capability of the web application in response to the identifying that the at least one web page comprises the at least one capability at the first point in time, wherein enabling voice input comprises updating at least one grammar associated with a speech engine based, at least in part, on the one or more items identified in the document object model at the first point in time;
detecting, by the agent, a second document object model event indicating that a context of the web application has changed since the first point in time;
analyzing, in response to detecting the second document object model event, the document object model of the at least one web page to identify at least one new item in the document object model at a second point in time; and
updating the at least one grammar based, at least in part, on the at least one new item identified in the document object model at the second point in time.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods and apparatus for voice-enabling a web application, wherein the web application includes one or more web pages rendered by a web browser on a computer. At least one information source external to the web application is queried to determine whether information describing a set of one or more supported voice interactions for the web application is available, and in response to determining that the information is available, the information is retrieved from the at least one information source. Voice input for the web application is then enabled based on the retrieved information.
-
Citations
11 Claims
-
1. A method of enabling voice interaction for invoking at least one capability of a web application including at least one web page rendered by a web browser, the method comprising:
-
detecting, by an agent associated with the web browser, a first document object model event; analyzing, in response to detecting the first document object model event, a document object model of the at least one web page to identify one or more items in the document object model at a first point in time; determining based, at least in part, on the identified one or more items, that the at least one web page comprises the at least one capability at the first point in time; enabling voice input to invoke the at least one capability of the web application in response to the identifying that the at least one web page comprises the at least one capability at the first point in time, wherein enabling voice input comprises updating at least one grammar associated with a speech engine based, at least in part, on the one or more items identified in the document object model at the first point in time; detecting, by the agent, a second document object model event indicating that a context of the web application has changed since the first point in time; analyzing, in response to detecting the second document object model event, the document object model of the at least one web page to identify at least one new item in the document object model at a second point in time; and updating the at least one grammar based, at least in part, on the at least one new item identified in the document object model at the second point in time. - View Dependent Claims (2, 3, 4)
-
-
5. A non-transitory computer-readable storage medium encoded with a plurality of instructions that, when executed by a computer, performs a method of enabling voice interaction for invoking at least one capability of a web application including at least one web page rendered by a web browser, the method comprising:
-
detecting, by an agent associated with the web browser, a first document object model event; analyzing, in response to detecting the first document object model event, a document object model of the at least one web page to identify one or more items in the document object model at a first point in time; determining based, at least in part, on the identified one or more items, that the at least one web page comprises the at least one capability at the first point in time; enabling voice input to invoke the at least one capability of the web application in response to the identifying that the at least one web page comprises the at least one capability at the first point in time, wherein enabling voice input comprises updating at least one grammar associated with a speech engine based, at least in part, on the one or more items identified in the document object model at the first point in time; detecting, by the agent, a second document object model event indicating that a context of the web application has changed since the first point in time; analyzing, in response to detecting the second document object model event, the document object model of the at least one web page to identify at least one new item in the document object model at a second point in time; and updating the at least one grammar based, at least in part, on the at least one new item identified in the document object model at the second point in time. - View Dependent Claims (6, 7, 8)
-
-
9. A computer, comprising:
at least one processor programmed to; detect a first document object model event; analyze, in response to detecting the first document object model event, a document object model of the at least one web page to identify one or more items in the document object model at a first point in time; determine based, at least in part, on the identified one or more items, that the at least one web page comprises the at least one capability at the first point in time enable voice input to invoke the at least one capability of the web application in response to the identifying that the at least one web page comprises the at least one capability at the first point in time, wherein enabling voice input comprises updating at least one grammar associated with a speech engine based, at least in part, on the one or more items identified in the document object model at the first point in time; detect a second document object model event indicating that a context of the web application has changed since the first point in time; analyze, in response to detecting the second document object model event, the document object model of the at least one web page to identify at least one new item in the document object model at a second point in time; and update the at least one grammar based, at least in part, on the at least one new item identified in the document object model at the second point in time. - View Dependent Claims (10, 11)
Specification