VOICE RECOGNITION OF COMMANDS EXTRACTED FROM USER INTERFACE SCREEN DEVICES
First Claim
1. A method comprising:
using a computing system having at least one processor to perform a process, the process comprising:
identifying a markup language description of a user interface page;
identifying a voice command mapping for the user interface page, wherein the voice command mapping is generated by parsing the markup language description to identify at least one user interface object configured to perform at least one operation responsive to a keyboard or mouse or pointing device, wherein the parsing action does not create a modified version of the page;
receiving an utterance to be converted to a text representation of the utterance;
determining a plurality of matches between the text representation of the utterance and multiple matching voice commands; and
performing a confirmation of a single matching voice command from among the plurality of matches.
Abstract
A method, system, and computer program product for human interface design. Embodiments proceed upon receiving a markup language description of user interface pages (e.g., HTML pages), then, without modifying the user interface page, parsing the markup language description to identify user interface objects configured to perform an operation responsive to a keyboard or mouse or pointing device. One or more mapping techniques serve to relate the parsed-out operation(s) to one or more voice commands. In some embodiments, the parser recognizes interface objects in forms such as a button, a textbox, a checkbox, or an option menu, and the voice commands correspond to an aspect that is displayed when rendering the interface object (e.g., a button label, a menu option, etc.). After receiving a user utterance, the utterance is converted into a text representation which in turn is mapped to voice commands that were parsed from the user interface page.
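The parsing and mapping steps described in the abstract can be illustrated with a minimal sketch. It assumes an HTML page and uses Python's standard `html.parser` module; the names `VoiceCommandParser` and `build_voice_command_mapping`, and the choice of the lowercased visible label as the spoken phrase, are hypothetical and are not taken from the patent. The markup is only read, never rewritten.

```python
from html.parser import HTMLParser


class VoiceCommandParser(HTMLParser):
    """Collects spoken phrases for interactive elements; the markup is only read."""

    def __init__(self):
        super().__init__()
        self.commands = {}     # spoken phrase -> (element kind, element id or value)
        self._pending = None   # button/option still waiting for its label text

    def handle_starttag(self, tag, attrs):
        a = dict(attrs)
        if tag in ("button", "option"):
            # The label is the text inside the element, seen later in handle_data.
            self._pending = (tag, a.get("id") or a.get("value"))
        elif tag == "input":
            kind = a.get("type", "text")
            # Assumption: use the value or name attribute as the spoken phrase.
            label = a.get("value") or a.get("name")
            if kind in ("submit", "button", "checkbox", "text") and label:
                self.commands[label.strip().lower()] = (kind, a.get("id"))

    def handle_data(self, data):
        text = data.strip()
        if self._pending and text:
            self.commands[text.lower()] = self._pending
            self._pending = None

    def handle_endtag(self, tag):
        if tag in ("button", "option"):
            self._pending = None


def build_voice_command_mapping(markup):
    """Parse the markup and return a phrase -> target mapping without altering the page."""
    parser = VoiceCommandParser()
    parser.feed(markup)
    parser.close()
    return parser.commands
```

Calling `build_voice_command_mapping` on a page containing a button labeled "Save" would yield an entry keyed by the phrase "save" that points back at that button, which a later step could use to trigger the same operation a mouse click would.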
20 Claims
1. A method comprising:
using a computing system having at least one processor to perform a process, the process comprising:
identifying a markup language description of a user interface page;
identifying a voice command mapping for the user interface page, wherein the voice command mapping is generated by parsing the markup language description to identify at least one user interface object configured to perform at least one operation responsive to a keyboard or mouse or pointing device, wherein the parsing action does not create a modified version of the page;
receiving an utterance to be converted to a text representation of the utterance;
determining a plurality of matches between the text representation of the utterance and multiple matching voice commands; and
performing a confirmation of a single matching voice command from among the plurality of matches.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
10. A computer program product embodied in a non-transitory computer readable medium, the computer readable medium having stored thereon a sequence of instructions which, when executed by a processor causes the processor to execute a process, the process comprising:
identifying a markup language description of a user interface page;
identifying a voice command mapping for the user interface page, wherein the voice command mapping is generated by parsing the markup language description to identify at least one user interface object configured to perform at least one operation responsive to a keyboard or mouse or pointing device, wherein the parsing action does not create a modified version of the page;
receiving an utterance to be converted to a text representation of the utterance;
determining a plurality of matches between the text representation of the utterance and multiple matching voice commands; and
performing a confirmation of a single matching voice command from among the plurality of matches.
Dependent claims: 11, 12, 13, 14, 15, 16, 17
18. A computer system comprising:
a parser module to identify a markup language description of a user interface page and to identify a voice command mapping for the user interface page, wherein the voice command mapping is generated by parsing the markup language description to identify at least one user interface object configured to perform at least one operation responsive to a keyboard or mouse or pointing device, wherein the parsing does not create a modified version of the page;
a receiving module to receive an utterance to be converted to a text representation of the utterance wherein the utterance is used to determine a plurality of matches between the text representation of the utterance and multiple matching voice commands; and
a confirmation module to perform a confirmation of a single matching voice command from among the plurality of matches.
Dependent claims: 19, 20
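The matching and confirmation steps recited in the independent claims can be sketched as follows. This is only an illustration under stated assumptions: the word-containment rule in `find_matches` and the caller-supplied `choose` callback in `confirm_single` are hypothetical stand-ins, since the claims do not specify how matches are scored or how the confirmation is presented to the user.

```python
def find_matches(utterance_text, commands):
    """Return every command phrase that contains all words of the utterance text."""
    words = utterance_text.lower().split()
    if not words:
        return []
    # Assumption: a command matches when each spoken word appears in its phrase.
    return [c for c in commands if all(w in c.lower().split() for w in words)]


def confirm_single(matches, choose):
    """Reduce the candidate matches to one command.

    `choose` is a hypothetical confirmation step (for example, a prompt that
    asks the user which command was meant); it is consulted only when more
    than one candidate remains.
    """
    if not matches:
        return None
    if len(matches) == 1:
        return matches[0]
    return choose(matches)
```

With the commands "save draft", "save and close", and "cancel", the utterance "save" produces two candidates, so the confirmation callback decides between them, whereas "cancel" resolves directly without asking.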
Specification