Method For Finding Elements In A Webpage Suitable For Use In A Voice User Interface (Disambiguation)
First Claim
1. A method for providing a voice user interface, comprising:
analyzing a document to identify a plurality of interactive elements in the document, each interactive element of the plurality of interactive elements comprises an associated phrase;
rendering the document to provide a display on a display device, the associated phrases are provided in the display;
comparing a voice command of a user to a plurality of phrases, the plurality of phrases comprise the associated phrases of the plurality of interactive elements;
based on the comparing, determining a matching score for each interactive element indicating a degree of matching of its associated phrase to the voice command;
identifying one of the interactive elements as a closest match to the voice command based on its matching score; and
based on the matching scores, deciding whether to generate a click event for the one of the interactive elements which is the closest match or to initiate a disambiguation process which allows the user to select from among a group of the interactive elements which comprise matching scores which are highest among the plurality of interactive elements.
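The last two steps of the claim, picking a closest match and then choosing between a click and disambiguation, can be illustrated with a minimal sketch. The claim does not state how a score is computed or what rule separates a clear winner from an ambiguous group, so the word-overlap score, the `margin` parameter, and the names `match_score` and `decide` are all assumptions made for illustration.

```python
from typing import Dict, List, Tuple


def match_score(phrase: str, command: str) -> float:
    """Hypothetical score: fraction of the phrase's words that occur in the command."""
    phrase_words = phrase.lower().split()
    command_words = set(command.lower().split())
    if not phrase_words:
        return 0.0
    hits = sum(1 for word in phrase_words if word in command_words)
    return hits / len(phrase_words)


def decide(phrases: Dict[str, str], command: str,
           margin: float = 0.2) -> Tuple[str, List[str]]:
    """Return ("click", [id]), ("disambiguate", [ids...]) or ("none", []).

    `phrases` maps an element id to its associated phrase. The margin rule
    (elements within `margin` of the best score form the group) is an
    assumption, not something the claim specifies.
    """
    scores = {eid: match_score(p, command) for eid, p in phrases.items()}
    ranked = sorted(scores, key=lambda eid: scores[eid], reverse=True)
    if not ranked or scores[ranked[0]] == 0.0:
        return ("none", [])
    best = scores[ranked[0]]
    group = [eid for eid in ranked if best - scores[eid] < margin]
    if len(group) == 1:
        return ("click", group)  # single clear best match
    return ("disambiguate", group)  # several elements score about equally


if __name__ == "__main__":
    links = {"a": "sports news", "b": "world news", "c": "contact us"}
    print(decide(links, "sports news"))
    print(decide(links, "news"))
```

With these assumed rules, a command that fully matches one phrase yields a click, while a command that matches two phrases equally well returns both elements for the user to choose between.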
Abstract
A disambiguation process for a voice interface for web pages or other documents. The process identifies interactive elements such as links, obtains one or more phrases of each interactive element, such as link text, title text and alternative text for images, and adds the phrases to a grammar which is used for speech recognition. A group of interactive elements is identified as potential best matches to a voice command when there is no single, clear best match. The disambiguation process modifies a display of the document to provide unique labels for each interactive element in the group, and the user is prompted to provide a subsequent spoken command to identify one of the unique labels. The selected unique label is identified and a click event is generated for the corresponding interactive element.
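As a rough illustration of the phrase-gathering and labeling steps described in the abstract, the sketch below collects link text, title text and image alternative text for each link in an HTML string, then assigns numeric labels to an ambiguous group. It uses Python's standard `html.parser` module; the class and function names (`PhraseCollector`, `collect_phrases`, `label_candidates`), the restriction to `<a href>` elements, and the use of numbers as the unique labels are assumptions for this example, not details taken from the patent.

```python
from html.parser import HTMLParser
from typing import Dict, List, Optional


class PhraseCollector(HTMLParser):
    """Collects link text, title text and image alt text for each <a href> element."""

    def __init__(self) -> None:
        super().__init__()
        self.links: List[List[str]] = []         # one phrase list per link
        self._text: Optional[List[str]] = None   # text chunks of the open link

    def handle_starttag(self, tag, attrs):
        attr = dict(attrs)
        if tag == "a" and "href" in attr:
            self.links.append([])
            self._text = []
            if attr.get("title"):
                self.links[-1].append(attr["title"])
        elif tag == "img" and self._text is not None and attr.get("alt"):
            self.links[-1].append(attr["alt"])

    def handle_data(self, data):
        if self._text is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._text is not None:
            text = " ".join("".join(self._text).split())  # collapse whitespace
            if text:
                self.links[-1].insert(0, text)  # link text goes first
            self._text = None


def collect_phrases(html: str) -> List[List[str]]:
    """Return the phrases of every link; together they would form the grammar."""
    parser = PhraseCollector()
    parser.feed(html)
    parser.close()
    return parser.links


def label_candidates(candidates: List[int]) -> Dict[str, int]:
    """Map spoken labels "1", "2", ... to the indexes of the ambiguous links."""
    return {str(n): idx for n, idx in enumerate(candidates, start=1)}


if __name__ == "__main__":
    page = ('<a href="/n" title="Latest news">News</a>'
            '<a href="/h"><img src="h.png" alt="Home"></a>')
    print(collect_phrases(page))
    print(label_candidates([0, 1]))
```

In a real page the labels would be drawn next to the candidate elements in the rendered display; the sketch only produces the label-to-element mapping that a subsequent spoken command would be resolved against.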
25 Citations
20 Claims
13. A computing device, comprising:
a display device;
a storage device which stores code and a document; and
a processor associated with the display device and the storage device, the processor executes the code to:
analyze a document to identify a plurality of interactive elements in the document, each interactive element of the plurality of interactive elements comprises an associated phrase;
render the document to provide a display on a display device, the associated phrases are provided in the display;
compare a voice command of a user to a plurality of phrases, the plurality of phrases comprise the associated phrases of the plurality of interactive elements;
based on the comparing, determine a matching score for each interactive element indicating a degree of matching of its associated phrase to the voice command, the matching scores are based on a number of words in the associated phrases which match the sequence of words;
identify one of the interactive elements as a closest match to the voice command based on its matching score; and
based on the identifying, generating a click event for the one of the interactive elements which is the closest match.
Dependent claims: 14, 15, 16, 17
18. A computer-readable storage device having computer-readable software embodied thereon for programming a processor to perform a method for providing a voice user interface, the method comprising:
identify a plurality of links in a document, each link comprises link text, the link text for at least one of the links comprises a sequence of words;
displaying the document including the link text on a display device;
comparing a voice command of a user to a plurality of phrases, the plurality of phrases comprise the link text of the plurality of links, the comparing comprises comparing the sequence of words to the voice command and determining a longest subset of the sequence of words which matches the voice command;
based on the comparing, determining a matching score for each link indicating a degree of matching of its associated link text to the voice command, wherein the matching score for the at least one of the links is based on a number of words in the longest subset of the sequence of words which matches the voice command; and
identifying one of the links as a closest match to the voice command based on its matching score.
Dependent claims: 19, 20
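One way to read the scoring in claim 18 is sketched below. It assumes that the "longest subset of the sequence of words which matches the voice command" means the longest run of consecutive link-text words that also appears consecutively in the command; the claim itself does not say whether the words must be contiguous. The function names and the tie-breaking rule (earliest link wins) are likewise assumptions.

```python
from typing import List


def longest_matching_run(link_words: List[str], command_words: List[str]) -> int:
    """Length of the longest run of consecutive words shared by both sequences."""
    best = 0
    prev = [0] * (len(command_words) + 1)
    for link_word in link_words:
        curr = [0] * (len(command_words) + 1)
        for j, command_word in enumerate(command_words, start=1):
            if link_word == command_word:
                # Extend the run that ended at the previous word of each sequence.
                curr[j] = prev[j - 1] + 1
                best = max(best, curr[j])
        prev = curr
    return best


def score_link(link_text: str, command: str) -> int:
    """Matching score: number of words in the longest matching run (assumed reading)."""
    return longest_matching_run(link_text.lower().split(), command.lower().split())


def closest_link(link_texts: List[str], command: str) -> int:
    """Index of the highest-scoring link; ties go to the earliest link.

    Expects a non-empty list of link texts.
    """
    scores = [score_link(text, command) for text in link_texts]
    return max(range(len(scores)), key=scores.__getitem__)


if __name__ == "__main__":
    texts = ["World news", "Top sports news today", "Contact"]
    print(closest_link(texts, "go to sports news"))
```

Under this reading, "Top sports news today" scores 2 against the command "go to sports news" because the run "sports news" matches, while "World news" scores 1.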
Specification