Enabling voice selection of user preferences
Abstract
A method, system and apparatus for voice enabling a user preference interface in a multimodal content browser. A method for voice enabling a user preference interface in a multimodal content browser can include matching voice input to a bound command in a speech grammar and invoking logic in the user preference interface consistent with the bound command in the speech grammar. The matching step can include comparing voice input to entries in a markup language specified speech grammar and locating the bound command in the specified speech grammar based upon the comparison. In this regard, the method further can include identifying a variable in the bound command, looking up the variable in a table, retrieving a corresponding parameter for the variable from the table, and replacing the variable with the corresponding parameter in the bound command.
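The variable lookup described in the abstract can be illustrated with a minimal sketch. It assumes, purely for illustration, that the speech grammar is a dictionary from spoken phrases to bound commands and that variables are written as `%NAME%`; the table contents, the variable syntax, and all function names are hypothetical and are not taken from the specification.

```python
import re

# Hypothetical table mapping variable names to their corresponding parameters.
PARAMETER_TABLE = {
    "HOME_URL": "http://example.com/home",
    "FONT_STEP": "2",
}

# Hypothetical speech grammar: spoken phrase -> bound command with variables.
GRAMMAR = {
    "go home": "navigate(%HOME_URL%)",
    "bigger text": "increase_font(%FONT_STEP%)",
}

# Assumed variable syntax: a name wrapped in percent signs.
VARIABLE = re.compile(r"%(\w+)%")


def resolve_bound_command(command, table):
    """Replace each %NAME% variable with its parameter from the table.

    Raises KeyError if a variable has no entry in the table.
    """
    return VARIABLE.sub(lambda match: table[match.group(1)], command)


def match_voice_input(utterance, grammar, table):
    """Compare the utterance to the grammar entries and return the resolved
    bound command, or None when no entry matches.

    The utterance is assumed to be already transcribed to text; real speech
    recognition is outside the scope of this sketch.
    """
    command = grammar.get(utterance.strip().lower())
    if command is None:
        return None
    return resolve_bound_command(command, table)
```

Calling `match_voice_input("Go home", GRAMMAR, PARAMETER_TABLE)` locates the bound command for the phrase and returns it with the variable replaced by the table's parameter.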
20 Claims
1. A method for voice enabling a user interface in a multimodal content browser, the method comprising acts of:
accessing a first speech grammar, the first speech grammar having stored therein at least one voice command, the first speech grammar further storing a mapping of the at least one voice command to a corresponding placeholder identifier;
prior to performing a voice recognition processing, obtaining a second speech grammar from the first speech grammar, the second speech grammar storing a mapping of the at least one voice command to a navigation action that can be triggered by a user through the user interface, wherein the act of obtaining the second speech grammar comprises substituting a string of characters indicative of the navigation action in place of the placeholder identifier in the first speech grammar to obtain the second speech grammar, the string of characters being different from the placeholder identifier;
using the second speech grammar to perform the voice recognition processing, wherein the voice recognition processing comprises recognizing, from received voice input, the at least one voice command in the second speech grammar;
identifying the navigation action specified by the second speech grammar as corresponding to the at least one voice command; and
invoking logic in the user interface consistent with the navigation action.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
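The acts recited in claim 1 can be sketched as follows. This is only an illustration under stated assumptions: each grammar is modeled as a dictionary, placeholder identifiers look like `$ACTION_1`, navigation actions are named by strings such as `history.back`, and recognition is reduced to a text lookup. None of these names or formats come from the claims.

```python
from typing import Callable, Dict, Optional

# Hypothetical first speech grammar: voice command -> placeholder identifier.
FIRST_GRAMMAR = {
    "go back": "$ACTION_1",
    "next page": "$ACTION_2",
}

# Hypothetical strings indicative of navigation actions, keyed by placeholder.
SUBSTITUTIONS = {
    "$ACTION_1": "history.back",
    "$ACTION_2": "page.next",
}

# Hypothetical user interface logic, keyed by navigation action string.
HANDLERS: Dict[str, Callable[[], str]] = {
    "history.back": lambda: "went back",
    "page.next": lambda: "moved to next page",
}


def build_second_grammar(first: Dict[str, str],
                         substitutions: Dict[str, str]) -> Dict[str, str]:
    """Obtain the second grammar by substituting an action string for each
    placeholder identifier. This runs before any recognition takes place."""
    second = {}
    for command, placeholder in first.items():
        action = substitutions[placeholder]
        if action == placeholder:
            # The claim requires the string to differ from the placeholder.
            raise ValueError(f"no real substitution for {placeholder}")
        second[command] = action
    return second


def recognize(voice_input: str, grammar: Dict[str, str]) -> Optional[str]:
    """Stand-in for voice recognition: the input is assumed to be already
    transcribed, and is matched against the commands in the grammar."""
    phrase = voice_input.strip().lower()
    return phrase if phrase in grammar else None


def handle_voice_input(voice_input: str, grammar: Dict[str, str],
                       handlers: Dict[str, Callable[[], str]]) -> Optional[str]:
    """Recognize the command, identify its navigation action, invoke logic."""
    command = recognize(voice_input, grammar)
    if command is None:
        return None
    action = grammar[command]   # navigation action specified by the grammar
    return handlers[action]()   # invoke the matching user interface logic
```

In this sketch the second grammar is built once with `build_second_grammar(FIRST_GRAMMAR, SUBSTITUTIONS)` and then passed to `handle_voice_input`, so the recognizer only ever sees concrete action strings rather than placeholders.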
10. A system for voice enabling a user interface in a multimodal content browser, the system comprising:
a first speech grammar having stored therein at least one voice command entry that stores a mapping of a voice command to a corresponding placeholder identifier; and
at least one processor configured to:
obtain a second speech grammar from the first speech grammar, the second speech grammar storing a mapping of the at least one voice command to a navigation action that can be triggered by a user through the user interface, wherein obtaining the second speech grammar comprises substituting a string of characters indicative of the navigation action in place of the placeholder identifier in the first speech grammar to obtain the second speech grammar, the string of characters being different from the placeholder identifier;
use the second speech grammar to perform voice recognition processing, wherein the voice recognition processing comprises identifying, based on received voice input, the at least one voice command in at least one voice command entry in said second speech grammar;
identify the navigation action specified by the second speech grammar as corresponding to the at least one voice command; and
invoke logic in the user interface consistent with the navigation action.
Dependent claims: 11, 12
13. At least one non-transitory computer-readable medium having stored thereon computer instructions which, when executed, perform a method for voice enabling a user interface in a multimodal content browser, the method comprising acts of:
accessing a first speech grammar, the first speech grammar having stored therein at least one voice command, the first speech grammar further storing a mapping of the at least one voice command to a corresponding placeholder identifier;
prior to performing a voice recognition processing, obtaining a second speech grammar from the first speech grammar, the second speech grammar storing a mapping of the at least one voice command to a navigation action that can be triggered by a user through the user interface, wherein the act of obtaining the second speech grammar comprises substituting a string of characters indicative of the navigation action in place of the placeholder identifier in the first speech grammar to obtain the second speech grammar, the string of characters being different from the placeholder identifier;
using the second speech grammar to perform the voice recognition processing, wherein the voice recognition processing comprises recognizing, from received voice input, the at least one voice command in the second speech grammar;
identifying the navigation action specified by the second speech grammar as corresponding to the at least one voice command; and
invoking logic in the user interface consistent with the navigation action.
Dependent claims: 14, 15, 16, 17, 18, 19, 20
Specification