Using multiple modality input to feedback context for natural language understanding
First Claim
1. A computer-implemented method for recognizing speech, the method comprising:
receiving a spoken query for a first input field;
identifying a category for the spoken query based on at least a user-inputted value associated with a second input field;
converting the spoken query to text according to a statistical dialog manager of a spoken dialog system of a computerized natural language service, the statistical dialog manager associated with the identified category and utilized to statistically weight terms belonging to the identified category; and
providing a response to the spoken query.
2 Assignments
0 Petitions
Abstract
Input context for a statistical dialog manager may be provided. Upon receiving a spoken query from a user, the query may be categorized according to at least one context clue. The spoken query may then be converted to text according to a statistical dialog manager associated with the category of the query and a response to the spoken query may be provided to the user.
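The flow in the abstract (categorize the query from a context clue, then convert it with a manager tied to that category) can be illustrated with a minimal sketch. Everything named here is hypothetical: `CategoryModel`, `MODELS`, `identify_category`, and `recognize` do not come from the patent, and the patent does not say that conversion works by rescoring a list of candidate transcriptions. That rescoring step is an assumption made only to show how category-specific term weights could change which transcription wins.

```python
from dataclasses import dataclass, field


@dataclass
class CategoryModel:
    """Hypothetical stand-in for a category-specific statistical dialog manager."""
    name: str
    term_weights: dict = field(default_factory=dict)

    def score(self, hypothesis: str, base_score: float) -> float:
        # Add a bonus for every word that belongs to this category's vocabulary.
        bonus = sum(self.term_weights.get(w, 0.0) for w in hypothesis.lower().split())
        return base_score + bonus


# Illustrative categories and weights (assumed, not taken from the patent).
MODELS = {
    "movies": CategoryModel("movies", {"showtimes": 1.0, "matinee": 1.5}),
    "restaurants": CategoryModel("restaurants", {"menu": 1.0, "reservation": 1.5}),
}
GENERIC = CategoryModel("generic")


def identify_category(context_value: str) -> str:
    """Map a context clue, such as a value typed into another field, to a category."""
    clue = context_value.strip().lower()
    return clue if clue in MODELS else "generic"


def recognize(hypotheses, context_value: str) -> str:
    """Return the candidate text that scores best under the category's model.

    `hypotheses` is a list of (text, base_score) pairs from some upstream
    recognizer; higher scores are better.
    """
    model = MODELS.get(identify_category(context_value), GENERIC)
    return max(hypotheses, key=lambda h: model.score(h[0], h[1]))[0]


candidates = [("matt in a", -1.0), ("matinee", -1.2)]
print(recognize(candidates, "Movies"))   # category weights favor "matinee"
print(recognize(candidates, "unknown"))  # no category match, base scores decide
```

With the second field set to "Movies", the weighted term lifts "matinee" above the acoustically stronger "matt in a"; with an unrecognized clue, the generic model leaves the base ranking unchanged.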
44 Citations
20 Claims
1. A computer-implemented method for recognizing speech, the method comprising:
receiving a spoken query for a first input field;
identifying a category for the spoken query based on at least a user-inputted value associated with a second input field;
converting the spoken query to text according to a statistical dialog manager of a spoken dialog system of a computerized natural language service, the statistical dialog manager associated with the identified category and utilized to statistically weight terms belonging to the identified category; and
providing a response to the spoken query.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
10. A system for providing input context to a speech recognizer, the system comprising:
at least one processor; and
a memory operatively connected to the at least one processor, the memory storing instructions that when executed by the at least one processor perform a set of operations comprising:
receiving a spoken query for a first input field;
identifying a category for the spoken query based on a user-inputted value entered into a second input field;
converting the spoken query to text according to a statistical dialog manager of a spoken dialog system of a computerized natural language service, the statistical dialog manager associated with the category and utilized to statistically weight terms belonging to the category; and
providing a response to the spoken query.
Dependent claims: 11, 12, 13, 14, 15, 16
17. A storage device, having computer-executable instructions that, when executed by at least one processor, perform a method receiving spoken input, the method comprising:
analyzing contextual data associated with a web page, wherein the plurality of contextual data comprises a plurality of inputs and responses associated with an interactive form of the web page;
based on the analysis, determining a plurality of categories comprising a category and one or more subcategories of the category;
building a plurality of statistical dialog managers, wherein a first statistical dialog manager in the plurality of statistical dialog managers is associated with the category and at least one additional statistical dialog manager in the plurality of statistical dialog managers is associated with each of the one or more subcategories;
receiving a spoken user input via a web browser application in communication with the web page;
categorizing the spoken user input according to at least one of a name of a first input field and a user-inputted value associated with a second input field;
determining a subcategory for the spoken user input;
converting the categorized spoken user input to text via the first statistical dialog manager associated with the category of the spoken user input and the at least one additional statistical dialog manager associated with the one or more subcategories, wherein the subcategory is based on additional page elements within the web page; and
applying the converted spoken user input to a web page element.
Dependent claims: 18, 19, 20
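The build-then-convert sequence recited in claim 17 can be sketched roughly as follows. This is an illustration under stated assumptions, not the claimed implementation: each "statistical dialog manager" is reduced to a table of relative term frequencies, the category and subcategory managers are combined by a fixed linear weight, and all names (`build_manager`, `build_managers`, `score`, `convert`, `sub_weight`) and the sample form history are hypothetical.

```python
from collections import Counter


def build_manager(utterances):
    """Toy manager: relative frequency of each term seen in past form inputs."""
    counts = Counter(w for u in utterances for w in u.lower().split())
    total = sum(counts.values())
    return {w: c / total for w, c in counts.items()}


def build_managers(form_history):
    """Build one manager per category and one per (category, subcategory).

    `form_history` maps (category, subcategory) to a list of past inputs
    collected from the page's interactive form (assumed data layout).
    """
    by_category = {}
    subcategory_managers = {}
    for (category, subcategory), utterances in form_history.items():
        by_category.setdefault(category, []).extend(utterances)
        subcategory_managers[(category, subcategory)] = build_manager(utterances)
    category_managers = {c: build_manager(u) for c, u in by_category.items()}
    return category_managers, subcategory_managers


def score(hypothesis, category_mgr, subcategory_mgr, sub_weight=0.5):
    """Blend category and subcategory term weights for one candidate text."""
    return sum(
        (1 - sub_weight) * category_mgr.get(w, 0.0)
        + sub_weight * subcategory_mgr.get(w, 0.0)
        for w in hypothesis.lower().split()
    )


def convert(hypotheses, field_name, field_to_category, subcategory,
            category_managers, subcategory_managers):
    """Categorize by input field name, then pick the best-scoring candidate."""
    category = field_to_category[field_name]
    cat_mgr = category_managers[category]
    sub_mgr = subcategory_managers.get((category, subcategory), {})
    return max(hypotheses, key=lambda h: score(h, cat_mgr, sub_mgr))


history = {
    ("travel", "flights"): ["boston to denver", "denver to seattle"],
    ("travel", "hotels"): ["two nights downtown"],
}
cats, subs = build_managers(history)
text = convert(["den fur", "denver"], "destination",
               {"destination": "travel"}, "flights", cats, subs)
print(text)
```

The returned text would then be written into the target page element; that final step is left out because it depends on the browser integration, which the claim does not detail.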
Specification