Communication system
Abstract
Systems and methods for responding to spoken language input or multi-modal input are described herein. More specifically, one or more user intents are determined or inferred from the spoken language input or multi-modal input to determine one or more user goals via a dialogue belief tracking system. The systems and methods disclosed herein utilize the dialogue belief tracking system to perform actions based on the determined one or more user goals and allow a device to engage in human-like conversation with a user over multiple turns of a conversation. Preventing the user from having to explicitly state each intent and desired goal while still receiving the desired goal from the device improves a user's ability to accomplish tasks, perform commands, and get desired products and/or services. Additionally, the improved response to spoken language inputs from a user improves user interactions with the device.
20 Claims
1. A method for controlling the response to spoken language input, comprising:
receiving user data from a device;
receiving a first spoken language input from the device;
identifying tags within the first spoken language input;
searching a knowledge base framework based on the tags and the user data, wherein the knowledge base framework is a database that includes a plurality of entities, attributes, and relationships between the entities and the attributes;
identifying entities, attributes, and relationship within the knowledge base framework that match at least one of the tags and the user data;
creating a state graph based on a portion of the knowledge base framework that includes any matched entities, matched attributes, and identified relationships and based on the tags, wherein the state graph is created at least in part by transforming the portion of the knowledge base framework into a probabilistic model graph by replacing the identified relationships with weighted connections and by assigning a confidence indicator to each node of the state graph;
determining at least one goal based on the state graph; and
sending instructions to perform an action to the device based on the at least one goal, the weighted connections, and the confidence indicators.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14
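For illustration only, the steps recited in claim 1 can be sketched in Python as follows. This is a minimal sketch under stated assumptions and not the claimed implementation: the toy knowledge base, the names `StateGraph`, `build_state_graph`, `propagate`, and `choose_action`, the prior values 0.9 and 0.1, the uniform connection weight, the noisy-OR update, and the 0.6 threshold are all hypothetical choices that do not appear in the claims.

```python
from dataclasses import dataclass, field

# Hypothetical toy knowledge base of (entity, relationship, attribute) triples.
KNOWLEDGE_BASE = [
    ("Avatar", "directed_by", "James Cameron"),
    ("Avatar", "genre", "science fiction"),
    ("Titanic", "directed_by", "James Cameron"),
    ("Titanic", "genre", "romance"),
]


@dataclass
class StateGraph:
    confidence: dict = field(default_factory=dict)  # node -> confidence indicator
    weights: dict = field(default_factory=dict)     # (entity, attribute) -> weight
    entities: set = field(default_factory=set)      # nodes that are KB entities


def build_state_graph(tags, user_data, kb=KNOWLEDGE_BASE, edge_weight=0.5):
    """Copy the matching portion of the KB into a weighted, scored graph."""
    evidence = {item.lower() for item in list(tags) + list(user_data)}
    graph = StateGraph()
    for entity, _relationship, attribute in kb:
        if entity.lower() in evidence or attribute.lower() in evidence:
            # The named relationship is dropped in favour of a weighted connection.
            graph.weights[(entity, attribute)] = edge_weight
            graph.entities.add(entity)
            for node in (entity, attribute):
                # Assumed priors: observed nodes start high, inferred nodes low.
                graph.confidence[node] = 0.9 if node.lower() in evidence else 0.1
    return graph


def propagate(graph):
    """One noisy-OR pass: neighbours lend support scaled by the connection weight."""
    disbelief = {node: 1.0 - conf for node, conf in graph.confidence.items()}
    for (a, b), weight in graph.weights.items():
        disbelief[a] *= 1.0 - weight * graph.confidence[b]
        disbelief[b] *= 1.0 - weight * graph.confidence[a]
    graph.confidence = {node: 1.0 - d for node, d in disbelief.items()}


def choose_action(graph, threshold=0.6):
    """Pick the most confident entity as the goal and decide how to respond."""
    goal = max(graph.entities, key=lambda node: graph.confidence[node])
    verb = "present" if graph.confidence[goal] >= threshold else "clarify"
    return verb, goal


graph = build_state_graph(tags=["James Cameron"], user_data=["science fiction"])
propagate(graph)
action = choose_action(graph)
```

In this sketch the tag "James Cameron" matches two entities, while the user data "science fiction" supports only one of them, so that entity ends up with the higher confidence indicator and is selected as the goal. The unmatched triple ("Titanic", "genre", "romance") never enters the state graph.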
15. A system comprising:
a computing device including a processing unit and a memory, the processing unit implementing a spoken language system and a dialogue state belief tracking system, the spoken language system is operable to:
receive a spoken language input,
identify tags within the spoken language input, and
communicate with the dialogue state belief tracking system; and
wherein the dialogue state belief tracking system is operable to:
communicate with the spoken language system,
search a knowledge base framework based on the tags identified by the spoken language system;
identify entities, attributes, and relationships within the knowledge base framework that match at least some of the tags;
create a state graph based on a portion of the knowledge base framework that includes any matched entities, matched attributes, and identified relationships, wherein the state graph is formed by transforming the portion into a probabilistic model graph, and wherein the state graph includes a confidence indicator for each node of the state graph;
rank nodes of the state graph;
determine at least one goal based on the rank of the nodes of the state graph; and
send instructions to perform an action based on the at least one goal.
Dependent claims: 16, 17, 18, 19
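By way of example only, the ranking and goal-determination steps of claim 15 might be sketched as below. The claim does not say how nodes are ranked or how many goals are kept, so sorting by confidence indicator, the function names `rank_nodes` and `goals_from_rank`, the `margin` parameter, and the sample node names are all assumptions made for this illustration.

```python
def rank_nodes(confidence):
    """Return node names from most to least confident (ties broken by name)."""
    return sorted(confidence, key=lambda node: (-confidence[node], node))


def goals_from_rank(confidence, candidates, margin=0.2):
    """Keep the top-ranked candidate plus any candidate within `margin` of it."""
    ranked = [node for node in rank_nodes(confidence) if node in candidates]
    if not ranked:
        return []
    best = confidence[ranked[0]]
    return [node for node in ranked if best - confidence[node] <= margin]


# Hypothetical confidence indicators taken from a state graph.
confidence = {"book_table": 0.8, "get_directions": 0.7, "play_music": 0.2}
ranking = rank_nodes(confidence)
goals = goals_from_rank(confidence, candidates=set(confidence))
```

Returning every candidate within a margin of the top-ranked node, rather than only the single best node, is one way to accommodate the "at least one goal" language; a system could then act on one goal or ask the user to choose among several.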
20. A computer-readable storage device including computer-executable instructions stored thereon which, when executed by a computing system in a distributed network, cause the computing system to perform a method comprising:
receiving user data from a device;
receiving a second spoken language input from the device;
identifying tags within the second spoken language input in view of previously determined tags from a first spoken language input in a conversation between a user and the device;
searching a knowledge base framework based on the tags and the user data;
identifying entities, attributes, and relationship within the knowledge base framework that match at least one of the tags and the user data;
creating an updated state graph by aligning a portion of the knowledge base framework that includes any matched entities, matched attributes, and identified relationships with a stored state graph;
determining at least one user goal based on the updated state graph; and
sending instructions to perform an action to the device based on the at least one user goal and a confidence indicator for the at least one user goal.
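As a non-limiting illustration of the multi-turn update in claim 20, the sketch below aligns the portion matched in a second turn with a stored state graph. The claim does not specify the alignment rule, so the noisy-OR reinforcement of nodes seen in both turns, the `decay` factor for nodes not mentioned again, the function names `align_state_graph` and `top_goal`, and the sample node names are assumptions. Each graph is reduced here to a mapping from node to confidence indicator.

```python
def align_state_graph(stored, new_portion, decay=0.8):
    """Merge a new turn's node confidences into the stored state graph."""
    updated = {}
    for node, old_conf in stored.items():
        if node in new_portion:
            # Evidence from both turns reinforces the node (noisy-OR).
            updated[node] = 1.0 - (1.0 - old_conf) * (1.0 - new_portion[node])
        else:
            # Nodes not mentioned again fade rather than vanish.
            updated[node] = old_conf * decay
    for node, new_conf in new_portion.items():
        # Nodes seen for the first time are added with their new confidence.
        updated.setdefault(node, new_conf)
    return updated


def top_goal(graph):
    """Return the most confident node together with its confidence indicator."""
    node = max(graph, key=lambda name: graph[name])
    return node, graph[node]


# Hypothetical first-turn graph and second-turn matches.
stored = {"Italian": 0.5, "downtown": 0.5}
second_turn = {"Italian": 0.5, "tonight": 0.9}
updated = align_state_graph(stored, second_turn)
goal, goal_confidence = top_goal(updated)
```

Keeping faded nodes in the graph instead of dropping them lets later turns revive earlier context, which fits the stated aim of conversing over multiple turns without the user restating each intent.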
Specification