Simultaneous dialogue state management using frame tracking
First Claim
1. A system comprising:
- at least one processor; and
a memory storing instructions that when executed by the at least one processor perform a set of operations comprising;
receiving an input utterance of a current frame of a conversation;
generating, using natural language understanding, a predicted value and a predicted act for the input utterance;
determining, using a first model trained to predict slot types, whether the predicted value is a new value for a slot having a pre-existing value in the current frame;
when it is determined that the predicted value is a new value having a pre-existing value in the current frame, creating a new frame of the conversation;
determining, using a second model trained to predict dialogue acts, whether the predicted act relates to a previous frame of the conversation;
when it is determined that the predicted act relates to a previous frame, generating an association between the current frame and the previous frame of the conversation;
determining whether the predicted act switches to the previous frame of the conversation; and
when it is determined that the predicted act switches to the previous frame of the conversation, switching to the previous frame of the conversation;
wherein at least two frames, selected from the group consisting of the new frame, the current frame, and the previous frame, are retained in memory, thereby tracking multiple states of the conversation simultaneously.
1 Assignment
0 Petitions
Accused Products
Abstract
Examples of the present disclosure describe systems and methods relating to conversation state management using frame tracking. In an example, a frame may represent one or more constraints (e.g., parameters, variables, or other information) received from or generated as a result of interactions with a user. Consequently, each frame may represent one or more states of an ongoing conversation. When the user provides new or different information, a new frame may be created to represent the now-current state of the conversation. The previous frame may be retained for later access by what is referred to herein as a “dialog agent,” which is the portion of the system that can search and use previous state-related information. When an utterance is received, a frame to which the utterance relates may be identified. Thus, the dialog agent may track multiple states simultaneously, thereby enabling conversation features that were not previously possible.
29 Citations
20 Claims
-
1. A system comprising:
-
at least one processor; and a memory storing instructions that when executed by the at least one processor perform a set of operations comprising; receiving an input utterance of a current frame of a conversation; generating, using natural language understanding, a predicted value and a predicted act for the input utterance; determining, using a first model trained to predict slot types, whether the predicted value is a new value for a slot having a pre-existing value in the current frame; when it is determined that the predicted value is a new value having a pre-existing value in the current frame, creating a new frame of the conversation; determining, using a second model trained to predict dialogue acts, whether the predicted act relates to a previous frame of the conversation; when it is determined that the predicted act relates to a previous frame, generating an association between the current frame and the previous frame of the conversation; determining whether the predicted act switches to the previous frame of the conversation; and when it is determined that the predicted act switches to the previous frame of the conversation, switching to the previous frame of the conversation; wherein at least two frames, selected from the group consisting of the new frame, the current frame, and the previous frame, are retained in memory, thereby tracking multiple states of the conversation simultaneously. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method for dialogue state management, comprising:
-
receiving, from a computing device, an input utterance of a current frame of a conversation; generating, using natural language understanding, a predicted value and a predicted act for the input utterance; determining, using a first model trained to predict slot types, whether the predicted value is a new value for a slot having a pre-existing value in the current frame; based on determining that the predicted value is a new value, creating a new frame of the conversation; determining, using a second model trained to predict dialogue acts, whether the predicted act relates to a previous frame of the conversation; based on determining that the predicted act relates to a previous frame, generating an association between the current frame and the previous frame of the conversation; determining whether the predicted act switches to the previous frame of the conversation; based on determining that the predicted act switches to the previous frame of the conversation, switching to the previous frame of the conversation; retaining at least two frames in memory, selected from the group consisting of the new frame, the current frame, and the previous frame, thereby tracking multiple states of the conversation simultaneously; generating, based on the predicted value and the predicted act, a response to the received input utterance; and providing the generated response to the computing device. - View Dependent Claims (9, 10, 11, 12, 13)
-
-
14. A method for dialogue state management, comprising:
-
receiving an input utterance of a current frame of a conversation; generating, using natural language understanding, a predicted value and a predicted act for the input utterance; determining, using a first model trained to predict slot types, whether the predicted value is a new value for a slot having a pre-existing value in the current frame; based on determining that the predicted value is a new value, creating a new frame of the conversation; determining, using a second model trained to predict dialogue acts, whether the predicted act relates to a previous frame of the conversation; based on determining that the predicted act relates to a previous frame, generating an association between the current frame and the previous frame of the conversation determining whether the predicted act switches to the previous frame of the conversation; and when it is determined that the predicted act switches to the previous frame of the conversation, switching to the previous frame of the conversation, wherein at least two frames, selected from the group consisting of the new frame, the current frame, and the previous frame, are retained in memory, thereby tracking multiple states of the conversation simultaneously. - View Dependent Claims (15, 16, 17, 18, 19, 20)
-
Specification