Hierarchical belief states for digital assistants
First Claim
1. An electronic device, comprising:
- one or more processors;
a memory; and
one or more programs, wherein the one or more programs are stored in the memory and configured to be executed by the one or more processors, the one or more programs including instructions for;
receiving a user utterance of a dialogue;
parsing one or more text representations of the user utterance to determine a plurality of semantic interpretations for the user utterance, the plurality of semantic interpretations including more than two concepts or properties, wherein the parsing includes determining that a first concept or property and a second concept or property in the more than two concepts or properties have a joint semantic relationship;
determining, based on the plurality of semantic interpretations, a belief state for the dialogue, the belief state comprising a plurality of dialogue slots representing the more than two concepts or properties, wherein each dialogue slot of the plurality of dialogue slots includes a respective marginal certainty for a respective concept or property represented by the respective dialogue slot, and wherein a first dialogue slot of the plurality of dialogue slots further includes a joint certainty for the first concept or property and the second concept or property based on the determined joint semantic relationship;
determining a plurality of candidate policy actions from the determined belief state;
selecting, based on the marginal certainty of each dialogue slot of the plurality of dialogue slots and the joint certainty, a policy action from the plurality of candidate policy actions; and
performing the selected policy action, including outputting results of the policy action for presentation.
1 Assignment
0 Petitions
Accused Products
Abstract
Systems and processes for operating a digital assistant using a hierarchical belief state are disclosed. In an example process, a user utterance of a dialog is received. A belief state for the dialog is determined. The belief state comprises a plurality of dialog slots. Each dialog slot of the plurality of dialog slots includes a respective marginal certainty for a concept or property represented by the respective dialog slot. A first dialog slot of the plurality of dialog slots further includes one or more joint certainties for one or more interpretations arising from the first dialog slot. Based on the marginal certainty of each dialog slot of the plurality of dialog slots and the one or more joint certainties of the first dialog slot, a policy action is selected from a plurality of candidate policy actions that correspond to the belief state. The selected policy action is performed.
-
Citations
50 Claims
-
1. An electronic device, comprising:
-
one or more processors; a memory; and one or more programs, wherein the one or more programs are stored in the memory and configured to be executed by the one or more processors, the one or more programs including instructions for; receiving a user utterance of a dialogue; parsing one or more text representations of the user utterance to determine a plurality of semantic interpretations for the user utterance, the plurality of semantic interpretations including more than two concepts or properties, wherein the parsing includes determining that a first concept or property and a second concept or property in the more than two concepts or properties have a joint semantic relationship; determining, based on the plurality of semantic interpretations, a belief state for the dialogue, the belief state comprising a plurality of dialogue slots representing the more than two concepts or properties, wherein each dialogue slot of the plurality of dialogue slots includes a respective marginal certainty for a respective concept or property represented by the respective dialogue slot, and wherein a first dialogue slot of the plurality of dialogue slots further includes a joint certainty for the first concept or property and the second concept or property based on the determined joint semantic relationship; determining a plurality of candidate policy actions from the determined belief state; selecting, based on the marginal certainty of each dialogue slot of the plurality of dialogue slots and the joint certainty, a policy action from the plurality of candidate policy actions; and performing the selected policy action, including outputting results of the policy action for presentation. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. A method for operating a digital assistant, the method comprising:
at an electronic device having one or more processors and memory; receiving a user utterance of a dialogue; parsing one or more text representations of the user utterance to determine a plurality of semantic interpretations for the user utterance, the plurality of semantic interpretations including more than two concepts or properties, wherein the parsing includes determining that a first concept or property and a second concept or property in the more than two concepts or properties have a joint semantic relationship; determining, based on the plurality of semantic interpretations, a belief state for the dialogue, the belief state comprising a plurality of dialogue slots representing the more than two concepts or properties, wherein each dialogue slot of the plurality of dialogue slots includes a respective marginal certainty for a respective concept or property represented by the respective dialogue slot, and wherein a first dialogue slot of the plurality of dialogue slots further includes a joint certainty for the first concept or property and the second concept or property based on the determined joint semantic relationship; determining a plurality of candidate policy actions from the determined belief state; selecting, based on the marginal certainty of each dialogue slot of the plurality of dialogue slots and the joint certainty, a policy action from the plurality of candidate policy actions; and performing the selected policy action, including outputting results of the policy action for presentation. - View Dependent Claims (22, 23, 24, 25, 26, 27, 28, 29, 30)
-
31. A non-transitory computer-readable storage medium storing one or more programs configured to be executed by one or more processors of an electronic device and memory, the one or more programs including instructions for:
-
receiving a user utterance of a dialogue; parsing one or more text representations of the user utterance to determine a plurality of semantic interpretations for the user utterance, the plurality of semantic interpretations including more than two concepts or properties, wherein the parsing includes determining that a first concept or property and a second concept or property in the more than two concepts or properties have a joint semantic relationship; determining, based on the plurality of semantic interpretations, a belief state for the dialogue, the belief state comprising a plurality of dialogue slots representing the more than two concepts or properties, wherein each dialogue slot of the plurality of dialogue slots includes a respective marginal certainty for a respective concept or property represented by the respective dialogue slot, and wherein a first dialogue slot of the plurality of dialogue slots further includes a joint certainty for the first concept or property and the second concept or property based on the determined joint semantic relationship; determining a plurality of candidate policy actions from the determined belief state; selecting, based on the marginal certainty of each dialogue slot of the plurality of dialogue slots and the joint certainty, a policy action from the plurality of candidate policy actions; and performing the selected policy action, including outputting results of the policy action for presentation. - View Dependent Claims (32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50)
-
Specification