Contextual language understanding for multi-turn language tasks
First Claim
1. A system comprising:
- at least one processor; and
a memory encoding computer executable instructions which, when executed by at least one processor, perform a method for contextual language understanding, comprising;
receiving at least a first natural language expression and a second natural language expression, wherein each of the first natural language expression and the second natural language expression include at least one of words, terms, and phrases;
determining, using a single-turn model, a first weighted prediction of at least one of a domain classification, intent classification, and slot type of the first natural language expression;
determining, using a multi-turn model, a second weighted prediction of at least one of a domain classification, intent classification, and slot type of the second natural language expression using at least one of the first natural language expression and contextual information; and
performing an action based on the second weighted prediction of the second natural language expression.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods and systems are provided for contextual language understanding. A natural language expression may be received at a single-turn model and a multi-turn model for determining an intent of a user. For example, the single-turn model may determine a first prediction of at least one of a domain classification, intent classification, and slot type of the natural language expression. The multi-turn model may determine a second prediction of at least one of a domain classification, intent classification, and slot type of the natural language expression. The first prediction and the second prediction may be combined to produce a final prediction relative to the intent of the natural language expression. An action may be performed based on the final prediction of the natural language expression.
34 Citations
20 Claims
-
1. A system comprising:
-
at least one processor; and a memory encoding computer executable instructions which, when executed by at least one processor, perform a method for contextual language understanding, comprising; receiving at least a first natural language expression and a second natural language expression, wherein each of the first natural language expression and the second natural language expression include at least one of words, terms, and phrases; determining, using a single-turn model, a first weighted prediction of at least one of a domain classification, intent classification, and slot type of the first natural language expression; determining, using a multi-turn model, a second weighted prediction of at least one of a domain classification, intent classification, and slot type of the second natural language expression using at least one of the first natural language expression and contextual information; and performing an action based on the second weighted prediction of the second natural language expression. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system comprising:
-
a statistical model for receiving at least a first natural language expression and a second natural language expression during a conversational session, wherein each of the first natural language expression and the second natural language expression include at least one of words, terms, and phrases; a single-turn model for determining a first prediction of at least one of a domain classification, intent classification, and slot type of each of the first natural language expression and the second natural language expression; a multi-turn model for determining a second prediction of at least one of a domain classification, intent classification, and slot type of each of the first natural language expression and the second natural language expression; a combination model for combining the first prediction and the second prediction of each of the first natural language expression and the second natural language expression to produce a final prediction relative to an intent of at least the second natural language expression; and a final model for performing an action based on the final prediction of at least the second natural language expression. - View Dependent Claims (12, 13, 14, 15, 16, 17)
-
-
18. One or more computer-readable storage media, having computer-executable instructions which, when executed by at least one processor, perform a method for building a statistical model for contextual language understanding, comprising:
-
receiving a first natural language expression, wherein the first natural language expression includes at least one of words, terms, and phrases; performing a first action based on a first prediction determined by a single-turn model and a second prediction determined by a multi-turn model; receiving a second natural language expression, wherein the second natural language expression includes at least one of words, terms, and phrases; evaluating at least the first natural language expression, the first action, the first prediction, the second prediction, and the second natural language expression to generate contextual information; aggregating the contextual information into the multi-turn model; and performing a second action based on evaluating at least the first natural language expression, the first action, the first prediction, the second prediction, and the second natural language expression. - View Dependent Claims (19, 20)
-
Specification