System and method for dialog modeling
Abstract
Disclosed herein are systems, computer-implemented methods, and computer-readable media for dialog modeling. The method includes receiving spoken dialogs annotated to indicate dialog acts and task/subtask information, parsing the spoken dialogs with a hierarchical, parse-based dialog model which operates incrementally from left to right and which only analyzes a preceding dialog context to generate parsed spoken dialogs, and constructing a functional task structure of the parsed spoken dialogs. The method can further either interpret user utterances with the functional task structure of the parsed spoken dialogs or plan system responses to user utterances with the functional task structure of the parsed spoken dialogs. The parse-based dialog model can be a shift-reduce model, a start-complete model, or a connection path model.
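The incremental, left-to-right parsing described in the abstract can be illustrated with a minimal shift-reduce sketch. It is an illustration under stated assumptions, not the patented implementation: the names `Utterance`, `Node`, `reduce_top`, and `parse_dialog` are hypothetical, and the hand-written rule that decides when to reduce stands in for the trained model the disclosure describes. Each decision looks only at the stack (the preceding dialog context) and the incoming utterance.

```python
from dataclasses import dataclass, field
from typing import List, Union


@dataclass
class Utterance:
    speaker: str
    dialog_act: str  # annotated dialog act, e.g. "Request"
    subtask: str     # annotated subtask label, e.g. "opening"


@dataclass
class Node:
    label: str
    children: List[Union["Node", Utterance]] = field(default_factory=list)


def reduce_top(stack: list) -> None:
    """Pop the run of utterances on top of the stack and replace it with one subtask node."""
    run = []
    while stack and isinstance(stack[-1], Utterance):
        run.append(stack.pop())
    run.reverse()  # restore dialog order
    stack.append(Node(run[0].subtask, run))


def parse_dialog(utterances: List[Utterance], task_label: str = "task") -> Node:
    """Build a task/subtask tree incrementally, left to right."""
    stack: list = []
    for utt in utterances:
        # Assumption: a subtask label change triggers a reduce. A trained
        # classifier would make this choice in a real system.
        top = stack[-1] if stack else None
        if isinstance(top, Utterance) and top.subtask != utt.subtask:
            reduce_top(stack)
        stack.append(utt)  # shift
    if stack and isinstance(stack[-1], Utterance):
        reduce_top(stack)
    # The remaining subtask nodes become children of the task root.
    return Node(task_label, stack)


dialog = [
    Utterance("user", "Request", "opening"),
    Utterance("agent", "Ack", "opening"),
    Utterance("user", "Provide-info", "contact-info"),
    Utterance("agent", "Request", "contact-info"),
    Utterance("agent", "Ack", "closing"),
]
tree = parse_dialog(dialog)
```

Because the parser never looks ahead, the partial structure on the stack is available after every utterance, which is what allows it to be used while a dialog is still in progress.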
20 Claims
1. A method comprising:
training a plurality of hierarchical, parsed-based dialog models comprising a shift-reduce model, a start-complete model, and a connection path model, wherein the plurality of hierarchical, parsed-based dialog models operate incrementally from left to right and only analyze an immediately preceding dialog context;
parsing, via a processor, spoken dialogs with a hierarchical, parse-based dialog model from the plurality of hierarchical, parsed-based dialog models, to yield parsed spoken dialogs, wherein the spoken dialogs are annotated to indicate dialog acts, feature vectors, and task/subtask information;
constructing a functional task structure of the parsed spoken dialogs, wherein the functional task structure does not comprise a rhetorical structure of the parsed spoken dialogs;
predicting a likely next dialog act using the functional task structure, the feature vectors, and the hierarchical, parsed-based dialog model; and
selecting a language model for a next utterance based on the likely next dialog act.
Dependent claims: 2, 3, 4, 5, 6, 7
8. A system comprising:
a processor; and
a non-transitory computer-readable storage medium having instructions stored, which when executed on the processor, cause the processor to perform operations comprising:
training a plurality of hierarchical, parsed-based dialog models comprising a shift-reduce model, a start-complete model, and a connection path model, wherein the plurality of hierarchical, parsed-based dialog models operate incrementally from left to right and only analyze an immediately preceding dialog context;
parsing, via a processor, spoken dialogs with a hierarchical, parse-based dialog model from the plurality of hierarchical, parsed-based dialog models, to yield parsed spoken dialogs, wherein the spoken dialogs are annotated to indicate dialog acts, feature vectors, and task/subtask information;
constructing a functional task structure of the parsed spoken dialogs, wherein the functional task structure does not comprise a rhetorical structure of the parsed spoken dialogs;
predicting a likely next dialog act using the functional task structure, the feature vectors, and the hierarchical, parsed-based dialog model; and
selecting a language model for a next utterance based on the likely next dialog act.
Dependent claims: 9, 10, 11, 12, 13, 14
15. A computer-readable device having instructions stored which, when executed by a computing device, cause the computing device to perform operations comprising:
training a plurality of hierarchical, parsed-based dialog models comprising a shift-reduce model, a start-complete model, and a connection path model, wherein the plurality of hierarchical, parsed-based dialog models operate incrementally from left to right and only analyze an immediately preceding dialog context;
parsing, via a processor, spoken dialogs with a hierarchical, parse-based dialog model from the plurality of hierarchical, parsed-based dialog models, to yield parsed spoken dialogs, wherein the spoken dialogs are annotated to indicate dialog acts, feature vectors, and task/subtask information;
constructing a functional task structure of the parsed spoken dialogs, wherein the functional task structure does not comprise a rhetorical structure of the parsed spoken dialogs;
predicting a likely next dialog act using the functional task structure, the feature vectors, and the hierarchical, parsed-based dialog model; and
selecting a language model for a next utterance based on the likely next dialog act.
Dependent claims: 16, 17, 18, 19, 20
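The last two steps recited in each independent claim, predicting a likely next dialog act and selecting a language model based on it, can be sketched as follows. This is an illustrative sketch only: a simple frequency table keyed on the current subtask and the previous dialog act stands in for the trained model and feature vectors, and the function names, the dialog act labels, and the language model names (plain strings here) are all hypothetical.

```python
from collections import Counter, defaultdict
from typing import Dict, List, Tuple

# A dialog is a list of (subtask, dialog_act) pairs in spoken order.
Dialog = List[Tuple[str, str]]


def train_next_act_model(dialogs: List[Dialog]) -> Dict[Tuple[str, str], Counter]:
    """Count which dialog act follows each (subtask, dialog_act) context."""
    counts: Dict[Tuple[str, str], Counter] = defaultdict(Counter)
    for dialog in dialogs:
        for (subtask, act), (_, next_act) in zip(dialog, dialog[1:]):
            counts[(subtask, act)][next_act] += 1
    return counts


def predict_next_act(counts, subtask: str, prev_act: str, default: str = "Other") -> str:
    """Return the most frequent next act for the context, or a default if unseen."""
    options = counts.get((subtask, prev_act))
    if not options:
        return default
    return options.most_common(1)[0][0]


def select_language_model(lm_by_act: Dict[str, str], act: str, fallback: str = "lm_general") -> str:
    """Pick the language model registered for the predicted act (assumed mapping)."""
    return lm_by_act.get(act, fallback)


training = [
    [("order", "Request"), ("order", "Provide-info"), ("order", "Ack")],
    [("order", "Request"), ("order", "Provide-info")],
    [("order", "Request"), ("order", "Clarify")],
]
model = train_next_act_model(training)
language_models = {"Provide-info": "lm_provide_info", "Ack": "lm_ack"}

next_act = predict_next_act(model, "order", "Request")
chosen_lm = select_language_model(language_models, next_act)
```

Keying the language model on the predicted dialog act lets a recognizer narrow its expectations for the next utterance; a context never seen in training falls back to a general model rather than failing.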
Specification