Metric for automatic assessment of conversational responses
First Claim
1. A computer-implemented method for automatic assessment of machine generated responses, said method comprising:
- extracting candidate context-message-response n-tuples, by an extraction component of a computing device, from at least one source of conversational data;
forming a set of multi-reference responses selected from the candidate context-message-response n-tuples extracted by the extraction component;
calculating an assessment metric for the machine generated response, by at least one processor, based on the set of multi-reference responses; and
generating a metric score for the machine generated response based on the assessment metric, by the at least one processor, the metric score indicating a quality of the machine-generated response relative to the set of multi-reference responses.
1 Assignment
0 Petitions
Accused Products
Abstract
Examples are generally directed towards automatic assessment of machine generated conversational responses. Context-message-response n-tuples are extracted from at least one source of conversational data to generate a set of multi-reference responses. A response in the set of multi-reference responses includes it context-message data pair and rating. The rating indicates a quality of the response relative to the context-message data pair. A response assessment engine generates a metric score for a machine-generated response based on an assessment metric and the set of multi-reference responses. The metric score indicates a quality of the machine-generated conversational response relative to a user-generated message and a context of the user-generated message. A response generation system of a computing device, such as a digital assistant, is optimized and adjusted based on the metric score to improve the accuracy, quality, and relevance of responses output to the user.
-
Citations
20 Claims
-
1. A computer-implemented method for automatic assessment of machine generated responses, said method comprising:
-
extracting candidate context-message-response n-tuples, by an extraction component of a computing device, from at least one source of conversational data; forming a set of multi-reference responses selected from the candidate context-message-response n-tuples extracted by the extraction component; calculating an assessment metric for the machine generated response, by at least one processor, based on the set of multi-reference responses; and generating a metric score for the machine generated response based on the assessment metric, by the at least one processor, the metric score indicating a quality of the machine-generated response relative to the set of multi-reference responses. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A system for automatic assessment of machine generated responses, said system comprising:
-
at least one processor; and a memory storage device associated with the at least one processor, the memory storage device comprising a memory area storing a response assessment engine, wherein the at least one processor executes the response assessment engine to; calculate an assessment metric for at least one machine-generated response, based on a set of multi-reference responses, a set of ratings and contextual data being associated with the set of multi-reference responses; generate at least one metric score indicating a quality of the at least one machine-generated response relative to at least one multi-reference response from the set of multi-reference responses; and update a set of parameters associated with the response generation system based on the at least one metric score. - View Dependent Claims (13, 14, 15, 16)
-
-
17. One or more computer storage media embodying computer-executable components, said components comprising:
-
an extraction component that when executed causes at least one processor to; extract a plurality of candidate context-message-response n-tuples from at least one source of conversational data; and select at least one candidate context-message-response n-tuple from the plurality of candidate context-message-response n-tuples associated with a machine-generated response to form a set of multi-reference responses; and a response assessment engine that when executed causes at least one processor to; generate a metric score for the machine-generated response based on the set of multi-reference responses, a conversational context of the machine-generated response, and an assessment metric, the metric score indicating a quality of the machine-generated response relative to the set of multi-reference responses. - View Dependent Claims (18, 19, 20)
-
Specification