Evaluating conversation data based on risk factors
First Claim
1. One or more non-transitory computer-readable media storing computer-readable instructions that, when executed, instruct one or more processors to perform acts comprising:
- causing a conversation user interface to be displayed on a display device to enable a conversation between a user and a virtual assistant;
receiving, via the conversation user interface, user input;
processing the user input with one or more natural language processing techniques to identify an intent unit for the user input, the intent unit being associated with a language model for the natural language processing techniques;
determining, based at least in part on a presence of one or more risk indicators for the user input, a measure of confidence that the intent unit is correctly identified for the user input;
associating the measure of confidence with the user input;
determining a health status of the intent unit based at least in part on the measure of confidence, the health status indicating a level of risk associated with the intent unit;
presenting the health status of the intent unit via an output device associated with an administrator;
receiving a selection of the intent unit to obtain feedback regarding the intent unit;
presenting a feedback interface that enables a voter to provide feedback regarding matching of the user input to the intent unit;
receiving feedback for the voter regarding an accuracy of matching the user input to the intent unit; and
evaluating the intent unit based at least in part on the feedback;
determining that the feedback indicates that the matching of the user input to the intent unit is not accurate;
increasing a weighting to be applied to the one or more risk indicators based at least in part on the determining that the feedback indicates that the matching of the user input to the intent unit is not accurate; and
applying the weighting to the one or more risk indicators;
wherein the determining the measure of confidence includes determining the measure of confidence based at least in part on the presence of the one or more weighted risk indicators.
2 Assignments
0 Petitions
Accused Products
Abstract
This disclosure describes techniques and architectures for evaluating conversations. In some instances, conversations with users, virtual assistants, and others may be analyzed to identify potential risks within a language model that is employed by the virtual assistants and other entities. The potential risks may be evaluated by administrators, users, systems, and others to identify potential issues with the language model that need to be addressed. This may allow the language model to be improved and enhance user experience with the virtual assistants and others that employ the language model.
-
Citations
20 Claims
-
1. One or more non-transitory computer-readable media storing computer-readable instructions that, when executed, instruct one or more processors to perform acts comprising:
-
causing a conversation user interface to be displayed on a display device to enable a conversation between a user and a virtual assistant; receiving, via the conversation user interface, user input; processing the user input with one or more natural language processing techniques to identify an intent unit for the user input, the intent unit being associated with a language model for the natural language processing techniques; determining, based at least in part on a presence of one or more risk indicators for the user input, a measure of confidence that the intent unit is correctly identified for the user input; associating the measure of confidence with the user input; determining a health status of the intent unit based at least in part on the measure of confidence, the health status indicating a level of risk associated with the intent unit; presenting the health status of the intent unit via an output device associated with an administrator; receiving a selection of the intent unit to obtain feedback regarding the intent unit; presenting a feedback interface that enables a voter to provide feedback regarding matching of the user input to the intent unit; receiving feedback for the voter regarding an accuracy of matching the user input to the intent unit; and evaluating the intent unit based at least in part on the feedback; determining that the feedback indicates that the matching of the user input to the intent unit is not accurate; increasing a weighting to be applied to the one or more risk indicators based at least in part on the determining that the feedback indicates that the matching of the user input to the intent unit is not accurate; and applying the weighting to the one or more risk indicators; wherein the determining the measure of confidence includes determining the measure of confidence based at least in part on the presence of the one or more weighted risk indicators. - View Dependent Claims (2, 3, 4, 5)
-
-
6. One or more non-transitory computer-readable media storing computer-readable instructions that, when executed, instruct one or more processors to perform acts comprising:
-
causing a conversation user interface to be displayed on a display device to enable a conversation between a user and a virtual assistant; receiving, via the conversation user interface, user input; processing the user input with one or more natural language processing techniques to identify an intent unit for the user input, the intent unit being associated with a language model for the natural language processing techniques; determining, based at least in part on a presence of one or more risk indicators for the user input, a measure of confidence that the intent unit is correctly identified for the user input; associating the measure of confidence with the user input; determining a health status of the intent unit based at least in part on the measure of confidence, the health status indicating a level of risk associated with the intent unit; presenting the health status of the intent unit via an output device associated with an administrator; receiving a selection of the intent unit to obtain feedback regarding the intent unit; presenting a feedback interface that enables a voter to provide feedback regarding matching of the user input to the intent unit; receiving feedback for the voter regarding an accuracy of matching the user input to the intent unit; and evaluating the intent unit based at least in part on the feedback; determining that the feedback indicates that the matching of the user input to the intent unit is accurate; decreasing a weighting to be applied to the one or more risk indicators based at least in part the on determining that the feedback indicates that the matching of the user input to the intent unit is accurate; and applying the weighting to the one or more risk indicators; wherein the determining the measure of confidence includes determining the measure of confidence based at least in part on the presence of the one or more weighted risk indicators.
-
-
7. A method comprising:
-
under control of a computing device configured with executable instructions, identifying one or more risk factors to evaluate conversation data associated with one or more users, the conversation data representing at least one conversation; processing the conversation data with a natural language processing system to identify an intent unit for the conversation data; determining a confidence value for the conversation data based at least in part on the one or more risk factors, the confidence value indicating a level of confidence that the intent unit is accurately identified for the conversation data; and wherein the one or more risk factors include a plurality of risk factors; and
the determining includes;applying a weighting each of the plurality of risk factors; and determining the confidence value based at least in part on the plurality of weighted risk factors; determining a health status of the intent unit based at least in part on the measure of confidence, the health status indicating a level of risk associated with the intent unit; presenting the health status of the intent unit via an output device associated with an administrator; receiving a selection of the intent unit to obtain feedback regarding the intent unit; presenting a feedback interface that enables a voter to provide feedback regarding matching of the user input to the intent unit; receiving feedback for the voter regarding an accuracy of matching the user input to the intent unit; and evaluating the intent unit based at least in part on the feedback; determining that the feedback indicates that the matching of the user input to the intent unit is accurate; decreasing a weighting to be applied to the one or more risk indicators based at least in part the on determining that the feedback indicates that the matching of the user input to the intent unit is accurate; and applying the weighting to the one or more risk indicators; wherein the determining the measure of confidence includes determining the measure of confidence based at least in part on the presence of the one or more weighted risk indicators. - View Dependent Claims (8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A system comprising:
-
one or more processors; a database storing (i) conversation data representing at least one conversation and (ii) one or more risk factors; memory communicatively coupled to the one or more processors and storing executable instructions that, when executed by the one or more processors, cause the one or more processors to perform acts comprising; processing the conversation data with a natural language processing system to identify an intent for the conversation data; determining a confidence value for the conversation data based at least in part on the one or more risk factors, the confidence value indicating a level of confidence that the intent is accurately identified for the conversation data; and utilizing the confidence value; wherein the acts further comprise; presenting a feedback interface that enables a voter to vote regarding matching of the user input to the intent; determining a profile factor for the voter based at least in part on a comparison of a previous number of votes from the voter to votes from a majority of voters, the majority of voters including more than a predetermined number of voters; and applying a weighting to the vote based at least in part on the profile factor of the voter.
-
Specification