Question classification and feature mapping in a deep question answering system
First Claim
1. A computer program product to identify features by a question answering system, the computer program product comprising:
- a non-transitory computer-readable storage medium having computer-readable program code embodied therewith, the computer-readable program code executable by a processor to perform an operation comprising;
classifying a first case received by the question answering system as being of a first type; and
training the question answering system to generate a response to the first case, wherein the training comprises;
identifying a first feature and identifying a first feature and a second feature of the first case, wherein the first and second features comprise a first and a second variable of the first case, respectively;
determining that the question answering system identified evidence based on the first and second features of the first case;
identifying a set of possible values for the first and second variables in the identified evidence, wherein the set of possible values for the first and second variables include one or more of;
(i) a first value comprising reference value for the respective variables, (ii) a second value comprising an actual value for the respective variables, and (iii) a third value comprising an opinion value for the respective variables;
determining that the question answering system generated the response to the first case based on the identified evidence and a weight applied to a combined feature comprising the first and second features, wherein the weight is of a range of weights applied by the question answering system to the combined feature;
computing a first feature score for the first feature, wherein the first feature score indicates the weight applied by the question answering system to the combined feature and the sets of possible values for the first and second variables when generating the response to the first case; and
storing an indication of a relationship between the first feature, the second feature, the combined feature, the identified evidence, and cases classified as being of the first type upon determining that the first feature score exceeds a threshold, wherein the relationship reflects the weight applied by the question answering system to the combined feature when generating responses to cases classified as being of the first type;
wherein the operation further comprises;
subsequent to training the question answering system and responsive to receiving a second case by the question answering system;
computing a similarity score for the first case and the second case;
upon determining that the similarity score exceeds a specified similarity threshold, classifying the second case as being of the first type; and
while generating a response to the second case;
refraining from processing a first candidate answer to determine whether to return the first candidate answer as responsive to the second case upon determining the first candidate answer does not include the combined feature, thereby reducing an amount of time and processing resources required to generate a response to the second case relative to the amount of time and processing resources required to generate a response to the second case by processing the first candidate answer; and
processing a second candidate answer to determine whether to return the second candidate answer as responsive to the second case upon determining the second candidate answer includes the combined feature.
2 Assignments
0 Petitions
Accused Products
Abstract
System, method, and computer program product to identify relevant features in a deep question answering system, by classifying a first case received by the deep question answering system, and, while training the deep question answering system to answer the first case, identifying a first feature in the first case, computing a first feature score for the first feature, the first feature score indicating a relevance of the first feature in generating a correct response to the first case, and, identifying the first feature as relevant in answering the classified first case upon determining that the first feature score exceeds a relevance threshold.
77 Citations
14 Claims
-
1. A computer program product to identify features by a question answering system, the computer program product comprising:
-
a non-transitory computer-readable storage medium having computer-readable program code embodied therewith, the computer-readable program code executable by a processor to perform an operation comprising; classifying a first case received by the question answering system as being of a first type; and training the question answering system to generate a response to the first case, wherein the training comprises; identifying a first feature and identifying a first feature and a second feature of the first case, wherein the first and second features comprise a first and a second variable of the first case, respectively; determining that the question answering system identified evidence based on the first and second features of the first case; identifying a set of possible values for the first and second variables in the identified evidence, wherein the set of possible values for the first and second variables include one or more of;
(i) a first value comprising reference value for the respective variables, (ii) a second value comprising an actual value for the respective variables, and (iii) a third value comprising an opinion value for the respective variables;determining that the question answering system generated the response to the first case based on the identified evidence and a weight applied to a combined feature comprising the first and second features, wherein the weight is of a range of weights applied by the question answering system to the combined feature; computing a first feature score for the first feature, wherein the first feature score indicates the weight applied by the question answering system to the combined feature and the sets of possible values for the first and second variables when generating the response to the first case; and storing an indication of a relationship between the first feature, the second feature, the combined feature, the identified evidence, and cases classified as being of the first type upon determining that the first feature score exceeds a threshold, wherein the relationship reflects the weight applied by the question answering system to the combined feature when generating responses to cases classified as being of the first type; wherein the operation further comprises; subsequent to training the question answering system and responsive to receiving a second case by the question answering system; computing a similarity score for the first case and the second case; upon determining that the similarity score exceeds a specified similarity threshold, classifying the second case as being of the first type; and while generating a response to the second case; refraining from processing a first candidate answer to determine whether to return the first candidate answer as responsive to the second case upon determining the first candidate answer does not include the combined feature, thereby reducing an amount of time and processing resources required to generate a response to the second case relative to the amount of time and processing resources required to generate a response to the second case by processing the first candidate answer; and processing a second candidate answer to determine whether to return the second candidate answer as responsive to the second case upon determining the second candidate answer includes the combined feature. - View Dependent Claims (2, 3, 4, 5, 6, 14)
-
-
7. A system, comprising:
-
one or more computer processors; and a memory containing a program, which, when executed by the one or more computer processors, performs an operation to identify features by a question answering system, the operation comprising; training the question answering system to generate a response to the first case, wherein the training comprises; identifying a first feature and a second feature of the first case, wherein the first and second features comprise a first and a second variable of the first case, respectively; determining that the question answering system identified evidence based on the first and second features of the first case; identifying a set of possible values for the first and second variables in the identified evidence, wherein the set of possible values for the first and second variables include one or more of;
(i) a first value comprising reference value for the respective variables, (ii) a second value comprising an actual value for the respective variables, and (iii) a third value comprising an opinion value for the respective variables;determining that the question answering system generated the response to the first case based on the identified evidence and a weight applied to a combined feature comprising the first and second features, wherein the weight is of a range of weights applied by the question answering system to the combined feature; computing a first feature score for the first feature, wherein the first feature score indicates the weight applied by the question answering system to the combined feature and the sets of possible values for the first and second variables when generating the response to the first case; and storing an indication of a relationship between the first feature, the second feature, the combined feature, the identified evidence, and cases classified as being of the first type upon determining that the first feature score exceeds a threshold, wherein the relationship reflects the weight applied by the question answering system to the combined feature when generating responses to cases classified as being of the first type; wherein the operation further comprises; subsequent to training the question answering system and responsive to receiving a second case by the question answering system; computing a similarity score for the first case and the second case; upon determining that the similarity score exceeds a specified similarity threshold, classifying the second case as being of the first type; and while generating a response to the second case; refraining from processing a first candidate answer to determine whether to return the first candidate answer as responsive to the second case upon determining the first candidate answer does not include the combined feature, thereby reducing an amount of time and processing resources required to generate a response to the second case relative to the amount of time and processing resources required to generate a response to the second case by processing the first candidate answer; and processing a second candidate answer to determine whether to return the second candidate answer as responsive to the second case upon determining the second candidate answer includes the combined feature. - View Dependent Claims (8, 9, 10, 11, 12, 13)
-
Specification