Providing answers to questions having both rankable and probabilistic components
First Claim
1. A computer program product for providing answers to questions, the computer program product comprising:
- one or more computer-readable storage media and program instructions stored on the one or more computer-readable storage media, the program instructions comprising;
program instructions to receive a query;
program instructions to identify a lexical answer type (LAT), a rankable criterion, and a probabilistic criterion in the query, wherein a rankable criterion refers to components of the query that contain ordinals and/or superlatives, and a probabilistic criterion refers to components of the query that contain phrases that modify the LAT;
program instructions to generate a set of candidate answers to the query that are instances of the LAT identified in a structured data source;
program instructions to rank to each candidate answer, based on the rankable criterion;
program instructions to assign to each candidate answer a likelihood that the candidate answer satisfies the probabilistic criterion, based on a statistic of occurrences of terms related to the candidate answer and to the probabilistic criterion in text passages from an unstructured data source;
program instructions to select one or more candidate answers, based on the product of the likelihood that the candidate answer satisfies the probabilistic criterion and the likelihoods that each candidate answer of lower rank does not satisfy the probabilistic criterion; and
program instructions to transmit the selected candidate answers.
1 Assignment
0 Petitions
Accused Products
Abstract
Providing answers to questions. A LAT, a rankable criterion, and a probabilistic criterion are identified in a received query. A set of candidate answers to the question that are instances of the LAT identified in a first data source is generated. Based on the rankable criterion, the candidate answers are ranked. Each candidate answer is assigned a likelihood that the candidate answer satisfies the probabilistic criterion, based on a statistic of occurrences of terms related to the candidate answer and the probabilistic criterion in text passages from a second data source. One or more candidate answers are selected based on the rank and the likelihood of the candidate answers. In another aspect of the invention, a Boolean criterion is also identified in the query and the set of candidate answers is reduced by applying the Boolean criterion.
10 Citations
8 Claims
-
1. A computer program product for providing answers to questions, the computer program product comprising:
-
one or more computer-readable storage media and program instructions stored on the one or more computer-readable storage media, the program instructions comprising; program instructions to receive a query; program instructions to identify a lexical answer type (LAT), a rankable criterion, and a probabilistic criterion in the query, wherein a rankable criterion refers to components of the query that contain ordinals and/or superlatives, and a probabilistic criterion refers to components of the query that contain phrases that modify the LAT; program instructions to generate a set of candidate answers to the query that are instances of the LAT identified in a structured data source; program instructions to rank to each candidate answer, based on the rankable criterion; program instructions to assign to each candidate answer a likelihood that the candidate answer satisfies the probabilistic criterion, based on a statistic of occurrences of terms related to the candidate answer and to the probabilistic criterion in text passages from an unstructured data source; program instructions to select one or more candidate answers, based on the product of the likelihood that the candidate answer satisfies the probabilistic criterion and the likelihoods that each candidate answer of lower rank does not satisfy the probabilistic criterion; and program instructions to transmit the selected candidate answers. - View Dependent Claims (2, 3, 4)
-
-
5. A computer system for providing answers to questions, the computer system comprising:
-
one or more computer processors, one or more computer-readable storage media, and program instructions stored on one or more of the computer-readable storage media for execution by at least one of the one or more processors, the program instructions comprising; program instructions to receive a query; program instructions to identify a lexical answer type (LAT), a rankable criterion, and a probabilistic criterion in the query, wherein a rankable criterion refers to components of the query that contain ordinals and/or superlatives, and a probabilistic criterion refers to components of the query that contain phrases that modify the LAT; program instructions to generate a set of candidate answers to the query that are instances of the LAT identified in a structured data source; program instructions to rank to each candidate answer, based on the rankable criterion; program instructions to assign to each candidate answer a likelihood that the candidate answer satisfies the probabilistic criterion, based on a statistic of occurrences of terms related to the candidate answer and to the probabilistic criterion in text passages from an unstructured data source; program instructions to select one or more candidate answers, based on the product of the likelihood that the candidate answer satisfies the probabilistic criterion and the likelihoods that each candidate answer of lower rank does not satisfy the probabilistic criterion; and program instructions to transmit the selected candidate answers. - View Dependent Claims (6, 7, 8)
-
Specification