Using cohorts to infer attributes for an input case in a question answering system
Abstract
A cohort analysis mechanism analyzes cohorts to infer one or more additional attributes for an input case to provide a refined input case to answer a question in a question answering system. The refined input case is then used to answer a question in the question answering system. The refined input case can be used in a traditional question answering flow or in a flow that again uses cohort analysis to extract relevant data to answer the question. The cohort analysis mechanism analyzes cohorts to find common attributes in the cohorts and then determines whether to infer the common attributes into the refined input case. The cohort analysis mechanism may determine to dialog with a user to confirm an inferred attribute.
6 Claims
1. A method for answering a question in a question answering system, the method comprising the steps of:
- analyzing an input case representing the question to determine whether cohorts can be used with the input case, wherein the cohorts are similar entities with similar characteristics to an entity of the input case and available in a corpus of data;
- using cohort analysis to infer an attribute for the input case comprising:
  - determining if the input case has an entity that has available cohorts;
  - identifying cohorts in the corpus of data using fuzzy matching with a threshold percentage and where the cohorts are similar to the entity in the input case;
  - analyzing the cohorts to find common attributes in the cohorts;
  - finding missing attributes for the entity of the input case from the common attributes in the cohorts to infer a missing attribute for the input case;
  - scoring the missing attributes according to consistency and sample size;
  - applying confidence thresholds to the missing attributes to determine whether to infer the attribute for the refined input case;
  - determining whether to dialog with a user to confirm the inferred attribute using a lower threshold compared to inferring the attribute without dialoging with the user;
- generating a refined input case that includes the inferred attribute applied to the entity of the input case; and
- using the refined input case with the inferred attribute to answer the question.
Dependent claims: 2, 3, 4
5. A method for utilizing cohorts in a question answering system, the method comprising:
- analyzing an input case in a question answering application that answers a natural language question to determine whether cohorts can be used with the input case, wherein the cohorts are similar entities with similar characteristics to an entity of the input case and available in a corpus of data;
- using cohort analysis to infer an attribute for the input case comprising:
  - determining if the input case has an entity that has available cohorts;
  - identifying cohorts in the corpus of data using fuzzy matching with a threshold percentage and where the cohorts are similar to the entity in the input case;
  - analyzing the cohorts to find common attributes in the cohorts and finding missing attributes for the entity of the input case for the common attributes in the cohorts to infer a missing attribute for the input case;
  - scoring the missing attributes according to consistency and sample size;
  - applying confidence thresholds to the missing attributes to determine whether to infer the attribute for the refined input case; and
  - determining whether to dialog with a user to confirm the inferred attribute using a lower threshold compared to inferring the attribute without dialoging with the user;
- providing a refined input case that includes the inferred attribute; and
- answering a question with the refined input case.
Dependent claims: 6
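The claimed steps can be illustrated with a minimal Python sketch. It is not the patented implementation: every function name, the vehicle data, and all threshold values are hypothetical. The standard library's `difflib.SequenceMatcher` stands in for the unspecified fuzzy matching, and the score (consistency multiplied by `n / (n + k)`) is only one assumed way to combine consistency with sample size.

```python
from collections import Counter
from difflib import SequenceMatcher


def similarity(a, b):
    """Mean string similarity over the attributes both entities have (assumed metric)."""
    shared = set(a) & set(b)
    if not shared:
        return 0.0
    ratios = [SequenceMatcher(None, str(a[k]).lower(), str(b[k]).lower()).ratio()
              for k in shared]
    return sum(ratios) / len(ratios)


def find_cohorts(entity, corpus, match_threshold=0.8):
    """Fuzzy matching with a threshold percentage: keep sufficiently similar entities."""
    return [c for c in corpus if similarity(entity, c) >= match_threshold]


def score_missing_attributes(entity, cohorts, k=1.0):
    """Score each attribute the entity lacks by consistency and sample size.

    consistency = share of cohorts agreeing on the most common value;
    n / (n + k) discounts small samples (k is an assumed smoothing constant).
    """
    values = {}
    for cohort in cohorts:
        for attr, value in cohort.items():
            if attr not in entity:
                values.setdefault(attr, Counter())[value] += 1
    scored = {}
    for attr, counts in values.items():
        value, hits = counts.most_common(1)[0]
        n = sum(counts.values())
        scored[attr] = (value, (hits / n) * (n / (n + k)))
    return scored


def refine_input_case(entity, corpus, confirm, match_threshold=0.8,
                      infer_threshold=0.7, dialog_threshold=0.4):
    """Return a refined copy of the entity with inferred attributes added.

    Scores at or above infer_threshold are inferred silently; scores between
    the lower dialog_threshold and infer_threshold are inferred only if the
    user confirms through the confirm(attr, value) callback.
    """
    refined = dict(entity)
    cohorts = find_cohorts(entity, corpus, match_threshold)
    for attr, (value, score) in score_missing_attributes(entity, cohorts).items():
        if score >= infer_threshold:
            refined[attr] = value
        elif score >= dialog_threshold and confirm(attr, value):
            refined[attr] = value
    return refined


# Hypothetical corpus: five similar vehicles and one dissimilar one.
corpus = [
    {"make": "Toyota", "model": "Prius", "fuel": "hybrid", "drivetrain": "fwd", "color": "red"},
    {"make": "Toyota", "model": "Prius", "fuel": "hybrid", "drivetrain": "fwd", "color": "blue"},
    {"make": "Toyota", "model": "Prius", "fuel": "hybrid", "drivetrain": "awd", "color": "red"},
    {"make": "Toyota", "model": "Prius", "fuel": "hybrid", "drivetrain": "fwd", "color": "white"},
    {"make": "Toyota", "model": "Prius V", "fuel": "hybrid", "drivetrain": "awd", "color": "black"},
    {"make": "Ford", "model": "F-150", "fuel": "gasoline", "drivetrain": "rwd", "color": "black"},
]
entity = {"make": "Toyota", "model": "Prius"}

asked = []  # records which attributes triggered a user dialog


def always_confirm(attr, value):
    asked.append((attr, value))
    return True


refined = refine_input_case(entity, corpus, always_confirm)
print(refined)
```

In this sketch a unanimous attribute is inferred without dialog, a majority attribute is put to the user, and a weakly supported attribute is dropped. Using a lower threshold for the dialog path mirrors the claim language: a confirmation from the user compensates for weaker evidence in the cohorts.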
Specification