System and method for providing answers to questions
First Claim
1. A computer implemented method of generating answers to natural language questions, said method comprising:
- receiving said natural language question;
processing said question using a plurality of natural language processing techniques to obtain a plurality of searchable components;
conducting a first search in a corpus based on said plurality of searchable components to obtain a plurality of relevant documents;
analyzing said plurality documents to generate a plurality of candidate answers;
after said candidate answers are generated, retrieving a plurality of supporting passages from said corpus based on said candidate answers;
analyzing said candidate answers based on grammatical and semantic structures in said query and in said retrieved supporting passages;
generating a confidence score for each analyzed candidate answer, said confidence score based on a ratio of the number of query terms present in a supporting passage with a total number of the searchable components of the received question;
and outputting said candidate answers and associated confidence scores.
1 Assignment
0 Petitions
Accused Products
Abstract
System and method for providing answers to questions based on any corpus of data implements a method that generates a number of candidate passages from the corpus that answer an input query, and finds the correct resulting answer by collecting supporting evidence from the multiple passages. By analyzing all retrieved passages and that passage'"'"'s metadata in parallel, an output plurality of data structures is generated including candidate answers based upon the analyzing. Then, supporting passage retrieval operations are performed upon the set of candidate answers, and for each candidate answer, the data corpus is traversed to find those passages having candidate answer in addition to query terms. All candidate answers are automatically scored by a plurality of scoring modules, each producing a module score. The modules scores are processed to determine one or more query answers; and, a query response is generated based on the one or more query answers.
142 Citations
25 Claims
-
1. A computer implemented method of generating answers to natural language questions, said method comprising:
-
receiving said natural language question; processing said question using a plurality of natural language processing techniques to obtain a plurality of searchable components; conducting a first search in a corpus based on said plurality of searchable components to obtain a plurality of relevant documents; analyzing said plurality documents to generate a plurality of candidate answers; after said candidate answers are generated, retrieving a plurality of supporting passages from said corpus based on said candidate answers; analyzing said candidate answers based on grammatical and semantic structures in said query and in said retrieved supporting passages; generating a confidence score for each analyzed candidate answer, said confidence score based on a ratio of the number of query terms present in a supporting passage with a total number of the searchable components of the received question; and outputting said candidate answers and associated confidence scores. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16)
-
-
17. A system for generating answers to natural language questions comprising:
-
a memory storage device; a hardware processor in communication with said memory storage device and configured to; receive said natural language question; process said question using a plurality of natural language processing techniques to obtain a plurality of searchable components; conduct a first search in a corpus based on said plurality of searchable components to obtain a plurality of relevant documents; analyze said plurality documents to generate a plurality of candidate answers; after said candidate answers are generated, retrieve a plurality of supporting passages from said corpus based on said candidate answers; analyzing said candidate answers based on grammatical and semantic structures in said query and in said supporting passages; generate a confidence score for each analyzed candidate answer, said confidence score based on a ratio of the number of query terms present in a supporting passage with a total number of the searchable components of the received question; and output said candidate answers and associated confidence scores. - View Dependent Claims (18, 19, 20, 21)
-
-
22. A program storage device readable by a machine, tangibly embodying a program of instructions executable by the machine to perform method steps for generating answers to natural language questions, said method steps including the steps of:
- receiving said natural language question;
processing said question using a plurality of natural language processing techniques to obtain a plurality of searchable components;
conducting a first search in a corpus based on said plurality of searchable components to obtain a plurality of relevant documents;
analyzing said plurality documents to generate a plurality of candidate answers;
after said candidate answers are generated, retrieving a plurality of supporting passages from said corpus based on said candidate answers;analyzing said candidate answers based on grammatical and semantic structures in said query and in said supporting passages; generating a confidence score for each analyzed candidate answer, said confidence score based on a ratio of the number of query terms present in a supporting passage with a total number of the searchable components of the received question; and outputting said candidate answers and associated confidence scores. - View Dependent Claims (23, 24, 25)
- receiving said natural language question;
Specification