System and method for providing answers to questions
First Claim
1. A computer-implemented method of generating answers to questions based on a corpus of data, said method comprising:
- receiving an input query;
breaking said input query into query terms, said query terms comprising searchable components;
conducting a first search in said corpus of data using one of more of said searchable components to obtain documents including candidate answers, wherein all documents including candidate answers are stored in a data storage device;
analyzing said obtained documents and each document'"'"'s metadata to generate a set of candidate answers;
conducting a second search in said corpus of data using said candidate answers and said searchable components of said query terms to obtain one or more supporting passages, wherein said supporting passages have at least one of said candidate answers and at least one of said searchable components of said query terms;
scoring said candidate answers using said supporting passages, wherein said scoring is carried out by a plurality of parallel implemented scoring modules, each scoring module producing a candidate score component, said candidate score components comprising a term match score, a textual alignment score and semantic relation score;
combining each of said candidate score components to determine an overall candidate score for each of said candidate answers;
selecting one or more query answers based on said candidate score;
generating a query response based on said one or more query answers for delivery to a user.
0 Assignments
0 Petitions
Accused Products
Abstract
Providing answers to questions based on any corpus of data implements a method that generates a number of candidate passages from the corpus that answer an input query, and finds the correct resulting answer by collecting supporting evidence from the multiple passages. By analyzing all retrieved passages and that passage'"'"'s metadata in parallel, an output plurality of data structures is generated including candidate answers based upon the analyzing. Then, supporting passage retrieval operations are performed upon the set of candidate answers, and for each candidate answer, the data corpus is traversed to find those passages having candidate answer in addition to query terms. All candidate answers are automatically scored by a plurality of scoring modules, each producing a module score. The modules scores are processed to determine one or more query answers; and, a query response is generated based on the one or more query answers.
-
Citations
29 Claims
-
1. A computer-implemented method of generating answers to questions based on a corpus of data, said method comprising:
-
receiving an input query; breaking said input query into query terms, said query terms comprising searchable components; conducting a first search in said corpus of data using one of more of said searchable components to obtain documents including candidate answers, wherein all documents including candidate answers are stored in a data storage device; analyzing said obtained documents and each document'"'"'s metadata to generate a set of candidate answers; conducting a second search in said corpus of data using said candidate answers and said searchable components of said query terms to obtain one or more supporting passages, wherein said supporting passages have at least one of said candidate answers and at least one of said searchable components of said query terms; scoring said candidate answers using said supporting passages, wherein said scoring is carried out by a plurality of parallel implemented scoring modules, each scoring module producing a candidate score component, said candidate score components comprising a term match score, a textual alignment score and semantic relation score; combining each of said candidate score components to determine an overall candidate score for each of said candidate answers; selecting one or more query answers based on said candidate score; generating a query response based on said one or more query answers for delivery to a user. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A system for generating answers to questions based on any corpus of data comprising:
-
a memory storage device; a processor in communication with said memory storage device and configured to perform a method comprising; receiving an input query; breaking said input query into query terms, said query terms comprising searchable components; conducting a first search in said corpus of data using one of more of said searchable components to obtain documents including candidate answers, wherein all documents including candidate answers are stored in a data storage device; analyzing said obtained documents and each document'"'"'s metadata to generate a set of candidate answers; conducting a second search in said corpus of data using said candidate answers and said searchable components of said query terms to obtain one or more supporting passages, wherein said supporting passages have at least one of said candidate answers and at least one of said searchable components of said query terms; scoring said candidate answers using said supporting passages, wherein said scoring is carried out by a plurality of parallel implemented scoring modules, each scoring module producing a candidate score component, said candidate score components comprising a term match score, a textual alignment score and semantic relation score; combining each of said candidate score components to determine an overall candidate score for each of said candidate answers; selecting one or more query answers based on said candidate score; generating a query response based on said one or more query answers for delivery to a user. - View Dependent Claims (16, 17, 18, 19, 20, 21, 22)
-
-
23. A program storage device readable by a machine, tangibly embodying a program of instructions executable by the machine to perform method steps for generating answers to questions based on any corpus of data, said method steps including the steps of:
-
receiving an input query; breaking said input query into query terms, said query terms comprising searchable components; conducting a first search in said corpus of data using one of more of said searchable components to obtain documents including candidate answers, wherein all documents including candidate answers are stored in a data storage device; analyzing obtained documents and each document'"'"'s metadata to generate a set of candidate answers; conducting a second search in said corpus of data using said candidate answers and said searchable components of said query terms to obtain one or more supporting passages, wherein said supporting passages have at least one of said candidate answers and at least one of said searchable components of said query terms; scoring said candidate answers using said supporting passages, wherein said scoring is carried out by a plurality of parallel implemented scoring modules, each scoring module producing a candidate score component, said candidate score components comprising a term match score, a textual alignment score and semantic relation score; combining each of said candidate score components to determine an overall candidate score for each of said candidate answers; selecting one or more query answers based on said candidate score; generating a query response based on said one or more query answers for delivery to a user. - View Dependent Claims (24, 25, 26, 27, 28, 29)
-
Specification