EFFICIENT PASSAGE RETRIEVAL USING DOCUMENT METADATA
First Claim
1. A computer program product for efficiently retrieving relevant passages to questions based on a corpus of data, the computer program device comprising a non-transitory storage medium readable by a processing circuit and storing instructions run by the processing circuit for performing a method, the method comprising:
- receiving an input query;
performing a query context analysis upon said input query to obtain searchable query terms;
matching metadata associated with one or more documents against said query terms;
mapping matched document metadata to corresponding one or more documents;
identifying corresponding matched documents to form a subcorpus of documents; and
conducting a search in said data subcorpus using said searchable query terms to obtain one or more passages relevant to the input query from said identified.
0 Assignments
0 Petitions
Accused Products
Abstract
A system, method and computer program product for efficiently retrieving relevant passages to questions based on a corpus of data. A processor device receives an input query and performs a query analysis to obtain searchable query terms. The processor performs: matching metadata associated with one or more documents against the query terms. The document metadata includes one or more of: a title of the documents, one or more user tags or clouds. Then the processor device performs: mapping matched document metadata to corresponding one or more documents; identifying corresponding matched documents to form a subcorpus of documents; and conducting a search in the data subcorpus using the searchable query terms to obtain one or more passages relevant input query from the identified documents.
21 Citations
13 Claims
-
1. A computer program product for efficiently retrieving relevant passages to questions based on a corpus of data, the computer program device comprising a non-transitory storage medium readable by a processing circuit and storing instructions run by the processing circuit for performing a method, the method comprising:
-
receiving an input query; performing a query context analysis upon said input query to obtain searchable query terms; matching metadata associated with one or more documents against said query terms; mapping matched document metadata to corresponding one or more documents; identifying corresponding matched documents to form a subcorpus of documents; and conducting a search in said data subcorpus using said searchable query terms to obtain one or more passages relevant to the input query from said identified. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A system for efficiently retrieving relevant passages to questions based on a corpus of data comprising:
-
a memory storage device; a processor device in communication with the memory device that performs a method comprising; receiving an input query; performing a query context analysis upon said input query to obtain searchable query terms; matching metadata associated with one or more documents against said query terms; mapping matched document metadata to corresponding one or more documents; identifying corresponding matched documents to form a subcorpus of documents; and conducting a search in said data subcorpus using said searchable query terms to obtain one or more passages relevant to the input query from said identified documents. - View Dependent Claims (10, 11)
-
-
12. A computer program product for efficiently retrieving relevant passages to questions based on a corpus of data, the computer program device comprising a storage medium readable by a processing circuit and storing instructions run by the processing circuit for performing a method, the method comprising:
-
receiving, at a processor device, an input query; performing, at said processor device, a query context analysis upon said input query to obtain searchable query terms; accessing a dictionary of document metadata obtained from one or more documents of the data corpus, each stored document metadata being associated with a corresponding document identification (ID); performing, by said processor device, a dictionary matching of said metadata associated with one or more documents against said query terms; mapping matched document metadata to corresponding one or more document IDs; identifying corresponding matched documents to form a subcorpus of documents; and conducting a search in said subcorpus using said searchable query terms to obtain one or more passages relevant to the input query from said identified documents. - View Dependent Claims (13)
-
Specification