Real-time search of vertically partitioned, inverted indexes
First Claim
1. A method for processing a query, comprising:
receiving, using a processor of a computer, the query that includes a document constraint and an annotation constraint;
parsing, using the processor of the computer, the query to separate the document constraint from the annotation constraint and to create a query parse tree with a primary query processor at a lowest level of the query parse tree and with an auxiliary query processor at the lowest level of the query parse tree;
processing, using the processor of the computer, the document constraint with the primary query processor to generate a first posting list that is ordered by document identifier;
processing, using the processor of the computer, the annotation constraint with the auxiliary query processor to generate a second posting list that is ordered by annotation identifier and that includes the document identifier associated with each annotation that is identified by the annotation identifier and that is re-ordered by the document identifier;
evaluating, using the processor of the computer, the query parse tree with the primary query processor and with the auxiliary query processor by iterating through the first posting list and the second posting list; and
performing, using the processor of the computer, a run-time join of the first posting list and the second posting list to obtain a final result set that combines documents and annotations that have a same document identifier with a union operation.
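The steps recited in claim 1 can be illustrated with a small sketch. It is a reading of the claim language under stated assumptions, not the patented implementation: the in-memory dictionaries, the function names (`primary_postings`, `auxiliary_postings`, `union_join`), and the sample data are all hypothetical, the query is assumed to be already parsed into one document term and one annotation label, and the "union operation" is read here as keeping any document identifier that appears in either posting list.

```python
# Hypothetical primary index: term -> document ids, kept in document-id order.
PRIMARY_INDEX = {"search": [1, 3, 7], "index": [3, 5]}

# Hypothetical auxiliary index: label -> (annotation_id, document_id) pairs,
# kept in annotation-id order.
AUXILIARY_INDEX = {"reviewed": [(10, 7), (11, 2), (12, 3), (13, 7)]}


def primary_postings(term):
    """First posting list: document ids ordered by document id."""
    return list(PRIMARY_INDEX.get(term, []))


def auxiliary_postings(label):
    """Second posting list: (document_id, annotation_id) pairs,
    re-ordered from annotation-id order into document-id order."""
    pairs = AUXILIARY_INDEX.get(label, [])
    return sorted((doc_id, ann_id) for ann_id, doc_id in pairs)


def union_join(doc_list, ann_list):
    """Walk both doc-id-ordered lists once and emit one row per document id
    found in either list: (doc_id, matched_document_constraint, annotation_ids)."""
    result = []
    i = j = 0
    while i < len(doc_list) or j < len(ann_list):
        d = doc_list[i] if i < len(doc_list) else None
        a = ann_list[j][0] if j < len(ann_list) else None
        # Pick the smaller of the two current document ids.
        current = d if a is None or (d is not None and d <= a) else a
        in_primary = d == current
        if in_primary:
            i += 1
        annotations = []
        while j < len(ann_list) and ann_list[j][0] == current:
            annotations.append(ann_list[j][1])
            j += 1
        result.append((current, in_primary, annotations))
    return result


rows = union_join(primary_postings("search"), auxiliary_postings("reviewed"))
```

Because both lists are in document-id order by the time they are joined, the join needs only a single forward pass over each list, which is why the claim re-orders the annotation postings before joining.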
Abstract
Provided are techniques for processing a query. A query including constraints for at least two vertically partitioned, inverted indexes is received. The constraints in the query are separated based on the vertically partitioned, inverted indexes. A document identifier iterator is obtained for each of the constraints, wherein each document identifier iterator is associated with a posting list, and wherein each posting list is ordered by document identifier order. A run-time join of the posting lists is performed to obtain a final result set.
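As a rough illustration of the abstract, the sketch below obtains one document identifier iterator per constraint and joins the underlying posting lists in a single streaming pass. The names `doc_id_iterator` and `runtime_join` and the `require_all` flag are hypothetical, and the abstract does not say which kind of join is performed, so both a union-style and an intersection-style result are shown as assumptions.

```python
import heapq
from itertools import groupby


def doc_id_iterator(posting_list, constraint_index):
    """Yield (doc_id, constraint_index) pairs in document-id order."""
    for doc_id in posting_list:
        yield doc_id, constraint_index


def runtime_join(posting_lists, require_all=False):
    """Join posting lists that are each ordered by document id.

    Returns (doc_id, matched_constraint_indexes) rows. With require_all=False
    a document matching any constraint is kept (union-style); with
    require_all=True it must match every constraint (intersection-style).
    """
    iterators = [doc_id_iterator(pl, k) for k, pl in enumerate(posting_lists)]
    # heapq.merge is a lazy k-way merge; it is only correct because every
    # input iterator is already sorted by document id.
    merged = heapq.merge(*iterators)
    results = []
    for doc_id, group in groupby(merged, key=lambda pair: pair[0]):
        matched = {k for _, k in group}
        if not require_all or len(matched) == len(posting_lists):
            results.append((doc_id, sorted(matched)))
    return results


either = runtime_join([[1, 3, 7], [2, 3, 7]])
both = runtime_join([[1, 3, 7], [2, 3, 7]], require_all=True)
```

The shared document-id ordering is what lets the join run at query time without materializing or re-sorting the combined lists.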
6 Claims
Claim 1 is set out in full under First Claim above. Dependent claims: 2, 3, 4, 5, 6.
Specification