SUPPORTING SUB-DOCUMENT UPDATES AND QUERIES IN AN INVERTED INDEX
First Claim
Patent Images
1. A method of updating a partitioned index of a dataset comprising:
- indexing a document by separating a document into sections, wherein at least one of said sections is contained in at least one partition of said partitioned index; and
updating said partitioned index using an updated version of said document by updating only those sections of said index corresponding to sections of said document that have been updated in said updated version of said document.
1 Assignment
0 Petitions
Accused Products
Abstract
A system, method, and computer program product for updating a partitioned index of a dataset. A document is indexed by separating it into indexable sections, such that different ones of the indexable sections may be contained in different partitions of the partitioned index. The partitioned index is updated using an updated version of the document by updating only those sections of the index corresponding to sections of the document that have been updated in the updated version.
-
Citations
20 Claims
-
1. A method of updating a partitioned index of a dataset comprising:
-
indexing a document by separating a document into sections, wherein at least one of said sections is contained in at least one partition of said partitioned index; and updating said partitioned index using an updated version of said document by updating only those sections of said index corresponding to sections of said document that have been updated in said updated version of said document. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A method of searching a dataset having a plurality of documents comprising:
-
indexing said dataset using a partitioned inverted index, each document having a plurality of document sections, each document section indexed by at most one partition; and searching said index by searching across said document sections. - View Dependent Claims (10, 11, 12)
-
-
13. A partitioned inverted index comprising:
-
an ingestion thread receiving a work item and placing it in a queue; a sort-write thread for dequeuing said work item, sorting said work item to create a new index partition and writing said new index partition to disk; a merge manager thread for determining when to merge partitions; a merge thread for merging partitions in response to an instruction from said merge manager; and a state manager thread for receiving a notification from said sort-write thread of said new index partition and for updating an index state, wherein said sort-write thread, said merge manager thread, said merge thread, and said state manager threads operate in parallel. - View Dependent Claims (14, 15)
-
-
16. A computer program product comprising a computer usable medium having a computer readable program, wherein said computer readable program when executed on a computer causes said computer to:
-
index a document by separating a document into sections, wherein at least one of said sections is contained in at least one partition of a partitioned index; and update said partitioned index using an updated version of said document by updating only those sections of said index corresponding to sections of said document that have been updated in said updated version of said document. - View Dependent Claims (17, 18, 19, 20)
-
Specification