Performant and scalable merge strategy for text indexing
First Claim
1. A method of indexing data and merging indexes within a database management system, in which the need for an in-memory table tracking currency of the data is eliminated, the method comprising:
- assigning a temporal indicator associated with a crawl to an instance of an index generated by the crawl, wherein the index is part of an indexing system of a database management system;
assigning the instance of the index to a set of instances of the index to be merged; and
validating the set of instances to be merged, generating a set of valid instances.
2 Assignments
0 Petitions
Accused Products
Abstract
A full-text search index system and method is generated by creating instances of a database index from an in-memory inverted list of keywords associated with a text identifier and the occurrences of the keyword in the text. Instances of the index are placed in a priority queue. A merge scheduling process determines when a merge should be initiated, selects instances of the index to be merged and selects a type of merge to perform.
Instances of an index are assigned a temporal indicator (timestamp). A set of instances is selected to be merged. The set of instances is validated and merged.
43 Citations
24 Claims
-
1. A method of indexing data and merging indexes within a database management system, in which the need for an in-memory table tracking currency of the data is eliminated, the method comprising:
-
assigning a temporal indicator associated with a crawl to an instance of an index generated by the crawl, wherein the index is part of an indexing system of a database management system;
assigning the instance of the index to a set of instances of the index to be merged; and
validating the set of instances to be merged, generating a set of valid instances. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A system for indexing data and merging indexes within a database management system, in which the need for an in-memory table tracking currency of the data is eliminated, the system comprising:
-
an indexing module for indexing data and assigning a temporal indicator to an instance of an index;
a priority queue for storing instances of an index for merging; and
a merging module for validating selected instances from the priority queue for merging, generating a set of valid instances to be merged. - View Dependent Claims (20, 21, 22, 23)
-
-
24. A computer-readable medium including computer-executable instructions for
assigning a first temporal indicator associated with a first crawl to a first instance of an index generated by the crawl; - and
assigning the first instance of the index to a set of instances of the index to be merged;
assigning a second temporal indicator associated with a second crawl to a second instance of an index generated by the second crawl;
assigning the second instance of the index to the set of instances to be merged;
validating the set of instances to be merged, generated valid instances; and
merging the valid instances to create a merged index instance.
- and
Specification