Bulk Search Index Updates
First Claim
Patent Images
1. A method for updating a document search index having a plurality of index segments, the method comprising:
- executing at least one update request comprising at least one transformation, the at least one update request identifying a plurality of documents to be updated within the document index;
identifying a set of matching index segments, within the document search index, associated with the plurality of documents to be updated;
updating a first set of stored fields associated with the plurality of documents to be updated by applying the at least one transformation to modify the first set of stored fields;
generating a modified postings list for the document search index corresponding to the updated first set of stored fields; and
updating a list of postings of the document search index based on the modified postings list.
1 Assignment
0 Petitions
Accused Products
Abstract
Embodiments of the present invention perform bulk updates of a search index for an information repository. In embodiments, a batched set of update requests is run and a set of documents to be updated based on the set of requests is identified. In embodiments, a bulk update method to use is selected based on an estimate of the cost of performing the bulk update. In embodiments, a bulk update method based on updating only the indexes of the documents to be updated may be used instead of a bulk update method that involves re-indexing the full set of documents in the repository.
-
Citations
29 Claims
-
1. A method for updating a document search index having a plurality of index segments, the method comprising:
-
executing at least one update request comprising at least one transformation, the at least one update request identifying a plurality of documents to be updated within the document index; identifying a set of matching index segments, within the document search index, associated with the plurality of documents to be updated; updating a first set of stored fields associated with the plurality of documents to be updated by applying the at least one transformation to modify the first set of stored fields; generating a modified postings list for the document search index corresponding to the updated first set of stored fields; and updating a list of postings of the document search index based on the modified postings list. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A method for identifying a bulk update for a document search index having a plurality of index segments, the method comprising:
-
identifying a plurality of documents to be updated, within the document search index, based on at least one update request comprising at least one transformation; identifying a first set of matching index segments, within the plurality of index segments, associated with the plurality of documents to be updated; determining a first processing cost for updating the document search index associated with the plurality of documents to be updated; determining a second processing cost for updating the document search index associated with the first set of matching index segments; and selecting the bulk update for updating the document search index, the selected bulk update being at least partially based on a relative comparison of the first processing cost to the second processing cost. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A system for applying a set of update requests to a document search index having a plurality of index segments, the system comprising:
-
a matching document identifier, coupled to receive the set of update requests, that identifies a plurality of documents to be updated within the document search index; an update method selector, coupled to receive the plurality of documents to be updated, that selects a bulk update for the document search index by performing a method comprising the steps of; identifying a first set of matching index segments, within the plurality of index segments, associated with the plurality of documents to be updated; determining a first processing cost for updating the document search index associated with the plurality of documents to be updated; determining a second processing cost for updating the document search index associated with the first set of matching index segments; and selecting the bulk update for updating the document search index, the selected bulk update being at least partially based on a relative comparison of the first processing cost to the second processing cost; and an index updater that applies the selected bulk update to the document search index. - View Dependent Claims (20, 21, 22, 23)
-
-
24. An index updater that applies a set of update requests to a document search index having a plurality of index segments, the system comprising:
-
a stored fields updater, coupled to receive a plurality of documents to be updated and a set of matching index segments, the stored fields updater updates a set of stored fields associated with the plurality of documents to be updated; and a postings updater, coupled to receive the updated set of stored fields, the postings updater updates a list of postings of the document search index by performing a method comprising the steps of; generating a modified postings list for the document search index corresponding to the updated set of stored fields; and updating the list of postings of the document search index at least partially based on the modified postings list. - View Dependent Claims (25, 26, 27, 28, 29)
-
Specification