Method for maintaining an index
First Claim
1. A computer implemented method for maintaining an index of a database, the database storing information as a plurality of records, comprising:
- indexing batches of the records by storing index entries in a memory, each index entry including a word entry for each unique portion of information of the database, and one or more location entries pointing at occurrences of the portions of information;
collating the index entries according to the order of the word entries, and sequentially according to the locations of each word entry;
organizing the index entries in a plurality of tiers of files, there at least initially being one tier of files for each batch of records indexed; and
periodically merging a subsequently produced tier of files with a previously produced tier of files to produce a merged tier of files, the index entries being a logical union of the index entries of the subsequently and previous produced tiers of files.
12 Assignments
0 Petitions
Accused Products
Abstract
A computer implemented method maintains an index of a database. The database stores information as a plurality of records. Batches of records are indexed by storing index entries in a memory. Each index entry includes a word entry for each unique portion of information of the database, and one or more location entries pointing at occurrences of the portions of information. The index entries are organized according to a collating order of the word entries. The location entries are stored sequentially for each word entry. The index entries are organized into a plurality of tiers of files. There is one tier of files for each batch of records indexed. A merged tier of files is periodically produced by merging a subsequently produced tier of files with a previously produced tier of files. The index entries of the merged tier of files are a logical union of the index entries of the subsequently and previous produced tiers of files. The location entries of deleted records are expunged while merging the tiers of files.
268 Citations
15 Claims
-
1. A computer implemented method for maintaining an index of a database, the database storing information as a plurality of records, comprising:
-
indexing batches of the records by storing index entries in a memory, each index entry including a word entry for each unique portion of information of the database, and one or more location entries pointing at occurrences of the portions of information; collating the index entries according to the order of the word entries, and sequentially according to the locations of each word entry; organizing the index entries in a plurality of tiers of files, there at least initially being one tier of files for each batch of records indexed; and periodically merging a subsequently produced tier of files with a previously produced tier of files to produce a merged tier of files, the index entries being a logical union of the index entries of the subsequently and previous produced tiers of files. - View Dependent Claims (2)
-
-
3. A computer implemented method for maintaining an index of a database, the database storing information as a plurality of records, comprising the steps of:
-
indexing batches of the records by storing index entries in a memory; organizing the index entries into a plurality of tiers, including a first tier and a second tier, wherein the first tier includes a first bucket and the second tier includes a second bucket, the first bucket and the second bucket each including at least one of the index entries; and merging the first tier with the second tier to merge the first bucket with the second bucket. - View Dependent Claims (4, 5, 6, 7)
-
-
8. A computer system for maintaining a database storing information as a plurality of records, the computer system comprising:
-
a memory configured to store a first and a second section of data, the first section having a first storage area configured to store a first entry and the second section having a second storage area configured to store a second entry, wherein the second storage area is a logical extension of the first storage area; and a processor configured to merge the second storage area with the first storage area. - View Dependent Claims (9, 10, 11, 12, 13, 14, 15)
-
Specification