Method for mapping an index of a database into an array of files
First Claim
1. A computer implemented method for mapping an index of information of a database as a plurality of records, comprising:
- parsing a batch of the records into words and locations, each word representing a portion of the information of a particular record, and each location assigned sequentially in the order of parsing the words;
generating an index entry for each unique word, each index entry including the unique word and all of the locations of the unique word;
hashing each unique word to determine a bucket number;
writing index entries having identical bucket numbers to a single index file in an order according to their unique words;
generating a summary file for each index file by periodically sampling the data being written to the index file;
grouping the index files and their associated summary files into a tier of files representing an index of the batch of records.
12 Assignments
0 Petitions
Accused Products
Abstract
A computer implemented method maps an index of information of a database into an array of files. The information is stored as records of a database. Batches of records are parsed into words and locations, each word representing a portion of the information of a particular record, the locations are sequentially assigned to the words in the order of parsing the words. An index entry is generated for each unique word, each index entry including the unique word and all of the locations where the unique word occurs in the database. Each unique word is hashed to determine a bucket number. Index entries having identical bucket numbers are written to a single index file associated with the bucket number in a collating order of the unique words. A summary file is generated for each index file by periodically sampling the data being written to the index file. Index files and their associated summary files are grouped into a tier of files, there being one tier for each batch of records parsed.
67 Citations
2 Claims
-
1. A computer implemented method for mapping an index of information of a database as a plurality of records, comprising:
-
parsing a batch of the records into words and locations, each word representing a portion of the information of a particular record, and each location assigned sequentially in the order of parsing the words; generating an index entry for each unique word, each index entry including the unique word and all of the locations of the unique word; hashing each unique word to determine a bucket number; writing index entries having identical bucket numbers to a single index file in an order according to their unique words; generating a summary file for each index file by periodically sampling the data being written to the index file; grouping the index files and their associated summary files into a tier of files representing an index of the batch of records. - View Dependent Claims (2)
-
Specification