×

Parallel file system and method with extensible hashing

  • US 5,893,086 A
  • Filed: 07/11/1997
  • Issued: 04/06/1999
  • Est. Priority Date: 07/11/1997
  • Status: Expired due to Term
First Claim
Patent Images

1. In a system which is used for storing and indexing a large set of data records and supports fast insert, delete and lookup operations and also sequential retrieval of all data records of said set, a method comprising:

  • providing a file system for said system which allows storing and retrieving data by specifying a key that identifies a data record,providing for a set of data records an index or directory with a single initial hash bucket, and storing in said initial hash by use of a hash function all records of a set of data records to be stored as long as they fit into said initial hash bucket, and when the initial hash bucket is full, splitting by adding a second hash bucket and adding one bit to said hash function used to place records whereby those records without said one bit are moved into said initial hash bucket while those records with said one bit are moved into said second hash bucket and wherein new records are added to the initial bucket or said second bucket depending upon the value of the bit for said hash function, and if a hash bucket fills up again, the bucket is split and two bits for the hash function determine where records from the second bucket are to be placed, while the records in the initial bucket are not effected by a new split of said second bucket, but can be split as well, andwherein each hash bucket is stored in a sparse file at an offset given as i*s, where i is the hash bucket number and s is the hash bucket size, an where a directory starts out as an empty file, where the file size increases to the size where it needs to be split by inserting records,and wherein upon a split, an additional bucket is written increasing the file size from s to 2*s upon the first split.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×