×

Network file system-based data storage system

  • US 7,747,581 B1
  • Filed: 04/19/2007
  • Issued: 06/29/2010
  • Est. Priority Date: 09/15/2004
  • Status: Active Grant
First Claim
Patent Images

1. A file system comprising:

  • an index engine to maintain an index recoverable from crashes, the index engine including;

    an on-disk circular buffer to store a current on-disk set of hash-table buckets to store a current on-disk set of index entries at a current location;

    an in-memory merge buffer to store a current in-memory set of hash-table buckets to store a current in-memory set of index entries between merges, wherein recovery from crashes between and during merges is effected by one of the in-memory merge buffer being stored in a non-volatile memory and the current in-memory set of index entries being duplicated in an on-disk log;

    a merge mechanism, coupled to the on-disk and the in-memory merge buffers, to merge the current on-disk set of hash-table buckets and the current in-memory set of hash-table buckets to create at a new location in the on-disk circular buffer a new version of the current on-disk set of hash-table buckets storing the merged on-disk and in-memory set of index entries as the current on-disk set of index entries each time the number of index entries of the current in-memory set of index entries reaches a specified value, wherein the on-disk circular buffer is used for recovery from crashes occurring during merges; and

    an on-disk partial indexes buffer to store all prior in-memory sets of hash-table buckets by said merge mechanism used for said merges to facilitate index checking and reconstruction of the index;

    a segment store unit to segment data into the one or more variable length data segments;

    a storage unit, coupled to the index engine, to store at least one copy of each unique variable length data segment; and

    the index engine and the segment store unit to identify new variable length data segments that are identical to one of the variable length data segments already stored in said storage unit, create for each of the new variable length data segments identical to one of the already stored variable length data segments a reference to the already stored identical variable length data segment, store each of the references in the storage unit, and delete each of the new variable length data segments determined identical to one of the already stored variable length data segments.

View all claims
  • 12 Assignments
Timeline View
Assignment View
    ×
    ×