Method for cleaning a delta storage system

  • US 9,405,764 B1
  • Filed: 07/16/2015
  • Issued: 08/02/2016
  • Est. Priority Date: 06/13/2012
  • Status: Active Grant
First Claim

1. A computer-implemented method for performing garbage collection in a delta compressed data storage system, the method comprising:

  • traversing a plurality of file recipes stored in the data storage system to identify a plurality of data chunks, the plurality of file recipes representing a plurality of files, wherein each file recipe includes a plurality of chunk identifiers identifying a plurality of data chunks that constitute a corresponding file represented by the file recipe;

    for each of the file recipes,
      adding chunk identifiers contained in the file recipe to a set of live data chunks, and
      for each of the chunk identifiers in the file recipe,
        performing a lookup operation in an index of the storage system to determine whether there is a delta reference associated with the data chunk identifier, wherein each entry of the index stores a chunk identifier of a corresponding data chunk and a delta reference when the corresponding data chunk has been delta encoded from another data chunk,
        traversing the index using the chunk identifiers referenced in each of the file recipes to determine matches between the chunk identifiers and entries in the index,
        selecting base data chunk identifiers associated with matched entries as the delta references, and
        adding the delta reference corresponding to the chunk identifier to the set of live data chunks; and

    discarding data chunks in a data storage system that are not identified by the set of live data chunks, wherein the set of live data chunks is a listing that identifies live data chunks in the data storage system.
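
The steps recited in the claim can be illustrated with a minimal Python sketch. It is not the patented implementation. The names `collect_garbage`, `file_recipes`, `index`, and `chunk_store` are hypothetical, the index is modeled as a plain dictionary mapping each chunk identifier to its base chunk identifier (or `None` when the chunk is not delta encoded), and the chunk store is modeled as a dictionary of chunk identifiers to bytes. Following the claim wording, the sketch adds only the direct delta reference of each recipe chunk; it assumes base chunks are not themselves delta encoded.

```python
from typing import Dict, Iterable, List, Optional, Set


def collect_garbage(
    file_recipes: Iterable[List[str]],
    index: Dict[str, Optional[str]],
    chunk_store: Dict[str, bytes],
) -> Set[str]:
    """Mark chunks reachable from file recipes, then discard the rest.

    All parameter shapes are assumptions made for illustration:
    - file_recipes: each recipe is a list of chunk identifiers.
    - index: chunk identifier -> base chunk identifier (delta reference),
      or None when the chunk was not delta encoded.
    - chunk_store: chunk identifier -> stored chunk bytes.
    """
    live: Set[str] = set()

    for recipe in file_recipes:
        # Every chunk named in a recipe is live.
        live.update(recipe)

        for chunk_id in recipe:
            # Look up the chunk in the index; a matching entry with a
            # delta reference names the base chunk it was encoded against.
            base_id = index.get(chunk_id)
            if base_id is not None:
                # The base is needed to decode the delta, so it is live too.
                live.add(base_id)

    # Sweep: discard every stored chunk that the live set does not identify.
    dead = [chunk_id for chunk_id in chunk_store if chunk_id not in live]
    for chunk_id in dead:
        del chunk_store[chunk_id]

    return live
```

In this sketch, a base chunk that no recipe names directly still survives the sweep as long as some live chunk was delta encoded against it, which is the case the delta-reference lookup in the claim is there to handle.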
