Deduplicating sets of data blocks

  • US 9,846,718 B1
  • Filed: 03/31/2014
  • Issued: 12/19/2017
  • Est. Priority Date: 03/31/2014
  • Status: Active Grant
First Claim

1. A method for use in deduplicating sets of data blocks, the method comprising:

  • identifying a candidate data object for deduplicating a data object, wherein a digest associated with the candidate data object matches a digest associated with the data object, wherein an index table stores digest information for the candidate data object and digest information for at least one data object residing adjacent to the candidate data object;

    upon detecting a digest collision, determining whether to apply a deduplicating technique to the candidate data object and the data object by evaluating digest information of a set of data blocks residing adjacent to the candidate data block, wherein evaluating the digest information of the set of data blocks residing adjacent to the candidate data block includes determining whether a first set of data objects residing adjacent to the data object is identical to a second set of data objects residing adjacent to the candidate data object, wherein the first and second sets of data objects are selected based on a reference of locality indicating a likelihood of the data object being identical to the candidate data object, wherein whether the first set of data objects is identical to the second set of data objects is determined by evaluating respective digest information of the first and second sets of data objects, wherein each data object of a set of data objects is associated with a respective digest, wherein the digest collision indicates that at least two data objects storing different contents are associated with the same digest; and

    based on the determination that the first and second sets of data objects are identical to each other, applying a deduplicating technique to the data object and the candidate data object for deduplicating the data object to the candidate data object.
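
The steps recited in the claim can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names (`digest`, `neighbor_digests`, `deduplicate`), the `WINDOW` parameter, the use of SHA-256, and the in-memory dictionary standing in for the index table are all assumptions made for illustration. The sketch records, for each block, its digest together with the digests of its adjacent blocks, and deduplicates a block to a candidate only when both the block digests and the neighbor digests match.

```python
import hashlib
from typing import Callable, Dict, List, Tuple

WINDOW = 1  # hypothetical: how many adjacent blocks on each side to compare

Neighbors = Tuple[Tuple[str, ...], Tuple[str, ...]]


def digest(block: bytes) -> str:
    """Assumed digest function; the claim does not name a specific hash."""
    return hashlib.sha256(block).hexdigest()


def neighbor_digests(digests: List[str], i: int, window: int = WINDOW) -> Neighbors:
    """Digests of the blocks immediately before and after position i."""
    before = tuple(digests[max(0, i - window):i])
    after = tuple(digests[i + 1:i + 1 + window])
    return before, after


def deduplicate(
    blocks: List[bytes],
    digest_fn: Callable[[bytes], str] = digest,
    window: int = WINDOW,
) -> List[int]:
    """Return, for each block, the position of the block that stores its data.

    A block is mapped to an earlier candidate only if their digests match
    and the digests of their adjacent blocks are identical as well.
    """
    digests = [digest_fn(b) for b in blocks]
    # Stand-in for the index table: digest -> (position, neighbor digests).
    index: Dict[str, Tuple[int, Neighbors]] = {}
    refs: List[int] = []
    for i, d in enumerate(digests):
        neighbors = neighbor_digests(digests, i, window)
        entry = index.get(d)
        if entry is not None and entry[1] == neighbors:
            refs.append(entry[0])  # neighbors agree: deduplicate to candidate
        else:
            refs.append(i)  # no candidate, or neighbors differ: keep the block
            index.setdefault(d, (i, neighbors))
    return refs


def weak_digest(block: bytes) -> str:
    """Deliberately weak digest (first byte only) used to provoke collisions."""
    return block[:1].hex()
```

With the weak digest, `b"A1"` and `b"A2"` share a digest although their contents differ. Because their neighbors' digests do not match, `deduplicate([b"x1", b"A1", b"y1", b"z2", b"A2", b"w2"], digest_fn=weak_digest)` keeps both blocks. Comparing only neighbor digests remains a heuristic: it lowers the chance of deduplicating on a collision but does not replace a byte-for-byte check where one is required.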
