×

ELIMINATION OF DUPLICATE OBJECTS IN STORAGE CLUSTERS

  • US 20130339314A1
  • Filed: 06/13/2012
  • Published: 12/19/2013
  • Est. Priority Date: 06/13/2012
  • Status: Active Grant
First Claim
Patent Images

1. A method of detecting duplicates of a first digital object within a storage cluster, said method comprising:

  • receiving, within said storage cluster, a hash value of said first digital object and a unique identifier associated with said first digital object, said hash value falling within an address space made up of a plurality of non-overlapping ranges;

    identifying a portion of said hash value whose value represents a first range of addresses within said address space;

    accessing a page mapping table using said value indicated by said portion of said hash value to identify a computer node of said storage cluster, said page mapping table mapping each range of addresses of said address space to one of said computer nodes;

    accessing a hash table of said computer node to determine whether said hash value and said unique identifier are represented in said hash table;

    accessing a second digital object within said storage cluster when it is determined that said hash value is represented in an entry in said hash table but said unique identifier is not; and

    comparing a hash value stored in association with said second digital object with said received hash value and determining that said second digital object is a duplicate of said first digital object when said hash values match.

View all claims
  • 4 Assignments
Timeline View
Assignment View
    ×
    ×