×

Deduplicating storage with enhanced frequent-block detection

  • US 9,767,140 B2
  • Filed: 08/25/2015
  • Issued: 09/19/2017
  • Est. Priority Date: 04/30/2012
  • Status: Active Grant
First Claim
Patent Images

1. A method for detecting data duplication, comprising:

  • maintaining a fingerprint directory comprising one or more entries, each entry including a data fingerprint and a data location for a data chunk;

    maintaining a shadow list comprising a record of fingerprint values removed from the fingerprint directory, wherein the shadow list comprises an allocation of resources with associated methods to insert fingerprints and to look up fingerprints, and to return a result of a lookup in the shadow list;

    associating each said entry with a seen-count attribute which is an indication of how often a data fingerprint has been seen in arriving data chunks to be written in a storage system, and distinguishes multiply-seen entries for data fingerprints present in at least two data chunks from once-seen entries for data fingerprints present in no more than a single data chunk; and

    retrieving entries from the shadow list such that each entry retrieved from the shadow list comprises twice-seen fingerprints.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×