×

Region-integrated data deduplication implementing a multi-lifetime duplicate finder

  • US 10,394,764 B2
  • Filed: 03/29/2016
  • Issued: 08/27/2019
  • Est. Priority Date: 03/29/2016
  • Status: Active Grant
First Claim
Patent Images

1. A computer program product for performing deduplication in conjunction with random read and write operations across a namespace, the computer program product comprising a computer readable storage medium having program instructions embodied therewith, wherein the computer readable storage medium is not a transitory signal per se, the program instructions executable by a computer to cause the computer to perform a method comprising:

  • receiving, at the computer, a write request comprising a data chunk;

    computing, by the computer, a fingerprint of the data chunk;

    determining, by the computer, whether a short term dictionary corresponding to the namespace comprises an entry corresponding to the fingerprint;

    in response to determining the short term dictionary comprises the entry corresponding to the fingerprint, writing, by the computer, the data chunk to a data store corresponding to the namespace in a deduplicating manner;

    in response to determining the short term dictionary does not comprise the entry corresponding to the fingerprint, determining, by the computer, whether a long term dictionary corresponding to the namespace comprises the entry corresponding to the fingerprint;

    in response to determining the long term dictionary comprises the entry corresponding to the fingerprint, writing, by the computer the data chunk to the data store in the deduplicating manner;

    in response to determining the long term dictionary does not comprise the entry corresponding to the fingerprint, writing, by the computer, the data chunk to the data store in a non-deduplicating manner; and

    in response to determining the long term dictionary comprises the entry corresponding to the fingerprint, repopulating the short term dictionary with the entry corresponding to the fingerprint,wherein the short term dictionary comprises a first eviction policy,wherein the long term dictionary comprises a second eviction policy,wherein the first eviction policy is configured to evict one or more entries of the short term dictionary in response to a new entry being inserted into the short term dictionary, andwherein the second eviction policy is configured to evict one or more entries of the long term dictionary in response to a new entry being inserted into the long term dictionary.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×