×

Deduplication featuring variable-size duplicate data detection and fixed-size data segment sharing

  • US 10,656,858 B1
  • Filed: 10/07/2016
  • Issued: 05/19/2020
  • Est. Priority Date: 12/15/2012
  • Status: Active Grant
First Claim
Patent Images

1. A method comprising:

  • receiving a data stream;

    dividing the data stream into a plurality of variable-sized sections;

    calculating a fingerprint for a first variable-sized section of the plurality of variable-sized sections, whereinthe fingerprint is calculated as a function of all data within the first variable-sized section;

    determining whether the fingerprint matches a stored fingerprint;

    in response to determining that the fingerprint matches the stored fingerprint, identifying a fixed-length data segment in storage that contains a copy of data in the first variable-sized section, whereinthe fixed-length data segment comprisesthe copy, andadditional data that is not found in the first variable-sized section;

    replacing the first variable-sized section with a plurality of references including a reference to the fixed-length data segment; and

    updating a reference file to identifythe fixed-length data segment,a first portion of data in the fixed-length data segment that corresponds to the copy, anda second portion of data in the fixed-length data segment that corresponds to the additional data.

View all claims
  • 4 Assignments
Timeline View
Assignment View
    ×
    ×