×

Deduplication featuring variable-size duplicate data detection and fixed-size data segment sharing

  • US 9,465,808 B1
  • Filed: 12/15/2012
  • Issued: 10/11/2016
  • Est. Priority Date: 12/15/2012
  • Status: Active Grant
First Claim
Patent Images

1. A method comprising:

  • storing a plurality of fixed-size data segments on a storage device;

    calculating a plurality of stored identifiers, whereina first stored identifier of the plurality of stored identifiers identifies a sub-portion of a first fixed-size data segment of the plurality of fixed-size data segments;

    calculating a plurality of identifiers for respective sections of a data stream generated by a client, in response to detecting that the data stream is being written, or is selected to be written, to the storage device, whereinthe data stream comprises two variable-length data segments, andthe plurality of identifiers comprise a first identifier for a first section of the data stream;

    detecting that the first identifier matches a first stored identifier; and

    in response to the detecting, causing an additional reference to be generated instead of writing the first section of the data stream to the storage device as part of a deduplicated data stream, whereinthe deduplicated data stream is associated with a reference stream,the additional reference is included as part of the reference stream,the additional reference identifies the sub-portion of the first fixed-size data segment as part of the data stream, andthe first fixed-size data segment has a different length than the first section of the data stream, andthe calculating, the detecting, and the causing are performed by a computing device implementing a deduplication module, whereinthe reference stream identifies every fixed-size data segment of the plurality of fixed-size data segments that comprises at least one portion of a variable-length data segment of the two variable-length data segments even if the first fixed-size data segment comprises data that is not part of the variable-length data segment.

View all claims
  • 7 Assignments
Timeline View
Assignment View
    ×
    ×