DEDUPLICATING DATA AT SUB-BLOCK GRANULARITY

  • US 20200133928A1
  • Filed: 10/31/2018
  • Published: 04/30/2020
  • Est. Priority Date: 10/31/2018
  • Status: Active Grant
First Claim

1. A method of performing data deduplication, the method comprising:

  • receiving, by a data storage system, an I/O (Input/Output) request that specifies a write of a set of data to the data storage system, the data storage system defining a candidate block from at least a portion of the set of data, the candidate block including multiple uniformly-sized sub-blocks, the sub-blocks including a candidate sub-block;

    searching a deduplication database for a target sub-block that matches the candidate sub-block; and

    in response to finding a matching entry in the deduplication database for the target sub-block, (i) accessing, based on a reference stored in the matching entry, a previously-stored target block that contains the target sub-block, (ii) identifying a shared range between the candidate block and the target block for which a duplicate range R_DUP of the candidate block matches a target range R_TARG of the target block, and (iii) effecting persistent storage of the duplicate range R_DUP of the candidate block by configuring mapping metadata of the candidate block to reference the target range R_TARG of the target block.
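
The three steps of the claim can be illustrated with a minimal Python sketch. It is not the patented implementation: the class name `SubBlockDedupStore`, the `SharedRange` record, the in-memory dictionaries, the SHA-256 digest, and the 4-byte sub-block size are all assumptions chosen to keep the example small. The sketch also extends a match only forward and only in whole sub-blocks, which is a simplification the claim does not require.

```python
import hashlib
from dataclasses import dataclass

SUB_BLOCK_SIZE = 4  # bytes; tiny illustrative value so the example is easy to read


@dataclass(frozen=True)
class SharedRange:
    """Hypothetical mapping-metadata record linking the duplicate range to the target range."""
    cand_offset: int    # byte offset of the duplicate range in the candidate block
    target_id: int      # previously stored block holding the target range
    target_offset: int  # byte offset of the target range in the target block
    length: int         # length of the shared range in bytes


def split(block: bytes) -> list:
    """Cut a block into uniformly sized sub-blocks."""
    return [block[i:i + SUB_BLOCK_SIZE] for i in range(0, len(block), SUB_BLOCK_SIZE)]


def digest(sub_block: bytes) -> bytes:
    return hashlib.sha256(sub_block).digest()


class SubBlockDedupStore:
    def __init__(self):
        self.blocks = {}    # block id -> full bytes of blocks stored whole
        self.unique = {}    # block id -> bytes left over after removing the duplicate range
        self.mapping = {}   # block id -> SharedRange, or None if stored whole
        self.dedup_db = {}  # sub-block digest -> (block id, sub-block index)

    def write(self, block_id: int, candidate: bytes):
        cand_subs = split(candidate)
        for ci, sub in enumerate(cand_subs):
            # Search the deduplication database for a matching target sub-block.
            entry = self.dedup_db.get(digest(sub))
            if entry is None:
                continue
            # (i) Follow the reference in the entry to the previously stored target block.
            target_id, ti = entry
            target_subs = split(self.blocks[target_id])
            if target_subs[ti] != sub:  # guard against a digest collision
                continue
            # (ii) Grow the shared range forward, one sub-block at a time.
            n = 1
            while (ci + n < len(cand_subs) and ti + n < len(target_subs)
                   and cand_subs[ci + n] == target_subs[ti + n]):
                n += 1
            shared = SharedRange(ci * SUB_BLOCK_SIZE, target_id,
                                 ti * SUB_BLOCK_SIZE, n * SUB_BLOCK_SIZE)
            # (iii) Record mapping metadata instead of storing the duplicate range again.
            self.mapping[block_id] = shared
            end = shared.cand_offset + shared.length
            self.unique[block_id] = candidate[:shared.cand_offset] + candidate[end:]
            return shared
        # No sub-block matched: store the block whole and index its sub-blocks.
        self.blocks[block_id] = candidate
        self.mapping[block_id] = None
        for idx, sub in enumerate(cand_subs):
            self.dedup_db.setdefault(digest(sub), (block_id, idx))
        return None

    def read(self, block_id: int) -> bytes:
        shared = self.mapping[block_id]
        if shared is None:
            return self.blocks[block_id]
        target = self.blocks[shared.target_id]
        dup = target[shared.target_offset:shared.target_offset + shared.length]
        rest = self.unique[block_id]
        return rest[:shared.cand_offset] + dup + rest[shared.cand_offset:]


store = SubBlockDedupStore()
store.write(1, b"AAAABBBBCCCCDDDD")            # stored whole, sub-blocks indexed
shared = store.write(2, b"XXXXBBBBCCCCYYYY")   # BBBBCCCC is shared with block 1
print(shared)
print(store.read(2))
```

In this run the second write stores only the eight unique bytes `XXXXYYYY`; the middle eight bytes are served from block 1 through the recorded range when block 2 is read back.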

  • 3 Assignments