Efficient construction of synthetic backups within deduplication storage system
Abstract
A deduplication storage system enables new input data to be deduplicated with data of synthetic backups already constructed, and for this purpose efficiently calculates deduplication digests for synthetic backups being constructed, based on already existing digests of data referenced by the synthetic backups. For each input data segment of the plurality of input data segments of a synthetic backup being constructed, a plurality of deduplication digests of stored data segments, referenced by the input data segment, is retrieved from an index. Each input data segment is partitioned into each of a plurality of fixed-sized data sub-segments. A calculation is performed producing a deduplication digest for a data sub-segment, where the calculation is based on the retrieved deduplication digests of the plurality of stored data sub-segments referenced by the input data sub-segment.
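The digest reuse described in the abstract can be illustrated with a minimal sketch. It is not the patented method: it assumes SHA-256 as the digest function, a tiny hypothetical sub-segment size (`SUB_SIZE`), an in-memory dictionary standing in for the index, and, most importantly, that every reference from the synthetic backup into the originating backup is aligned to sub-segment boundaries, so each input sub-segment maps to exactly one stored sub-segment. All names (`build_index`, `sub_segment_digests`, `extents`) are hypothetical.

```python
import hashlib

SUB_SIZE = 4  # hypothetical fixed sub-segment size in bytes (tiny, for illustration)


def digest(data: bytes) -> bytes:
    """Digest function assumed for this sketch (SHA-256)."""
    return hashlib.sha256(data).digest()


def build_index(backup: bytes) -> dict:
    """Map each aligned sub-segment offset of the originating backup to its digest."""
    return {
        off: digest(backup[off:off + SUB_SIZE])
        for off in range(0, len(backup), SUB_SIZE)
    }


def sub_segment_digests(extents, index):
    """Return digests for the sub-segments of one synthetic input segment.

    extents: list of (source_offset, length) references into the originating
    backup. The sketch assumes both values are multiples of SUB_SIZE, so the
    digests are looked up in the index and no backup data is read.
    """
    out = []
    for src, length in extents:
        if src % SUB_SIZE or length % SUB_SIZE:
            raise ValueError("sketch assumes aligned references")
        for off in range(src, src + length, SUB_SIZE):
            out.append(index[off])
    return out


originating = b"AAAABBBBCCCCDDDD"
index = build_index(originating)

# One synthetic segment built from "CCCC" followed by "AAAABBBB".
extents = [(8, 4), (0, 8)]
reused = sub_segment_digests(extents, index)
```

Under the alignment assumption, the looked-up digests are identical to the ones that would be obtained by materializing the synthetic segment and hashing it again. Unaligned references would need a different calculation that this sketch does not attempt.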
7 Claims
1. For a plurality of new input data segments in a deduplication storage system, a method of facilitating construction of a synthetic backup by a processor device, the synthetic backup being independent of and constructed from an originating backup being a full, existing backup, the method comprising:
- for each new input data segment of the plurality of new input data segments, retrieving a plurality of stored deduplication digests of stored data segments, referenced by the new input data segments, the stored data segments being data taken from the originating backup, and the plurality of stored deduplication digests being deduplication digests calculated from the stored data segments,
- partitioning each new input data segment into each of a plurality of fixed-sized data sub-segments,
- for each of the plurality of data sub-segments, during the construction of the synthetic backup, calculating each of a plurality of input deduplication digests based on the retrieved plurality of stored deduplication digests,
- aggregating each of the plurality of sub-segment deduplication digests to generate a deduplication digest of each new input data segment,
- searching the plurality of stored deduplication digests of the stored data segments for matches with the deduplication digest of each new input data segment to thereby deduplicate each new input data segment, and
- forming a deduplication digest of the synthetic backup from the deduplication digests of each new input data segment.
Dependent claims: 2, 3, 4, 5, 6, 7
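The aggregating, searching, and forming steps of claim 1 can be sketched as follows. The claim does not say how digests are aggregated, so this sketch assumes one simple scheme: a parent digest is the SHA-256 hash of its concatenated child digests. The function names (`aggregate`, `deduplicate`) and the sample digests are hypothetical and serve only as illustration.

```python
import hashlib


def aggregate(digests):
    """Combine child digests into one parent digest.

    Assumed scheme for this sketch: hash the concatenation of the children.
    """
    return hashlib.sha256(b"".join(digests)).digest()


def deduplicate(segment_sub_digests, stored_segment_digests):
    """Aggregate, search, and form the backup digest.

    segment_sub_digests: one list of sub-segment digests per input segment.
    stored_segment_digests: set of segment digests already in the system.
    Returns (segment digests, per-segment match flags, backup digest).
    """
    # Aggregate sub-segment digests into one digest per input segment.
    segment_digests = [aggregate(subs) for subs in segment_sub_digests]
    # Search the stored digests for a match with each segment digest.
    matched = [d in stored_segment_digests for d in segment_digests]
    # Form the digest of the synthetic backup from the segment digests.
    backup_digest = aggregate(segment_digests)
    return segment_digests, matched, backup_digest


# Hypothetical sub-segment digests standing in for values read from an index.
sub = [hashlib.sha256(bytes([i])).digest() for i in range(4)]

# The system already holds a segment made of the first two sub-segments.
stored = {aggregate(sub[0:2])}

segments = [sub[0:2], sub[2:4]]
seg_digests, matched, backup_digest = deduplicate(segments, stored)
```

In this example the first segment matches a stored digest and the second does not. No segment data is touched at any point; every value is derived from digests that already exist.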
Specification