EFFICIENT CONSTRUCTION OF SYNTHETIC BACKUPS WITHIN DEDUPLICATION STORAGE SYSTEM
First Claim
1. For a plurality of input data segments in a deduplication storage system, a method of facilitating construction of a synthetic backup by a processor device, comprising:
- for each input data segment of the plurality of input data segments, retrieving a plurality of stored deduplication digests of stored data segments, referenced by the input data segments;
partitioning each input data segment into each of a plurality of fixed-sized data sub-segments;
for each of the plurality of data sub-segments, calculating each of a plurality of input deduplication digests based on the retrieved plurality of stored deduplication digests; and
aggregating each of the plurality of sub-segment deduplication digests to generate a deduplication digest of each input data segment, wherein the deduplication digests of each input data segment form a deduplication digest of the synthetic backup.
0 Assignments
0 Petitions
Accused Products
Abstract
A deduplication storage system enables new input data to be deduplicated with data of synthetic backups already constructed, and for this purpose efficiently calculates deduplication digests for synthetic backups being constructed, based on already existing digests of data referenced by the synthetic backups. For each input data segment of the plurality of input data segments of a synthetic backup being constructed, a plurality of deduplication digests of stored data segments, referenced by the input data segment, is retrieved from an index. Each input data segment is partitioned into each of a plurality of fixed-sized data sub-segments. A calculation is performed producing a deduplication digest for a data sub-segment, where the calculation is based on the retrieved deduplication digests of the plurality of stored data sub-segments referenced by the input data sub-segment.
41 Citations
9 Claims
-
1. For a plurality of input data segments in a deduplication storage system, a method of facilitating construction of a synthetic backup by a processor device, comprising:
-
for each input data segment of the plurality of input data segments, retrieving a plurality of stored deduplication digests of stored data segments, referenced by the input data segments; partitioning each input data segment into each of a plurality of fixed-sized data sub-segments; for each of the plurality of data sub-segments, calculating each of a plurality of input deduplication digests based on the retrieved plurality of stored deduplication digests; and aggregating each of the plurality of sub-segment deduplication digests to generate a deduplication digest of each input data segment, wherein the deduplication digests of each input data segment form a deduplication digest of the synthetic backup. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
Specification