EFFICIENT CONSTRUCTION OF SYNTHETIC BACKUPS WITHIN DEDUPLICATION STORAGE SYSTEM
First Claim
1. For a plurality of input data segments in a deduplication storage system, a method of facilitating construction of a synthetic backup by a processor device, comprising:
- for each input data segment of the plurality of input data segments, retrieving a plurality of stored deduplication digests of stored data segments, referenced by the input data segments;
partitioning each input data segment into each of a plurality of fixed-sized data sub-segments;
for each of the plurality of data sub-segments, calculating each of a plurality of input deduplication digests based on the retrieved plurality of stored deduplication digests; and
aggregating each of the plurality of sub-segment deduplication digests to generate a deduplication digest of each input data segment, wherein the deduplication digests of each input data segment form a deduplication digest of the synthetic backup.
1 Assignment
0 Petitions
Accused Products
Abstract
Various embodiments are provided for facilitating construction of a synthetic backup in a deduplication storage system. In one embodiment, a deduplication storage system enables new input data to be deduplicated with data of synthetic backups already constructed, and for this purpose efficiently calculates deduplication digests for synthetic backups being constructed, based on already existing digests of data referenced by the synthetic backups. For each input data segment of the plurality of input data segments of a synthetic backup being constructed, a plurality of deduplication digests of stored data segments, referenced by the input data segment, is retrieved from an index. Each input data segment is partitioned into each of a plurality of fixed-sized data sub-segments. A calculation is performed producing a deduplication digest for a data sub-segment, where the calculation is based on the retrieved deduplication digests of the plurality of stored data sub-segments referenced by the input data sub-segment.
65 Citations
27 Claims
-
1. For a plurality of input data segments in a deduplication storage system, a method of facilitating construction of a synthetic backup by a processor device, comprising:
-
for each input data segment of the plurality of input data segments, retrieving a plurality of stored deduplication digests of stored data segments, referenced by the input data segments; partitioning each input data segment into each of a plurality of fixed-sized data sub-segments; for each of the plurality of data sub-segments, calculating each of a plurality of input deduplication digests based on the retrieved plurality of stored deduplication digests; and aggregating each of the plurality of sub-segment deduplication digests to generate a deduplication digest of each input data segment, wherein the deduplication digests of each input data segment form a deduplication digest of the synthetic backup. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A system, for a plurality of input data segments in a deduplication storage system, for facilitating construction of a synthetic backup, comprising:
at least one processor device, operable in the deduplication computing storage environment, wherein the at least one processor device is adapted for; for each input data segment of the plurality of input data segments, retrieving a plurality of stored deduplication digests of stored data segments, referenced by the input data segments, partitioning each input data segment into each of a plurality of fixed-sized data sub-segments, for each of the plurality of data sub-segments, calculating each of a plurality of input deduplication digests based on the retrieved plurality of stored deduplication digests, and aggregating each of the plurality of sub-segment deduplication digests to generate a deduplication digest of each input data segment, wherein the deduplication digests of each input data segment form a deduplication digest of the synthetic backup. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
19. A computer program product, for a plurality of input data segments in a deduplication storage system, for facilitating construction of a synthetic backup by at least one processor device, the computer program product comprising a computer-readable storage medium having computer-readable program code portions stored therein, the computer-readable program code portions comprising:
-
a first executable portion for, for each input data segment of the plurality of input data segments, retrieving a plurality of stored deduplication digests of stored data segments, referenced by the input data segments; a second executable portion for partitioning each input data segment into each of a plurality of fixed-sized data sub-segments; a third executable portion for, for each of the plurality of data sub-segments, calculating each of a plurality of input deduplication digests based on the retrieved plurality of stored deduplication digests; and a fourth executable portion for aggregating each of the plurality of sub-segment deduplication digests to generate a deduplication digest of each input data segment, wherein the deduplication digests of each input data segment form a deduplication digest of the synthetic backup. - View Dependent Claims (20, 21, 22, 23, 24, 25, 26, 27)
-
Specification