PRODUCING ALTERNATIVE SEGMENTATIONS OF DATA INTO BLOCKS IN A DATA DEDUPLICATION SYSTEM
First Claim
Patent Images
9. A system for producing a plurality of segmentations of input data into blocks in a data deduplication system of a computing environment, the system comprising:
- the data deduplication system;
a repository operating in the data deduplication system;
a memory in the data deduplication system;
a search structure in association with the memory in the data deduplication system; and
at least one processor device operable in the computing storage environment for controlling the data deduplication system, wherein the at least one processor device;
calculates digests for an input data chunk using a primary segmentation,obtains and applies secondary segmentations for each one of a plurality of data mismatches based on reference data, andstores the primary segmentation and corresponding primary digests for the input data chunk.
1 Assignment
0 Petitions
Accused Products
Abstract
For producing secondary segmentations of data into blocks and corresponding digests for input data in a data deduplication system using a processor device in a computing environment, digests are calculated for an input data chunk using a primary segmentation into blocks. Secondary segmentations are produced for each of the data mismatches based on reference data, and used to calculate further data matches. The primary segmentation and the corresponding primary digests are stored for the input data chunk.
13 Citations
24 Claims
-
9. A system for producing a plurality of segmentations of input data into blocks in a data deduplication system of a computing environment, the system comprising:
-
the data deduplication system; a repository operating in the data deduplication system; a memory in the data deduplication system; a search structure in association with the memory in the data deduplication system; and at least one processor device operable in the computing storage environment for controlling the data deduplication system, wherein the at least one processor device; calculates digests for an input data chunk using a primary segmentation, obtains and applies secondary segmentations for each one of a plurality of data mismatches based on reference data, and stores the primary segmentation and corresponding primary digests for the input data chunk. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. A computer program product for producing a plurality of segmentations of input data into blocks in a data deduplication system using a processor device in a computing environment, the computer program product comprising a computer-readable storage medium having computer-readable program code portions stored therein, the computer-readable program code portions comprising:
-
a first executable portion that calculates digests for an input data chunk using a primary segmentation; a second executable portion that obtains and applies secondary segmentations for each one of a plurality of data mismatches based on reference data; and a third executable portion that stores the primary segmentation and corresponding primary digests for the input data chunk. - View Dependent Claims (1, 2, 3, 4, 5, 6, 7, 8, 18, 19, 20, 21, 22)
-
-
18-1. The computer program product of claim 17, further including a fourth executable portion that obtains the segmentations for each one of the data mismatches by considering input digests included in data matches preceding and following each one of the data mismatches.
-
24. The computer program product of claim 23, further including a ninth executable portion that avoids storing the secondary segmentations and corresponding secondary digests for the input data chunk.
Specification