FIXED SIZE EXTENTS FOR VARIABLE SIZE DEDUPLICATION SEGMENTS
Abstract
Mechanisms are provided for maintaining variable size deduplication segments using fixed size extents. Variable size segments are identified and maintained in a datastore suitcase. Duplicate segments need not be maintained redundantly but can be managed by updating reference counts associated with the segments in the datastore suitcase. Segments are maintained using fixed size extents. A minor increase in storage overhead removes the need for inefficient recompaction when a segment is removed from the datastore suitcase. Fixed size extents can be reallocated for storage of new segments.
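The bookkeeping described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the class name `Suitcase`, the 4096-byte `EXTENT_SIZE`, and the in-memory layout are all assumptions made for illustration. It shows a variable size segment occupying a whole number of fixed size extents, duplicates handled by a reference count, and freed extents returned to a pool for reuse without recompaction.

```python
EXTENT_SIZE = 4096  # hypothetical extent size in bytes; the abstract gives no value


class Suitcase:
    """Toy datastore suitcase: a pool of fixed size extents plus a segment index."""

    def __init__(self, num_extents):
        self.extents = [b""] * num_extents
        self.free = list(range(num_extents))  # ids of unallocated extents
        self.index = {}  # fingerprint -> [extent ids, segment length, refcount]

    def add(self, fingerprint, data):
        entry = self.index.get(fingerprint)
        if entry is not None:
            entry[2] += 1  # duplicate segment: only the reference count changes
            return
        needed = max(1, -(-len(data) // EXTENT_SIZE))  # ceiling division
        if needed > len(self.free):
            raise MemoryError("suitcase is full")
        ids = [self.free.pop() for _ in range(needed)]
        for i, ext in enumerate(ids):
            self.extents[ext] = data[i * EXTENT_SIZE:(i + 1) * EXTENT_SIZE]
        self.index[fingerprint] = [ids, len(data), 1]

    def read(self, fingerprint):
        ids, length, _ = self.index[fingerprint]
        return b"".join(self.extents[e] for e in ids)[:length]

    def remove(self, fingerprint):
        entry = self.index[fingerprint]
        entry[2] -= 1
        if entry[2] == 0:
            # Extents go back to the free pool; other segments do not move.
            for ext in entry[0]:
                self.extents[ext] = b""
            self.free.extend(entry[0])
            del self.index[fingerprint]
```

The unused tail of a segment's last extent is the storage overhead the abstract refers to; in exchange, removing a segment never requires shifting its neighbors.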
20 Claims
1. A method comprising:
- receiving a data stream at an interface;
- performing segment boundary identification and segment fingerprinting to generate a plurality of variable size segments including a first segment;
- accessing a deduplication dictionary to determine whether the first segment is maintained in one or more datastore suitcases;
- determining a first number of fixed sized extents associated with the one or more datastore suitcases needed to maintain the first segment.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
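The four steps of claim 1 can be sketched as follows. This is an illustration under stated assumptions, not the method as actually implemented: the claim does not say how segment boundaries are found or which fingerprint is used, so the sliding-window sum in `split_segments`, the SHA-1 fingerprint, the plain `dict` standing in for the deduplication dictionary, and every constant below are hypothetical choices.

```python
import hashlib
import math

EXTENT_SIZE = 512            # hypothetical fixed extent size in bytes
WINDOW = 16                  # hypothetical sliding-window width
MASK = 0x3F                  # boundary when the low 6 bits of the window sum are zero
MIN_SEG, MAX_SEG = 64, 4096  # hypothetical segment length bounds


def split_segments(stream: bytes):
    """Segment boundary identification using a simple content-defined rule."""
    segments, start, window_sum = [], 0, 0
    for i, byte in enumerate(stream):
        window_sum += byte
        if i - start >= WINDOW:
            window_sum -= stream[i - WINDOW]  # keep only the last WINDOW bytes
        length = i - start + 1
        if (length >= MIN_SEG and (window_sum & MASK) == 0) or length >= MAX_SEG:
            segments.append(stream[start:i + 1])
            start, window_sum = i + 1, 0
    if start < len(stream):
        segments.append(stream[start:])  # trailing partial segment
    return segments


def fingerprint(segment: bytes) -> str:
    """Segment fingerprinting; SHA-1 is an assumed choice."""
    return hashlib.sha1(segment).hexdigest()


def extents_needed(segment: bytes) -> int:
    """Number of fixed size extents required to hold one segment."""
    return max(1, math.ceil(len(segment) / EXTENT_SIZE))


def ingest(stream: bytes, dictionary: dict):
    """Return (fingerprint, extents to allocate) for each segment in the stream."""
    plan = []
    for seg in split_segments(stream):
        fp = fingerprint(seg)
        if fp in dictionary:
            dictionary[fp] += 1  # already maintained: bump the reference count
            plan.append((fp, 0))
        else:
            dictionary[fp] = 1
            plan.append((fp, extents_needed(seg)))
    return plan
```

Ingesting the same stream twice against one dictionary yields a second plan that allocates no extents, since every segment is already present.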
10. A system comprising:
- an interface operable to receive a data stream;
- a processor operable to perform segment boundary identification and segment fingerprinting to generate a plurality of variable size segments including a first segment and access a deduplication dictionary to determine whether the first segment is maintained in one or more datastore suitcases;
- wherein a first number of fixed sized extents associated with the one or more datastore suitcases needed to maintain the first segment is determined.
Dependent claims: 11, 12, 13, 14, 15, 16, 17, 18
19. A computer readable storage medium comprising:
- computer code for receiving a data stream at an interface;
- computer code for performing segment boundary identification and segment fingerprinting to generate a plurality of variable size segments including a first segment;
- computer code for accessing a deduplication dictionary to determine whether the first segment is maintained in one or more datastore suitcases;
- computer code for determining a first number of fixed sized extents associated with the one or more datastore suitcases needed to maintain the first segment.
Dependent claims: 20
Specification