Fixed size extents for variable size deduplication segments
First Claim
Patent Images
1. A method comprising:
- receiving a data stream at an interface;
performing segment boundary identification and segment fingerprinting to generate a plurality of variable size segments including a first segment;
determining whether the first segment is maintained in storage;
determining a first number of fixed sized extents in the storage needed to maintain the first segment;
maintaining the first segment in a plurality of contiguous fixed size extents in the storage,wherein after the first segment is deleted, the plurality of contiguous fixed size extents in the storage used to maintain the now deleted first segment are used to maintain portions of a plurality of new segments without having to perform recompaction.
23 Assignments
0 Petitions
Accused Products
Abstract
Mechanisms are provided for maintaining variable size deduplication segments using fixed size extents. Variable size segments are identified and maintained in a datastore suitcase. Duplicate segments need not be maintained redundantly but can be managed by updating reference counts associated with the segments in the datastore suitcase. Segments are maintained using fixed size extents. A minor increase in storage overhead removes the need for inefficient recompaction when a segment is removed from the datastore suitcase. Fixed size extents can be reallocated for storage of new segments.
-
Citations
20 Claims
-
1. A method comprising:
-
receiving a data stream at an interface; performing segment boundary identification and segment fingerprinting to generate a plurality of variable size segments including a first segment; determining whether the first segment is maintained in storage; determining a first number of fixed sized extents in the storage needed to maintain the first segment; maintaining the first segment in a plurality of contiguous fixed size extents in the storage, wherein after the first segment is deleted, the plurality of contiguous fixed size extents in the storage used to maintain the now deleted first segment are used to maintain portions of a plurality of new segments without having to perform recompaction. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A system comprising:
-
an interface operable to receive a data stream; a hardware processor operable to perform segment boundary identification and segment fingerprinting to generate a plurality of variable size segments including a first segment and determining whether the first segment is maintained in storage; wherein a first number of fixed sized extents in the storage needed to maintain the segment is determined; wherein the processor is further operable to maintain the first segment in a plurality of contiguous fixed size extents in the storage, wherein after the first segment is deleted, the plurality of contiguous fixed size extents in the storage used to maintain the now deleted first segment are used to maintain portions of a plurality of new segments without having to perform recompaction. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. Anon-transitory computer readable storage medium comprising instructions for:
-
receiving a data stream at an interface; performing segment boundary identification and segment fingerprinting to generate a plurality of variable size segments including a first segment; determining whether the first segment is maintained in storage; determining a first number of fixed sized extents in the storage needed to maintain the first segment; maintaining the first segment in a plurality of contiguous fixed size extents in the storage, wherein after the first segment is deleted, the plurality of contiguous fixed size extents in the storage used to maintain the now deleted first segment are used to maintain portions of a plurality of new segments without having to perform recompaction. - View Dependent Claims (20)
-
Specification