Fixed size extents for variable size deduplication segments
First Claim
Patent Images
1. A method comprising:
- receiving a data stream at an interface;
segmenting at least a portion of the data stream to generate a plurality of variable size segments including a first segment;
determining whether the first segment is maintained in storage;
identifying a predetermined size value for a fixed size extent;
determining a first number of fixed sized extents in the storage needed to maintain the first segment in view of the predetermined size value; and
maintaining the first segment in a plurality of fixed size extents in the storage,wherein after the first segment is deleted, the plurality of fixed size extents in the storage used to maintain the now deleted first segment are used to maintain at least one portion of a new segment.
6 Assignments
0 Petitions
Accused Products
Abstract
Mechanisms are provided for maintaining variable size deduplication segments using fixed size extents. Variable size segments are identified and maintained in a datastore suitcase. Duplicate segments need not be maintained redundantly but can be managed by updating reference counts associated with the segments in the datastore suitcase. Segments are maintained using fixed size extents. A minor increase in storage overhead removes the need for inefficient recompaction when a segment is removed from the datastore suitcase. Fixed size extents can be reallocated for storage of new segments.
-
Citations
20 Claims
-
1. A method comprising:
-
receiving a data stream at an interface; segmenting at least a portion of the data stream to generate a plurality of variable size segments including a first segment; determining whether the first segment is maintained in storage; identifying a predetermined size value for a fixed size extent; determining a first number of fixed sized extents in the storage needed to maintain the first segment in view of the predetermined size value; and maintaining the first segment in a plurality of fixed size extents in the storage, wherein after the first segment is deleted, the plurality of fixed size extents in the storage used to maintain the now deleted first segment are used to maintain at least one portion of a new segment. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A system comprising:
-
an interface operable to receive a data stream; a processor operable to; generate, from at least a portion of the data stream, a plurality of variable size segments including a first segment; determine whether the first segment is maintained in storage; identify a predetermined size value for a fixed size extent; determine a first number of fixed sized extents in the storage needed to maintain the first segment in view of the predetermined size value; and maintain the first segment in a plurality of fixed size extents in the storage, wherein after the first segment is deleted, the plurality of fixed size extents in the storage used to maintain the now deleted first segment are used to maintain at least one portion of a new segment. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A non-transitory computer readable storage medium comprising instructions that, when executed by a processor, cause a system to perform operations comprising:
-
receive a data stream at an interface; segment at least a portion of the data stream to generate a plurality of variable size segments including a first segment; determine whether the first segment is maintained in storage; identify a predetermined size value for a fixed size extent; determine a first number of fixed sized extents in the storage needed to maintain the first segment in view of the predetermined size value; and maintain the first segment in a plurality of fixed size extents in the storage, wherein after the first segment is deleted, the plurality of fixed size extents in the storage used to maintain the now deleted first segment are used to maintain at least one portion of a new segment. - View Dependent Claims (20)
-
Specification