Systems and methods for managing references in deduplicating data systems
First Claim
1. A computer-implemented method for managing references in deduplicating data systems, at least a portion of the method being performed by a computing device comprising at least one processor, the method comprising:
- identifying a first instance of a data segment stored within a deduplicating data system that reduces redundant data storage by storing at least two data objects such that each of the two data objects references the first instance of the data segment;
identifying an additional data object to be stored by the deduplicating data system, wherein the additional data object comprises the data segment;
determining that an age limit of the first instance of the data segment has been reached, wherein the age limit of the first instance of the data segment;
limits the amount of time during which the first instance of the data segment is available to be referenced; and
indicates an age of the first instance of the data segment after which the first instance of the data segment cannot be referenced by any additional data objects that are later stored to the deduplicating data system;
preventing storage-space fragmentation within the deduplicating data system by storing, based at least in part on determining that the age limit of the first instance of the data segment has been reached, a second instance of the data segment within the deduplicating data system that is referenced by the additional data object while also retaining the first instance of the data segment within the deduplicating data system.
7 Assignments
0 Petitions
Accused Products
Abstract
A computer-implemented method for managing references in deduplicating data systems may include (1) identifying a first instance of a data segment stored within a deduplicating data system that reduces redundant data storage by storing at least two data objects such that each of the two data objects references the first instance of the data segment, (2) identifying an additional data object to be stored by the deduplicating data system that includes the data segment, (3) determining whether a reference limit associated with the first instance of the data segment has been reached, and (4) storing, based at least in part on determining that the reference limit associated with the first instance of the data segment has been reached, a second instance of the data segment within the deduplicating data system. Various other methods, systems, and computer-readable media are also disclosed.
-
Citations
20 Claims
-
1. A computer-implemented method for managing references in deduplicating data systems, at least a portion of the method being performed by a computing device comprising at least one processor, the method comprising:
-
identifying a first instance of a data segment stored within a deduplicating data system that reduces redundant data storage by storing at least two data objects such that each of the two data objects references the first instance of the data segment; identifying an additional data object to be stored by the deduplicating data system, wherein the additional data object comprises the data segment; determining that an age limit of the first instance of the data segment has been reached, wherein the age limit of the first instance of the data segment; limits the amount of time during which the first instance of the data segment is available to be referenced; and indicates an age of the first instance of the data segment after which the first instance of the data segment cannot be referenced by any additional data objects that are later stored to the deduplicating data system; preventing storage-space fragmentation within the deduplicating data system by storing, based at least in part on determining that the age limit of the first instance of the data segment has been reached, a second instance of the data segment within the deduplicating data system that is referenced by the additional data object while also retaining the first instance of the data segment within the deduplicating data system. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system for managing references in deduplicating data systems, the system comprising:
-
an identifying module that identifies; a first instance of a data segment stored within a deduplicating data system that reduces redundant data storage by storing at least two data objects such that each of the two data objects references the first instance of the data segment; an additional data object to be stored by the deduplicating data system, wherein the additional data object comprises the data segment; a determining module that determines that an age limit of the first instance of the data segment has been reached, wherein the age limit of the first instance of the data segment; limits the amount of time during which the first instance of the data segment is available to be referenced; and indicates an age of the first instance of the data segment after which the first instance of the data segment cannot be referenced by any additional data objects that are later stored to the deduplicating data system; a storing module that prevents storage-space fragmentation within the deduplicating data system by storing, based at least in part on determining that the age limit of the first instance of the data segment has been reached, a second instance of the data segment within the deduplicating data system that is referenced by the additional data object while also retaining the first instance of the data segment within the deduplicating data system; at least one processor that executes the identifying module, the determining module, and the storing module. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A non-transitory computer-readable medium comprising one or more computer-executable instructions that, when executed by at least one processor of a computing device, cause the computing device to:
-
identify a first instance of a data segment stored within a deduplicating data system that reduces redundant data storage by storing at least two data objects such that each of the two data objects references the first instance of the data segment; identify an additional data object to be stored by the deduplicating data system, wherein the additional data object comprises the data segment; determine that an age limit of the first instance of the data segment has been reached, wherein the age limit of the first instance of the data segment; limits the amount of time during which the first instance of the data segment is available to be referenced; and indicates an age of the first instance of the data segment after which the first instance of the data segment cannot be referenced by any additional data objects that are later stored to the deduplicating data system; prevent storage-space fragmentation within the deduplicating data system by causing the computing device to store, based at least in part on determining that the age limit of the first instance of the data segment has been reached, a second instance of the data segment within the deduplicating data system that is referenced by the additional data object while also retaining the first instance of the data segment within the deduplicating data system. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification