Managing dereferenced chunks in a deduplication system
First Claim
Patent Images
1. A implemented-complemented method for maintaining, by a processor, data objects in a storage space, comprising:
- maintaining a chunk index having information on chunks in the storage space referenced in data objects, wherein the chunk index includes a reference count for each chunk indicating a number of the data objects in which the chunk is referenced and a reference measurement representing a level of the data objects references to the chunk;
selecting one chunk to remove from the storage space based on a criteria applied to the reference measurements of chunks having reference counts indicating that the chunks are not referenced in one of the data objects in the storage space, wherein the reference measurement of each chunk comprises a time most recently dereferenced indicating a time when the reference count for the chunk was decremented to indicate that the chunk is not referenced in one of the data objects; and
returning indication of the selected chunk to remove from the storage space.
0 Assignments
0 Petitions
Accused Products
Abstract
A chunk index has information on chunks in a storage space referenced in objects in the storage space. The chunk index includes a reference count for each chunk indicating a number of objects in which the chunk is referenced and a reference measurement representing a level of data object references to the chunk. One chunk is selected to remove from the storage space based on a criteria applied to the reference measurements of chunks having reference counts indicating that the chunks are not referenced in one object in the storage space.
-
Citations
21 Claims
-
1. A implemented-complemented method for maintaining, by a processor, data objects in a storage space, comprising:
-
maintaining a chunk index having information on chunks in the storage space referenced in data objects, wherein the chunk index includes a reference count for each chunk indicating a number of the data objects in which the chunk is referenced and a reference measurement representing a level of the data objects references to the chunk; selecting one chunk to remove from the storage space based on a criteria applied to the reference measurements of chunks having reference counts indicating that the chunks are not referenced in one of the data objects in the storage space, wherein the reference measurement of each chunk comprises a time most recently dereferenced indicating a time when the reference count for the chunk was decremented to indicate that the chunk is not referenced in one of the data objects; and returning indication of the selected chunk to remove from the storage space. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A computer program product for maintaining data objects in a storage space, the computer program product comprising a computer readable storage medium, including a hardware component, having computer readable program code embodied therein that executes to perform operations, the operations comprising:
-
maintaining a chunk index having information on chunks in the storage space referenced in the data objects, wherein the chunk index includes a reference count for each chunk indicating a number of the data objects in which the chunk is referenced and a reference measurement representing a level of the data objects references to the chunk; selecting one chunk to remove from the storage space based on a criteria applied to the reference measurements of chunks having reference counts indicating that the chunks are not referenced in one of the data objects in the storage space, wherein the reference measurement of each chunk comprises a time most recently dereferenced indicating a time when the reference count for the chunk was decremented to indicate that the chunk is not referenced in one of the data objects; and returning indication of the selected chunk to remove from the storage space. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A system for maintaining data objects in a storage space, comprising:
-
a computer readable storage medium including a chunk index having information on chunks in the storage space referenced in objects, wherein the chunk index includes a reference count for each chunk indicating a number of the data objects in which the chunk is referenced and a reference measurement representing a level of the data objects references to the chunk; a processor executing code to perform operations comprising selecting one chunk to remove from the storage space based on a criteria applied to the reference measurements of chunks having reference counts indicating that the chunks are not referenced in one of the data objects in the storage space, wherein the reference measurement of each chunk comprises a time most recently dereferenced indicating a time when the reference count for the chunk was decremented to indicate that the chunk is not referenced in one of the data objects; and returning indication of the selected chunk to remove from the storage space. - View Dependent Claims (16, 17, 18, 19, 20, 21)
-
Specification