Storing chunks in containers
First Claim
Patent Images
1. A method comprising:
- storing, in a system having a processor, chunks in a first container of a data store, wherein the chunks are produced by dividing input data as part of a deduplication process,determining, by the system, whether a size of the first container has reached a predefined size threshold; and
in response to determining that the size of the first container has reached the predefined size threshold, moving at least one of the chunks in the first container to a second container, wherein moving the at least one of the chunks in the first container to the second container comprises;
redistributing chunks of a plurality of containers including the first container and the second container, wherein after the redistributing, each of the plurality of containers has a respective size below a second predefined size threshold.
2 Assignments
0 Petitions
Accused Products
Abstract
Chunks are stored in a container of a data store, where the chunks are produced by dividing input data as part of a deduplication process. In response to determining that the size of the container has reached a predefined size threshold, at least one of the chunks in the container is moved to another container.
70 Citations
18 Claims
-
1. A method comprising:
-
storing, in a system having a processor, chunks in a first container of a data store, wherein the chunks are produced by dividing input data as part of a deduplication process, determining, by the system, whether a size of the first container has reached a predefined size threshold; and in response to determining that the size of the first container has reached the predefined size threshold, moving at least one of the chunks in the first container to a second container, wherein moving the at least one of the chunks in the first container to the second container comprises; redistributing chunks of a plurality of containers including the first container and the second container, wherein after the redistributing, each of the plurality of containers has a respective size below a second predefined size threshold. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A method comprising:
-
storing, in a system having a processor, chunks in a first container of a data store, wherein the chunks are produced by dividing input data as part of a deduplication process, determining, by the system, whether a size of the first container has reached a predefined size threshold; in response to determining that the size of the first container has reached the predefined size threshold, moving at least one of the chunks in the first container to a second container, wherein moving the at least one of the chunks in the first container to the second container comprises; splitting, by the system, the first container into plural containers, wherein the second container is one of the plural containers; and tracking a number of times the first container has been part of a split, wherein a number of the plural containers split from the first container depends on the tracked number of times.
-
-
10. A method comprising:
-
storing, in a system having a processor, chunks in a first container of a data store, wherein the chunks are produced by dividing input data as part of a deduplication process, determining, by the system, whether a size of the first container has reached a predefined size threshold; in response to determining that the size of the first container has reached the predefined size threshold, moving at least one of the chunks in the first container to a second container; and maintaining neighbor information for at least one of the chunks in the first container, wherein the neighbor information identifies which of the chunks are known to be neighbors of others of the chunks, wherein the neighbor information further indicates, for a given pair of neighbors, which of the neighbors in the pair is on the left side.
-
-
11. A method comprising:
-
storing, in a system having a processor, chunks in a first container of a data store, wherein the chunks are produced by dividing input data as part of a deduplication process, determining, by the system, whether a size of the first container has reached a predefined size threshold; and in response to determining that the size of the first container has reached the predefined size threshold, moving at least one of the chunks in the first container to a second container, wherein moving the at least one chunk is performed as part of a housekeeping procedure that removes garbage chunks associated with deleted data.
-
-
12. A computer system comprising:
-
a storage media to store a data store having containers of chunks; and at least one processor to; receive input data; divide the input data into chunks as part of a deduplication process; determine which of the input data chunks correspond to chunks already present in the data store, and which of the input data chunks are new; in response to identifying a new input data chunk, store the new input data chunk into one of the containers according to a neighbor condition; in response to determining that a size of a particular one of the containers has reached a predefined size threshold, moving at least one chunk from the particular container to another container; rewrite a given one of the containers to remove garbage chunks from the given container; determine whether a size of the given container is smaller than a second predefined size threshold; and in response to determining that the size of the given container is smaller than the second predefined size threshold, merge the given container with another container.
-
-
13. A computer system comprising:
-
a storage media to store a data store having containers of chunks; and at least one processor to; receive input data; divide the input data into chunks as part of a deduplication process; determine which of the input data chunks correspond to chunks already present in the data store, and which of the input data chunks are new; in response to identifying a new input data chunk, store the new input data chunk into one of the containers according to a neighbor condition; and in response to determining that a size of a particular one of the containers has reached a predefined size threshold, moving at least one chunk from the particular container to another container, wherein moving of the at least one chunk from the particular container to the another container is performed by redistributing chunks of a plurality of containers including the particular container and the another container, wherein after the redistributing, each of the plurality of containers has a respective size below a second predefined size threshold. - View Dependent Claims (14, 15, 16)
-
-
17. An article comprising at least one computer-readable storage medium storing instructions that upon execution cause a computer system to:
-
store chunks in a first container of a data store, wherein the chunks are produced by dividing input data as part of a deduplication process; determine whether a size of the first container has reached a predefined size threshold; and in response to determining that the size of the first container has reached the predefined size threshold, move at least one of the chunks in the first container to a second container, wherein the data store further includes a third container and a fourth container storing corresponding chunks; merge the third and fourth containers into a resulting container; and split the resulting container into plural containers. - View Dependent Claims (18)
-
Specification