Content aligned block-based deduplication
First Claim
Patent Images
1. A method for defining deduplication block alignments within a data segment, the method comprising:
- iteratively performing a block alignment function on data within a sliding window in the data segment and, for each iterative performance of the block alignment function;
in response to determining that an output of the block alignment function performed on a current window of data satisfies one or more predetermined criteria;
establishing, with one or more computer processors, a deduplication data block having a predetermined block size; and
moving the sliding window relative to the data segment by an amount based at least in part on the predetermined block size before performing a next consecutive iteration; and
in response to determining that the output of the block alignment function performed on the current window of data does not satisfy the one or more predetermined criteria;
moving the sliding window relative to the data segment by an incremental amount that is distinct from the predetermined block size before performing the next consecutive iteration and without establishing a deduplication data block, wherein gaps of data not belonging to any deduplication data block exist between established deduplication data blocks following performance of the block alignment function across the data segment.
2 Assignments
0 Petitions
Accused Products
Abstract
A content alignment system according to certain embodiments aligns a sliding window at the beginning of a data segment. The content alignment system performs a block alignment function on the data within the sliding window. A deduplication block is established if the output of the block alignment function meets a predetermined criteria. At least part of a gap is established if the output of the block alignment function does not meet the predetermined criteria. The predetermined criteria is changed if a threshold number of outputs fail to meet the predetermined criteria.
341 Citations
18 Claims
-
1. A method for defining deduplication block alignments within a data segment, the method comprising:
iteratively performing a block alignment function on data within a sliding window in the data segment and, for each iterative performance of the block alignment function; in response to determining that an output of the block alignment function performed on a current window of data satisfies one or more predetermined criteria; establishing, with one or more computer processors, a deduplication data block having a predetermined block size; and moving the sliding window relative to the data segment by an amount based at least in part on the predetermined block size before performing a next consecutive iteration; and in response to determining that the output of the block alignment function performed on the current window of data does not satisfy the one or more predetermined criteria; moving the sliding window relative to the data segment by an incremental amount that is distinct from the predetermined block size before performing the next consecutive iteration and without establishing a deduplication data block, wherein gaps of data not belonging to any deduplication data block exist between established deduplication data blocks following performance of the block alignment function across the data segment. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
10. A deduplication system configured to define deduplication block alignments within a data segment, the system comprising:
-
one or more processors implemented at least partially by hardware; and a deduplication block alignment module of executable instructions that when executed by the one or more processors, causes the one or more processors to iteratively perform a deduplication block alignment function on data within a sliding window in the data segment and, for each iterative performance of the deduplication block alignment function; based at least in part on a determination that an output of the deduplication block alignment function performed on the data within the sliding window satisfies one or more predetermined criteria, establish a deduplication block having a predetermined block size and move the sliding window relative to the data segment by an amount based at least in part on the predetermined block size before performing a next consecutive iteration; and based at least in part on a determination that the output of the deduplication block alignment function performed on the data within the sliding window does not satisfy the one or more predetermined criteria, define at least a portion of data not belonging to the deduplication block and move the sliding window relative to the data segment by an incremental amount that is different from the predetermined block size before performing the next consecutive iteration. - View Dependent Claims (11, 12, 13)
-
-
14. A deduplication system configured to define deduplication block alignments within a data segment, the system comprising:
-
one or more processors implemented at least partially by hardware; and a deduplication block alignment module of executable instructions that when executed by the one or more processors, causes the one or more processors to iteratively perform a deduplication block alignment function on data within a sliding window in the data segment and, for each iterative performance of the deduplication block alignment function determine whether an output of the deduplication block alignment function performed on the data within the sliding window satisfies one or more predetermined criteria, wherein based at least in part on a determination that the output of the deduplication block alignment function performed on the data within the sliding window satisfies the one or more predetermined criteria, the deduplication block alignment module causes the one or more processors to establish a first deduplication block having a predetermined block size, and based at least in part on a determination that the output of the deduplication block alignment function performed on a current window of data does not satisfy the one or more predetermined criteria for a threshold number of consecutive iterations, the deduplication block alignment module further causes the one or more processors to establish a second deduplication data block having the predetermined block size, and move the sliding window relative to the data segment by an incremental amount that is different from the predetermined block size.
-
-
15. A method for defining deduplication block alignments within a data segment, the method comprising:
iteratively performing a deduplication block alignment function on data within a sliding window in the data segment and, for each iterative performance of the deduplication block alignment function; establishing, with one or more computer processors, a deduplication data block having a predetermined block size and moving the sliding window relative to the data segment by an amount that is based at least in part on the predetermined block size before performing a next consecutive iteration in response to determining that an output of the deduplication block alignment function performed on the data within the sliding window satisfies one or more predetermined criteria; and defining, with the one or more computer processors, at least a portion of a gap of data not belonging to the deduplication data block and moving the sliding window relative to the data segment by an incremental amount that is different from the predetermined block size before performing the next consecutive iteration in response to determining that the output of the deduplication block alignment function performed on the data within the sliding window does not satisfy the one or more predetermined criteria. - View Dependent Claims (16, 17)
-
18. A method for defining deduplication block alignments within a data segment, the method comprising:
iteratively performing a deduplication block alignment function on data within a sliding window in the data segment and, for each iterative performance of the deduplication block alignment function determining whether an output of the deduplication block alignment function performed on the data within the sliding window satisfies one or more predetermined criteria, wherein based at least in part on a determination that the output of the deduplication block alignment function performed on the data within the sliding window satisfies one or more predetermined criteria, establishing, with one or more computer processors, a first deduplication data block having a predetermined block size, and based at least in part on a determination that the output of the deduplication block alignment function performed on the data within the sliding window does not satisfy the one or more predetermined criteria, establishing, with the one or more computer processors, a second deduplication data block having the predetermined block size, and moving the sliding window relative to the data segment by an incremental amount that is different from the predetermined block size.
Specification