Identifying modified chunks in a data set for storage
First Claim
1. A method for processing modifications to a data set in storage, comprising:
- maintaining information on a data set of variable length chunks, including a digest of each chunk and information to locate the chunk in the data set;
receiving modifications to at least one of the chunks in the data set;
determining at least one range of least one of the chunks including data affected by the modifications, wherein each range identifies one chunk or sequential chunks having data affected by the modifications;
processing the at least one chunk in each range to determine at least one new chunk in each range, and for each determined new chunk, a digest of the new chunk;
determining whether at least one chunk outside of the at least one range has changed;
for each determined at least one chunk outside of the at least one range that has changed, determining at least one new chunk and a new digest of the at least one new chunk; and
adding to the set information the new digest information on the at least one new chunk and information to locate the new chunk in the data set.
0 Assignments
0 Petitions
Accused Products
Abstract
Provided are a computer program product, system, and method for identifying modified chunks in a data set for storage. Modifications are received to at least one of the chunks in the data set. A determination is made of at least one range of least one of the chunks including data affected by the modifications determination is made as to whether at least one chunk outside of the at least one range has changed. For each determined at least one chunk outside of the at least one range that has changed, a determination is made of at least one new chunk and a new digest of the at least one new chunk and information is added on the at least one new chunk and information to locate the new chunk in the data set.
-
Citations
9 Claims
-
1. A method for processing modifications to a data set in storage, comprising:
-
maintaining information on a data set of variable length chunks, including a digest of each chunk and information to locate the chunk in the data set; receiving modifications to at least one of the chunks in the data set; determining at least one range of least one of the chunks including data affected by the modifications, wherein each range identifies one chunk or sequential chunks having data affected by the modifications; processing the at least one chunk in each range to determine at least one new chunk in each range, and for each determined new chunk, a digest of the new chunk; determining whether at least one chunk outside of the at least one range has changed; for each determined at least one chunk outside of the at least one range that has changed, determining at least one new chunk and a new digest of the at least one new chunk; and adding to the set information the new digest information on the at least one new chunk and information to locate the new chunk in the data set. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
Specification