Methods and systems for file replication utilizing differences between versions of files
First Claim
1. A computer implemented method for comparing data of a first and a revised version of a file to determine differences between the versions, comprising:
- segmenting the first and the revised versions into blocks of data of equal size; and
moving a reference frame of a set resolution across adjacent portions of one of the blocks of the revised version to obtain plural coarse signatures of the one of the blocks of the revised version for comparing to a coarse signature of a comparable one of the blocks of the first version;
recursively moving the reference frame by shifting the reference frame in increments of one byte or one bit until a match of the comparing of the coarse signatures is found, if possible; and
upon finding the match, comparing a fine signature of the one of the blocks of the revised version to a fine signature of the comparable one of the blocks of the first version.
5 Assignments
0 Petitions
Accused Products
Abstract
Methods and systems for efficient file replication are provided. In some embodiments, one or more coarse signatures for blocks in a base file are compared with those coarse signatures for blocks of a revised file, until a match is found. A fine signature is then generated for the matching block of the revised file and compared to a fine signature of the base file. Thus, fine signatures are not computed unless a coarse signature match has been found, thereby minimizing unneeded time-consuming fine signature calculations. Methods are also provided for determining whether to initiate a delta file generation algorithm, or whether to utilize a more efficient replication method, based upon system and/or file parameters. In accordance with additional embodiments, the lengths of valid data on physical blocks are obtained from physical block mappings for the files, and these lengths and mappings are utilized for delta file generation, to minimize unnecessary signature computations.
-
Citations
15 Claims
-
1. A computer implemented method for comparing data of a first and a revised version of a file to determine differences between the versions, comprising:
-
segmenting the first and the revised versions into blocks of data of equal size; and moving a reference frame of a set resolution across adjacent portions of one of the blocks of the revised version to obtain plural coarse signatures of the one of the blocks of the revised version for comparing to a coarse signature of a comparable one of the blocks of the first version; recursively moving the reference frame by shifting the reference frame in increments of one byte or one bit until a match of the comparing of the coarse signatures is found, if possible; and upon finding the match, comparing a fine signature of the one of the blocks of the revised version to a fine signature of the comparable one of the blocks of the first version. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A computer implemented method for comparing data of a first and a revised version of a file to determine differences between the versions, comprising:
-
segmenting the first and the revised versions into blocks of data of equal size; moving a reference frame of a set resolution across adjacent portions of one of the blocks of the revised version to obtain plural coarse signatures of the one of the blocks of the revised version for comparing to a coarse signature of a comparable one of the blocks of the first version, the set resolution of the reference frame being a number of bits less than a second number of bits defining said equal size of the blocks of data; recursively moving the reference frame until a match of the comparing of the coarse signatures is found, if possible; and upon finding the match, further including calculating a fine signature of some of the data of the revised version. - View Dependent Claims (10, 11, 12, 14, 15)
-
-
13. The method of 12, wherein the bridging the reference frame further includes accessing data in different memory locations.
Specification