Systems and methods for selective data replication
First Claim
1. A computer-implemented method for performing data replication, the computer-implemented method comprising:
- performing a first assessment with computer hardware on a first set of files stored on a first storage device that is associated with a source system and on a second set of files stored on a second storage device that is associated with a destination system in networked communication with the source system, the second set of files corresponding to the first set of files and having been previously replicated from the first set of files, the first assessment comprising,comparing one or more attributes of the first set of files with one or more corresponding attributes of the second set of files, anddetermining whether a first attribute from the one or more attributes of a file from the first set of files differs from a corresponding attribute of the one or more corresponding attributes of a corresponding file from the second set of files;
in response to determining that the first attribute differs from the corresponding attribute, determining with the computer hardware whether a size of the file satisfies a size threshold value;
in response to determining that the size of the file satisfies the size threshold value, performing with the computer hardware a second assessment on the file based at least in part on a checksum for the file; and
based on the results of the second assessment, determining whether to replicate the file from the first storage device to the second storage device.
4 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods for performing data replication are disclosed. Determining whether to update replicated data typically involves comparison of readily obtainable attributes of a given source file and its corresponding replicated file. Such attributes can be obtained from, for example, metadata. In certain situations, an additional assessment of the source and replicated files can be beneficial. For example, if integrity of an existing replicated file'"'"'s content is maintained, one may not want to re-replicate the corresponding source file. For large source files, such a decision can provide substantial reductions in expenditures of available computing and network resources. In certain embodiments, a threshold for identifying such large files can be based on one or more operating parameters such as network type and available bandwidth. In certain embodiments, replication file'"'"'s integrity can be checked by calculating and comparing checksums for the replication file and its corresponding source file.
-
Citations
20 Claims
-
1. A computer-implemented method for performing data replication, the computer-implemented method comprising:
-
performing a first assessment with computer hardware on a first set of files stored on a first storage device that is associated with a source system and on a second set of files stored on a second storage device that is associated with a destination system in networked communication with the source system, the second set of files corresponding to the first set of files and having been previously replicated from the first set of files, the first assessment comprising, comparing one or more attributes of the first set of files with one or more corresponding attributes of the second set of files, and determining whether a first attribute from the one or more attributes of a file from the first set of files differs from a corresponding attribute of the one or more corresponding attributes of a corresponding file from the second set of files; in response to determining that the first attribute differs from the corresponding attribute, determining with the computer hardware whether a size of the file satisfies a size threshold value; in response to determining that the size of the file satisfies the size threshold value, performing with the computer hardware a second assessment on the file based at least in part on a checksum for the file; and based on the results of the second assessment, determining whether to replicate the file from the first storage device to the second storage device. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A data replication system comprising:
a replication agent comprising computer hardware, the replication agent in communication with a source storage system and a destination storage system, the replication agent configured to; perform a first assessment of a file stored on the source storage system to determine whether the file has a metadata attribute that differs from a corresponding metadata attribute of a corresponding file on the destination storage system; in response to a determination that the metadata attribute differs from the corresponding metadata attribute, the replication agent is further configured to; obtain a size of the file; and compare the size of the file to a size threshold value; in response to a determination that the size of the file does not satisfy the size threshold value, the replication agent is further configured to replicate the file from the source storage device to the destination storage device regardless of whether a checksum for the file matches a checksum for the corresponding file; and in response to a determination that the size of the file satisfies the size threshold value, the replication agent is further configured perform a second assessment of the file based at least in part on the checksum for the file. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16, 17, 18)
-
19. Non-transitory computer storage comprising instructions which, when executed, cause the computing system to perform steps comprising:
-
performing a first assessment on a first file stored on a first storage device that is associated with a source system and on a second file stored on a second storage device that is associated with a destination system, the second file corresponding to the first file and having been previously replicated from the first file, the first assessment comprising, comparing an attribute of the first file with a corresponding attribute of the second file, and determining whether the attribute from the first file differs from the corresponding attribute of the second file; in response to determining that the attribute differs from the corresponding attribute, determining whether a size of the first file satisfies a threshold value; in response to determining that the size of the first file satisfies the threshold value, performing a second assessment on the first file based at least in part on the checksum for the first file; and based on the results of the second assessment, determining whether to replicate the file from the first storage device to the second storage device. - View Dependent Claims (20)
-
Specification