Confirming data consistency in a data storage environment
First Claim
1. A method for confirming validity of replicated data at a data storage site, the method comprising:
- a) replicating first data from a first computer readable storage medium at a first data storage site as second data to a second computer readable storage medium at a second data storage site;
b) transmitting a hash function, executable by a computer processor, and a hash seed via a computer network from at least one of the first or second data storage sites to the other of the first or second data storage sites so that the hash function and hash seed are available for use by both first and second data storage sites;
c) modifying the first data stored on the first computer readable storage medium at the first data storage site based on the hash seed, and utilizing the hash function, computing a first hash value based on the seed-modified first data, the first hash value being smaller in size than the first data;
d) modifying the second data stored on the second computer readable storage medium at the second data storage site based on the hash seed, and utilizing the hash function, computing a second hash value based on the seed-modified second data, the second hash value being smaller in size than the second data;
e) transmitting at least one of the first or second hash values via a computer network for comparing with the other of the first or second hash values, instead of transmitting the larger sized first or second data via the computer network; and
f) comparing the first and second hash values, in lieu of comparing the actual first and second data, to determine whether the second data is a valid replication of the first data, wherein a mismatch between the first and second hash values indicates that at least one of the first or second data storage sites includes invalid data.
15 Assignments
0 Petitions
Accused Products
Abstract
A method for confirming replicated data at a data site, including utilizing a hash function, computing a first hash value based on first data at a first data site and utilizing the same hash function, computing a second hash value based on second data at a second data site, wherein the first data had previously been replicated from the first data site to the second data site as the second data. The method also includes comparing the first and second hash values to determine whether the second data is a valid replication of the first data. In additional embodiments, the first data may be modified based on seed data prior to computing the first hash value and the second data may be modified based on the same seed data prior to computing the second hash value. The process can be repeated to increase reliability of the results.
-
Citations
13 Claims
-
1. A method for confirming validity of replicated data at a data storage site, the method comprising:
-
a) replicating first data from a first computer readable storage medium at a first data storage site as second data to a second computer readable storage medium at a second data storage site; b) transmitting a hash function, executable by a computer processor, and a hash seed via a computer network from at least one of the first or second data storage sites to the other of the first or second data storage sites so that the hash function and hash seed are available for use by both first and second data storage sites; c) modifying the first data stored on the first computer readable storage medium at the first data storage site based on the hash seed, and utilizing the hash function, computing a first hash value based on the seed-modified first data, the first hash value being smaller in size than the first data; d) modifying the second data stored on the second computer readable storage medium at the second data storage site based on the hash seed, and utilizing the hash function, computing a second hash value based on the seed-modified second data, the second hash value being smaller in size than the second data; e) transmitting at least one of the first or second hash values via a computer network for comparing with the other of the first or second hash values, instead of transmitting the larger sized first or second data via the computer network; and f) comparing the first and second hash values, in lieu of comparing the actual first and second data, to determine whether the second data is a valid replication of the first data, wherein a mismatch between the first and second hash values indicates that at least one of the first or second data storage sites includes invalid data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. An information handling system comprising:
-
a first data storage site comprising a computer readable storage medium storing first data, and a computer processor computing a first hash value based on the first data and a hash seed, utilizing a hash function; and a second data storage site comprising a computer readable storage medium storing data replicated from the first data storage site and a computer processor computing a second hash value based on second data and a hash seed, utilizing the same hash function; wherein at least one of the first data storage site and second data storage site is configured to transmit the hash function and hash seed via a computer network to the other of the first data storage site or second data storage site so that the hash function and hash seed are available for use by both first and second data storage sites; and wherein at least one of the first data storage site and second data storage site is configured to transmit its computed hash value via the computer network to the other of the first data storage site and second data storage site for comparison of the first hash value with the second hash value, in lieu of comparing the actual first and second data, to determine whether the second data is a valid replication of the first data, wherein a mismatch between the first and second hash values indicates that at least one of the first or second data storage sites includes invalid data. - View Dependent Claims (10)
-
-
11. A method for confirming validity of replicated data at a data storage site, the method comprising:
-
a) replicating first data from a first computer readable storage medium at a first data storage site as second data to a second computer readable storage medium at a second data storage site; b) transmitting a hash function, executable by a computer processor, and a hash seed via a computer network from at least one of the first or second data storage sites to the other of the first or second data storage sites so that the hash function and hash seed are available for use by both first and second data storage sites; c) modifying a selected portion of the first data stored on the first computer readable storage medium at the first data storage site based on the hash seed, and utilizing the hash function, computing a first hash value based on the seed-modified selected portion of the first data; d) modifying a selected portion of the second data stored on the second computer readable storage medium at the second data storage site based on the hash seed, and utilizing the hash function, computing a second hash value based on the seed-modified selected portion of the second data, the selected portion of the second data corresponding to the selected portion of the first data; e) transmitting at least one of the first or second hash values via a computer network for comparing with the other of the first or second hash values, instead of transmitting the larger sized first or second data via the computer network; f) comparing the first and second hash values, in lieu of comparing the actual first and second data, to determine whether the selected portion of second data is a valid replication of the selected portion of first data; and f) repeating steps c) through f) a plurality of times, each time utilizing a different selected portion of the first data and corresponding selected portion of the second data than in a previous time, wherein a mismatch between the first and second hash values indicates that at least one of the first or second data storage sites includes invalid data. - View Dependent Claims (12, 13)
-
Specification