System and method for targeted consistency improvement in a distributed storage system
First Claim
1. A computer-implemented method for targeted consistency improvement within a distributed storage system, the method comprising:
- utilizing multiple memory-efficient sets to generate consistency metrics for different replicated data objects stored within the distributed storage system, wherein the consistency metrics are based on a measure of inconsistent read operations performed on the replicated data objects;
wherein an inconsistent read operation includes retrieving from a replicated data object, a value that is older than the most recent write operation performed on that replicated data object;
identifying a subset of the replicated data objects having consistency metrics that indicate a lower level of consistency than the consistency metrics of other replicated data objects; and
for each given replicated data object of said subset;
identifying multiple individual data objects that represent the given replicated data object within the distributed data store;
identifying the individual data object storing a most recent value relative to values of the other individual data objects; and
writing the most recent value to each of the other individual data objects.
1 Assignment
0 Petitions
Accused Products
Abstract
Embodiments may include a consistency measurement component that utilizes memory-efficient sets (e.g., Bloom filters) to generate consistency metrics for read operations performed on different replicated data objects of distributed storage system. Based on the consistency metrics, the consistency measurement component may identify a subset of replicated data objects associated with low levels of consistency. The consistency measurement component may target this subset for consistency improvement by generating instructions to improve the consistency of the subset. In other cases, the consistency measurement component may notify a consistency improvement component about the targeted subset. In response, the consistency improvement component may generate instructions to improve the consistency of the targeted subset.
41 Citations
27 Claims
-
1. A computer-implemented method for targeted consistency improvement within a distributed storage system, the method comprising:
-
utilizing multiple memory-efficient sets to generate consistency metrics for different replicated data objects stored within the distributed storage system, wherein the consistency metrics are based on a measure of inconsistent read operations performed on the replicated data objects; wherein an inconsistent read operation includes retrieving from a replicated data object, a value that is older than the most recent write operation performed on that replicated data object; identifying a subset of the replicated data objects having consistency metrics that indicate a lower level of consistency than the consistency metrics of other replicated data objects; and for each given replicated data object of said subset; identifying multiple individual data objects that represent the given replicated data object within the distributed data store; identifying the individual data object storing a most recent value relative to values of the other individual data objects; and writing the most recent value to each of the other individual data objects. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A computer-readable storage medium, storing program instructions computer-executable on a computer system to:
-
utilize multiple memory-efficient sets to generate consistency metrics for different replicated data objects stored within a distributed storage system, wherein the consistency metrics are based on a measure of inconsistent read operations performed on the replicated data objects;
wherein an inconsistent read operation includes retrieving from a replicated data object, a value that is older than the most recent write operation performed on that replicated data object;identify a subset of the replicated data objects having consistency metrics that indicate a lower level of consistency than the consistency metrics of other replicated data objects; and for each given replicated data object of said subset, perform one or more operations to improve the data consistency of that replicated data object. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A system, comprising:
-
a memory; and one or more processors coupled to the memory, wherein the memory comprises program instructions executable by the one or more processors to; utilize multiple memory-efficient sets to generate consistency metrics for different replicated data objects stored within a distributed storage system, wherein the consistency metrics are based on a measure of inconsistent read operations performed on the replicated data objects;
wherein an inconsistent read operation includes retrieving from a replicated data object, a value that is older than the most recent write operation performed on that replicated data object;identify a subset of the replicated data objects having consistency metrics that indicate a lower level of consistency than the consistency metrics of other replicated data objects; and provide an indication of the replicated data objects within said subset to a consistency improvement component configured to improve consistency of replicated data objects. - View Dependent Claims (20, 21, 22, 23, 24, 25, 26, 27)
-
Specification