Detecting and reconciling system resource metadata anomolies in a distributed storage system
First Claim
1. A system, comprising:
- a plurality of compute nodes implemented on one or more computing devices and configured to implement a distributed storage service;
wherein a control plane node of the plurality of compute nodes is configured to maintain system resource metadata for the distributed storage service;
wherein at least one storage node of the plurality of compute nodes is configured to detect one or more resource metadata anomalies;
wherein, to detect one or more resource metadata anomalies, the at least one storage node is configured to;
request resource metadata specific to the at least one storage node from the control plane node;
receive the resource metadata specific to the at least one storage node;
compare the received resource metadata with local resource information maintained at the least one storage node;
based, at least in part, on the comparison, identify the one or more resource metadata anomalies; and
wherein, the at least one storage node is further configured to;
for each of the detected one or more resource metadata anomalies, perform a corrective operation to reconcile the at least one storage node with the system resource metadata.
1 Assignment
0 Petitions
Accused Products
Abstract
A system that implements detection and reconciliation of system resource metadata for a distributed storage system is described. A node may obtain resource metadata specific to the node from another node that maintains system resource metadata for a distributed storage system. Based on the resource metadata specific to the node, a determination may be made that the node is not reconciled with the system resource metadata. A corrective operation may be performed to reconcile the node with the system resource metadata. A corrective operation may include terminating a resource, making unavailable a resource, modifying resource attributes, or sending a resource metadata update to system resource metadata for correction.
26 Citations
20 Claims
-
1. A system, comprising:
-
a plurality of compute nodes implemented on one or more computing devices and configured to implement a distributed storage service; wherein a control plane node of the plurality of compute nodes is configured to maintain system resource metadata for the distributed storage service; wherein at least one storage node of the plurality of compute nodes is configured to detect one or more resource metadata anomalies; wherein, to detect one or more resource metadata anomalies, the at least one storage node is configured to; request resource metadata specific to the at least one storage node from the control plane node; receive the resource metadata specific to the at least one storage node; compare the received resource metadata with local resource information maintained at the least one storage node; based, at least in part, on the comparison, identify the one or more resource metadata anomalies; and wherein, the at least one storage node is further configured to; for each of the detected one or more resource metadata anomalies, perform a corrective operation to reconcile the at least one storage node with the system resource metadata. - View Dependent Claims (2, 3, 4)
-
-
5. A method, comprising:
performing, by one or more computing devices implementing a node for a distributed system; obtaining resource metadata specific to the node from another node of the distributed system, wherein the other node maintains system resource metadata for the distributed system; based, at least in part, on the resource metadata specific to the node, determining that the node is not reconciled with the system resource metadata; and in response to determining that the node is not reconciled with the system resource metadata, performing a corrective operation to reconcile the node with the system resource metadata. - View Dependent Claims (6, 7, 8, 9, 10, 11, 12)
-
13. A non-transitory, computer-readable storage medium, storing program instructions that when executed by one or more computing devices implementing a storage node of distributed storage service cause the storage node to implement:
-
obtaining resource metadata specific to the storage node from another node of a plurality of nodes including the storage node together implementing the distributed storage service, wherein the other node maintains system resource metadata for the distributed storage service; based, at least in part, on the resource metadata specific to the storage node, determining that the storage node is not reconciled with the system resource metadata; and in response to determining that the storage node is not reconciled with the system resource metadata, performing a corrective operation to reconcile the storage node with the system resource metadata. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20)
-
Specification