DYNAMICALLY CHANGING MEMBERS OF A CONSENSUS GROUP IN A DISTRIBUTED SELF-HEALING COORDINATION SERVICE
First Claim
1. A method for managing a consensus group in a distributed computing cluster, the method comprising:
- determining that an instance of an authority module executing on a first node, of a consensus group of nodes in the distributed computing cluster, has failed; and
adding, by an instance of the authority module on a second node of the consensus group, a new node to the consensus group to replace the first node, wherein the new node comprises a node in the computing cluster that was not a member of the consensus group at the time the instance of the authority module executing on the first node is determined to have failed.
2 Assignments
0 Petitions
Accused Products
Abstract
Systems, methods, and computer program products for managing a consensus group in a distributed computing cluster, by determining that an instance of an authority module executing on a first node, of a consensus group of nodes in the distributed computing cluster, has failed; and adding, by an instance of the authority module on a second node of the consensus group, a new node to the consensus group to replace the first node. The new node is a node in the computing cluster that was not a member of the consensus group at the time the instance of the authority module executing on the first node is determined to have failed.
-
Citations
21 Claims
-
1. A method for managing a consensus group in a distributed computing cluster, the method comprising:
-
determining that an instance of an authority module executing on a first node, of a consensus group of nodes in the distributed computing cluster, has failed; and adding, by an instance of the authority module on a second node of the consensus group, a new node to the consensus group to replace the first node, wherein the new node comprises a node in the computing cluster that was not a member of the consensus group at the time the instance of the authority module executing on the first node is determined to have failed. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A distributed computing cluster, comprising:
-
a plurality of nodes configured to provide a service to clients, each node having a processor and a memory; a consensus group of nodes formed from a subset of the plurality of nodes, the consensus group of nodes, each executing an instance of an authority module which performs operations for managing the consensus group, the operation, comprising; determining that an instance of an authority module executing on a first node of the consensus group, has failed, and adding, by an instance of the authority module on a second node of the consensus group, a new node to the consensus group to replace the first node, wherein the new node comprises a node in the computing cluster that was not a member of the consensus group at the time the instance of the authority module executing on the first node is determined to have failed. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A computer-readable storage medium storing instructions, which, when executed on a processor, perform operations for managing a consensus group in a distributed computing cluster, the operation comprising:
-
determining that an instance of an authority module executing on a first node, of a consensus group of nodes in the distributed computing cluster, has failed; and adding, by an instance of the authority module on a second node of the consensus group, a new node to the consensus group to replace the first node, wherein the new node comprises a node in the computing cluster that was not a member of the consensus group at the time the instance of the authority module executing on the first node is determined to have failed. - View Dependent Claims (16, 17, 18, 19, 20, 21)
-
Specification