Hybrid cluster recovery techniques
First Claim
1. A system, comprising:
- memory storing program instructions that, if executed, cause one or more processors to:
determine that a replacement master node of a first cluster either cannot be elected by nodes of the first cluster using a first election protocol, or has a probability below a particular threshold of being elected by the nodes of the first cluster using the first election protocol;
select, responsive to the determination, and based at least in part on an indication of more progress of replication of one or more data items at a first node of the first cluster than at one or more other nodes of the first cluster, the first node as the replacement master node; and
transmit, by a first recovery manager to at least one node of the first cluster, an indication that the first node has been selected as the replacement master node.
Abstract
At a recovery manager associated with a cluster, a determination is made as to whether a replacement for a first node of the cluster can be elected by the other nodes of the cluster using a first election protocol. The recovery manager selects a second node of the cluster as a replacement for the first node, based on data item replication progress made at the second node, and transmits an indication that the second node has been selected to one or more nodes of the cluster.
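The flow described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the `NodeStatus` record, the `replicated_seq` progress metric, the strict-majority rule standing in for the "first election protocol", and the `MASTER_SELECTED` message shape are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple


@dataclass(frozen=True)
class NodeStatus:
    node_id: str
    reachable: bool      # did the node answer the recovery manager's query?
    replicated_seq: int  # hypothetical progress metric: highest replicated sequence number


def election_possible(statuses: List[NodeStatus], cluster_size: int) -> bool:
    """Assumed rule: the election protocol needs a strict majority of the cluster."""
    reachable = sum(1 for s in statuses if s.reachable)
    return reachable > cluster_size // 2


def select_replacement(statuses: List[NodeStatus]) -> Optional[NodeStatus]:
    """Pick the reachable node that has made the most replication progress."""
    candidates = [s for s in statuses if s.reachable]
    if not candidates:
        return None
    # Ties are broken deterministically in favor of the larger node_id.
    return max(candidates, key=lambda s: (s.replicated_seq, s.node_id))


def recover(statuses: List[NodeStatus], cluster_size: int) -> List[Tuple[str, Dict[str, str]]]:
    """Return (recipient, message) pairs, or [] when a normal election can proceed."""
    if election_possible(statuses, cluster_size):
        return []
    chosen = select_replacement(statuses)
    if chosen is None:
        return []
    message = {"type": "MASTER_SELECTED", "master": chosen.node_id}
    return [(s.node_id, message) for s in statuses if s.reachable]
```

In this sketch a five-node cluster with only two reachable nodes cannot form a majority, so the recovery manager steps in and notifies both survivors that the one with the higher `replicated_seq` is the replacement.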
20 Claims
1. A system, comprising:
- memory storing program instructions that, if executed, cause one or more processors to:
determine that a replacement master node of a first cluster either cannot be elected by nodes of the first cluster using a first election protocol, or has a probability below a particular threshold of being elected by the nodes of the first cluster using the first election protocol;
select, responsive to the determination, and based at least in part on an indication of more progress of replication of one or more data items at a first node of the first cluster than at one or more other nodes of the first cluster, the first node as the replacement master node; and
transmit, by a first recovery manager to at least one node of the first cluster, an indication that the first node has been selected as the replacement master node.
Dependent claims: 2, 3, 4, 5
6. A method, comprising:
determining that a replacement master node of a first cluster either cannot be elected by nodes of the first cluster using a first election protocol, or has a probability below a particular threshold of being elected by the nodes of the first cluster using the first election protocol;
selecting, responsive to the determination, a first node of the first cluster as the replacement master node of the first cluster, wherein said selecting is based at least in part on an indication of more progress of replication of one or more data items at the first node than at one or more other nodes of the first cluster; and
transmitting, by a first recovery manager to at least one node of the first cluster, an indication that the first node has been selected as the replacement master node.
Dependent claims: 7, 8, 9, 10, 11, 12, 13, 14, 15
16. A non-transitory computer-accessible storage medium storing program instructions that when executed on one or more processors cause the one or more processors to perform a method comprising:
determining, based at least in part on one or more queries directed to one or more nodes of a first cluster, that a replacement for a first node of the first cluster either cannot be elected by nodes of the first cluster using a first election protocol, or has a probability below a particular threshold of being elected by the nodes of the first cluster using the first election protocol;
selecting, responsive to the determination, a second node of the one or more nodes as the replacement for the first node, wherein said selecting is based at least in part on an indication of more progress of replication of one or more data items at the second node of the first cluster than at one or more other nodes of the first cluster; and
transmitting, by a first recovery manager to at least one node of the first cluster, an indication that the second node has been selected as the replacement for the first node.
Dependent claims: 17, 18, 19, 20
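The claims do not say how the "probability below a particular threshold" is computed. One possible reading, shown here purely as an assumption, is that query responses yield a per-node availability estimate, and the recovery manager intervenes when the chance of assembling a majority falls below the threshold. The function names, the independence assumption, and the majority quorum are hypothetical.

```python
from typing import List


def quorum_probability(p_available: List[float], quorum: int) -> float:
    """Probability that at least `quorum` nodes are available.

    Assumes each node is available independently with the given probability
    (for example, estimated from the fraction of recent queries it answered).
    """
    # dist[k] = probability that exactly k of the nodes seen so far are available
    dist = [1.0]
    for p in p_available:
        nxt = [0.0] * (len(dist) + 1)
        for k, prob in enumerate(dist):
            nxt[k] += prob * (1.0 - p)  # this node is unavailable
            nxt[k + 1] += prob * p      # this node is available
        dist = nxt
    return sum(dist[quorum:])


def needs_recovery_manager(p_available: List[float], threshold: float) -> bool:
    """True when a majority-based election is too unlikely to succeed."""
    quorum = len(p_available) // 2 + 1  # assumed: strict majority of the cluster
    return quorum_probability(p_available, quorum) < threshold
```

With three nodes that each answer half of the queries, the chance of a two-node majority is 0.5, so a threshold of 0.9 would hand the decision to the recovery manager.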
Specification