Method and Device for Processing Failure in at Least One Distributed Cluster, and System
First Claim
1. A method for processing a failure in at least one distributed cluster, comprising:
receiving, by a first secondary node, a heartbeat message from a first reference node, wherein the heartbeat message comprises first indication information indicating that the first reference node is disconnected from a first Master node;
determining, by the first secondary node according to the first indication information, that the first reference node is disconnected from the first Master node;
detecting, by the first secondary node, whether a heartbeat message from the first Master node to the first secondary node is interrupted;
determining, by the first secondary node, that the first secondary node is also disconnected from the first Master node when the heartbeat message from the first Master node to the first secondary node is interrupted; and
determining, by the first secondary node, that the first Master node disconnected from both the first secondary node and the first reference node is faulty,
wherein the at least one distributed cluster comprises a first distributed cluster, and
wherein the first distributed cluster comprises the first Master node, a first Slave node, the first reference node, and the first secondary node that serves as a backup of the first Master node.
Abstract
A method and a device for processing a failure in at least one distributed cluster, and a system, where the at least one distributed cluster includes a first distributed cluster. The first distributed cluster includes a first Master node, a first Slave node, a first reference node, and a first secondary node that serves as a backup of the first Master node. The first secondary node receives a heartbeat message that includes first indication information. The first secondary node determines, according to the first indication information, that the first reference node is disconnected from the first Master node. The first secondary node determines that the first secondary node is also disconnected from the first Master node when it is detected that a heartbeat message from the first Master node to the first secondary node is interrupted. The first secondary node determines the first Master node is faulty.
22 Claims
1. A method for processing a failure in at least one distributed cluster, comprising:
receiving, by a first secondary node, a heartbeat message from a first reference node, wherein the heartbeat message comprises first indication information indicating that the first reference node is disconnected from a first Master node;
determining, by the first secondary node according to the first indication information, that the first reference node is disconnected from the first Master node;
detecting, by the first secondary node, whether a heartbeat message from the first Master node to the first secondary node is interrupted;
determining, by the first secondary node, that the first secondary node is also disconnected from the first Master node when the heartbeat message from the first Master node to the first secondary node is interrupted; and
determining, by the first secondary node, that the first Master node disconnected from both the first secondary node and the first reference node is faulty,
wherein the at least one distributed cluster comprises a first distributed cluster, and
wherein the first distributed cluster comprises the first Master node, a first Slave node, the first reference node, and the first secondary node that serves as a backup of the first Master node.
Dependent claims: 2, 3, 4, 5.

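The decision rule recited in claim 1 can be illustrated with a minimal sketch. It is not taken from the patent: the class name, the Heartbeat fields, and the node identifiers "M1" and "R1" are hypothetical, and the "first indication information" is assumed to be modeled as a set of peers the sender reports it has lost. The secondary node treats the Master as faulty only when both observations agree: the reference node reports a lost Master, and the secondary node's own Master heartbeat is interrupted.

```python
from dataclasses import dataclass


@dataclass
class Heartbeat:
    """Hypothetical heartbeat payload; field names are illustrative only."""
    sender: str
    # Peers the sender reports being disconnected from. This stands in for
    # the claim's "first indication information" (an assumption of this sketch).
    disconnected_from: frozenset = frozenset()


class SecondaryNode:
    """Sketch of the backup node's view of the Master."""

    def __init__(self, master: str, reference: str):
        self.master = master
        self.reference = reference
        self.reference_lost_master = False
        self.master_heartbeat_interrupted = False

    def on_heartbeat(self, hb: Heartbeat) -> None:
        if hb.sender == self.reference:
            # Read the indication carried by the reference node's heartbeat.
            self.reference_lost_master = self.master in hb.disconnected_from
        elif hb.sender == self.master:
            # A heartbeat from the Master means our own link is alive.
            self.master_heartbeat_interrupted = False

    def on_master_heartbeat_missed(self) -> None:
        # Called by some external timer when the Master's heartbeat stops.
        self.master_heartbeat_interrupted = True

    def master_is_faulty(self) -> bool:
        # Both the reference node and this node must have lost the Master.
        return self.reference_lost_master and self.master_heartbeat_interrupted


node = SecondaryNode(master="M1", reference="R1")
node.on_heartbeat(Heartbeat("R1", frozenset({"M1"})))
node.on_master_heartbeat_missed()
print(node.master_is_faulty())
```

Requiring two independent observers is what separates a failed Master from a single broken link between the Master and one of its peers.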
6. A method for processing a failure in at least one distributed cluster, comprising:
receiving, by a first Master node, a heartbeat message from a first secondary node, wherein the heartbeat message comprises third indication information indicating that the first secondary node is disconnected from a first Slave node;
determining, by the first Master node according to the third indication information, that the first secondary node is disconnected from the first Slave node;
detecting, by the first Master node, whether a heartbeat message from the first Slave node to the first Master node is interrupted;
determining, by the first Master node, that the first Master node is also disconnected from the first Slave node when it is detected that the heartbeat message from the first Slave node to the first Master node is interrupted; and
determining, by the first Master node, that the first Slave node disconnected from both the first Master node and the first secondary node is faulty,
wherein the at least one distributed cluster comprises a first distributed cluster, and
wherein the first distributed cluster comprises the first Master node, the first Slave node, a first reference node, and the first secondary node that serves as a backup of the first Master node.
Dependent claims: 7, 8, 9, 10.

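Claim 6 does not say how an "interrupted" heartbeat is detected. One common approach, assumed here purely for illustration, is a silence timeout: a peer counts as interrupted once no heartbeat has arrived within a fixed window. The HeartbeatMonitor class, the slave_is_faulty helper, the 3-second timeout, and the identifier "S1" are all hypothetical. Times are passed in explicitly so the example is deterministic.

```python
class HeartbeatMonitor:
    """Tracks when each peer was last heard from (timeout-based, an assumption)."""

    def __init__(self, timeout: float):
        self.timeout = timeout
        self.last_seen = {}

    def record(self, peer: str, now: float) -> None:
        # Call whenever a heartbeat from `peer` arrives.
        self.last_seen[peer] = now

    def interrupted(self, peer: str, now: float) -> bool:
        # A peer that was never heard from is treated as interrupted here;
        # a real system might instead allow a start-up grace period.
        last = self.last_seen.get(peer)
        return last is None or now - last > self.timeout


def slave_is_faulty(monitor: HeartbeatMonitor, slave: str,
                    secondary_reports_slave_lost: bool, now: float) -> bool:
    """Master-side rule: the secondary node's report plus the Master's own timeout."""
    return secondary_reports_slave_lost and monitor.interrupted(slave, now)


monitor = HeartbeatMonitor(timeout=3.0)
monitor.record("S1", now=10.0)
# Secondary reports the Slave lost, but the Master heard from it 2 s ago.
print(slave_is_faulty(monitor, "S1", True, now=12.0))
# Four seconds of silence exceeds the timeout, so both conditions now hold.
print(slave_is_faulty(monitor, "S1", True, now=14.0))
```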
11. A device for processing a failure in at least one distributed cluster, comprising:
a first receiver configured to receive a heartbeat message comprising a first indication information from a first reference node;
a processor coupled to the first receiver and configured to:
determine, according to the first indication information received by the first receiver, that the first reference node is disconnected from a first Master node;
detect whether a heartbeat message from the first Master node to the device is interrupted;
determine that the device is also disconnected from the first Master node when the heartbeat message from the first Master node to the device is interrupted; and
determine, that the first Master node is faulty,
wherein the at least one distributed cluster comprises a first distributed cluster,
wherein the first distributed cluster comprises the first Master node, a first Slave node, the first reference node, and a first secondary node that serves as a backup of the first Master node, and
wherein the device is the first secondary node.
Dependent claims: 12, 13, 14, 15.

16. A device for processing a failure in at least one distributed cluster, comprising:
a first receiver configured to receive a heartbeat message from a first secondary node comprising third indication information;
a processor coupled to the first receiver and configured to:
determine, according to the third indication information received by the first receiver, that the first secondary node is disconnected from a first Slave node;
detect whether a heartbeat message from the first Slave node to the device is interrupted;
determine that the device is also disconnected from the first Slave node when the heartbeat message from the first Slave node to the device is interrupted; and
determine, that the first Slave node is faulty,
wherein the at least one distributed cluster comprises a first distributed cluster,
wherein the first distributed cluster comprises a first Master node, the first Slave node, a first reference node, and the first secondary node that serves as a backup of the first Master node, and
wherein the device is the first Master node.
Dependent claims: 17, 18, 19, 20.

21. A distributed cluster system, comprising a first distributed cluster, wherein the first distributed cluster comprises:
a first Master node;
a first Slave node coupled to the first Master node;
a first reference node coupled to the first Master node and the first Slave node; and
a first secondary node coupled to the first Master node, the first Slave node and the first reference node and serves as a backup of the first Master node,
wherein the first Master node, the first secondary node, and the first reference node are configured to send a heartbeat message to each other,
wherein the first reference node is further configured to send a heartbeat message to the first Slave node,
wherein the first Slave node is configured to separately send a heartbeat message to the first Master node and the first secondary node,
wherein the first reference node is further configured to send a heartbeat message to the first secondary node when it is detected that the heartbeat message from the first Master node to the first reference node is interrupted,
wherein the heartbeat message comprises first indication information indicating that the first reference node is disconnected from the first Master node, and
wherein the first secondary node comprises:
a first receiver configured to receive the heartbeat message from the first reference node comprising the first indication information; and
a processor coupled to the first receiver and configured to:
determine, according to the first indication information received by the first receiver, that the first reference node is disconnected from the first Master node;
detect whether a heartbeat message from the first Master node to the first secondary node is interrupted;
determine that the first secondary node is also disconnected from the first Master node when the heartbeat message from the first Master node to the first secondary node is interrupted; and
determine, that the first Master node is faulty.

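The heartbeat topology recited in claim 21 is easier to check when written out as directed sender-to-receiver links. The sketch below is illustrative only: the node labels, the heartbeat_links and reference_heartbeat functions, and the dictionary message layout are assumptions, not part of the claim. It enumerates only the links the claim explicitly recites and shows the reference node attaching the disconnection indication when its Master heartbeat is interrupted.

```python
from itertools import permutations

# Illustrative node labels (assumptions, not from the claim text).
MASTER, SLAVE, REFERENCE, SECONDARY = "master", "slave", "reference", "secondary"


def heartbeat_links() -> set:
    """Directed (sender, receiver) heartbeat links recited in claim 21."""
    # Master, secondary, and reference nodes send heartbeats to each other.
    links = set(permutations((MASTER, SECONDARY, REFERENCE), 2))
    # The reference node also sends heartbeats to the Slave node.
    links.add((REFERENCE, SLAVE))
    # The Slave node sends heartbeats to the Master and to the secondary node.
    links.update({(SLAVE, MASTER), (SLAVE, SECONDARY)})
    return links


def reference_heartbeat(master_heartbeat_interrupted: bool) -> dict:
    """Heartbeat from the reference node to the secondary node.

    The 'disconnected_from' field is a hypothetical encoding of the
    claim's "first indication information".
    """
    return {
        "sender": REFERENCE,
        "receiver": SECONDARY,
        "disconnected_from": [MASTER] if master_heartbeat_interrupted else [],
    }


for sender, receiver in sorted(heartbeat_links()):
    print(f"{sender} -> {receiver}")
print(reference_heartbeat(master_heartbeat_interrupted=True))
```

Under this reading the claim recites nine directed links. It does not recite a heartbeat from the Slave node to the reference node, or from the Master or secondary node to the Slave node, so the sketch omits them.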
22. A distributed cluster system, comprising a first distributed cluster, wherein the first distributed cluster comprises:
a first Master node;
a first Slave node coupled to the first Master node;
a first reference node coupled to the first Master node and the first Slave node; and
a first secondary node coupled to the first Master node, the first Slave node and the first reference node and serves as a backup of the first Master node,
wherein the first Master node, the first secondary node, and the first reference node are configured to send a heartbeat message to each other,
wherein the first reference node is further configured to send a heartbeat message to the first Slave node,
wherein the first Slave node is configured to separately send a heartbeat message to the first Master node and the first secondary node,
wherein the first secondary node is further configured to send a heartbeat message to the first Master node when it is detected that the heartbeat message from the first Slave node to the first secondary node is interrupted,
wherein the heartbeat message comprises third indication information indicating that the first secondary node is disconnected from the first Slave node, and
wherein the first Master node comprises:
a first receiver configured to receive the heartbeat message from the first secondary node comprising the third indication information;
a processor coupled to the first receiver and configured to:
determine, according to the third indication information received by the first receiver, that the first secondary node is disconnected from the first Slave node;
detect whether the heartbeat message from the first Slave node to the first Master node is interrupted;
determine that the first Master node is also disconnected from the first Slave node when the heartbeat message from the first Slave node to the first Master node is interrupted; and
determine, that the first Slave node is faulty.

Specification