Cluster node distress signal
First Claim
1. A node in a cluster of nodes, the node comprising:
- at least one processor;
a memory coupled to at least one processor;
a cluster node distress signaler residing in said memory, the cluster node distress signaler including a prebuilt distress message indicating failure of the node on a cluster, the cluster node distress signaler sending the prebuilt distress message to the other nodes in the cluster when a failure event is detected.
1 Assignment
0 Petitions
Accused Products
Abstract
The preferred embodiment of the present invention provides a cluster node distress system and method that improves the reliability of a cluster. The cluster node distress system provides a cluster node distress signal when a node on the cluster is about to fail. This allows the cluster to better to determine whether a non-communicating node has failed or has merely been partitioned from the cluster. The preferred cluster node distress system is embedded deeply into the operating system and provides a pre-built node distress signal that can be quickly sent to other nodes in the cluster when an imminent failure of that node is detected, improving the probability that the node distress signal will get out before the node totally fails. When the node distress signal is effectively sent to other nodes in the cluster, the cluster can accurately determine that the node has failed and has not just partitioned from the cluster. This allows the cluster to respond correctly, i.e., by assigning other nodes primary responsibility, with less intervention needed by administrators.
-
Citations
38 Claims
-
1. A node in a cluster of nodes, the node comprising:
-
at least one processor;
a memory coupled to at least one processor;
a cluster node distress signaler residing in said memory, the cluster node distress signaler including a prebuilt distress message indicating failure of the node on a cluster, the cluster node distress signaler sending the prebuilt distress message to the other nodes in the cluster when a failure event is detected. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A cluster node apparatus comprising:
-
at least one processor;
a memory coupled to at least one processor;
a cluster node distress signaler residing in said memory, the cluster node distress signaler including a prebuilt distress message indicating failure of the cluster node apparatus, wherein the cluster node distress signaler includes a method for sending the prebuilt distress message asynchronously that uses a current execution task and a method for sending the prebuilt distress message synchronously method uses a waiting dedicated distress signal execution task, and wherein the cluster node distress signaler sends the prebuilt distress message to the other nodes in the cluster when a failure event is detected. - View Dependent Claims (10, 11, 12, 13)
-
-
14. A method comprising the steps of:
-
providing a prebuilt distress message indicating a node on a cluster is experiencing a failure event; and
sending the prebuilt distress message when an impending failure event is detected within the node. - View Dependent Claims (15, 16, 17, 18)
-
-
19. A method comprising the steps of:
-
providing a prebuilt distress message indicating a node on a cluster is experiencing a failure event;
providing a dedicated distress signal task to process and send the prebuilt distress message;
detecting an impending failure event within the node;
asynchronously sending the prebuilt distress message to other nodes in the cluster when sufficient time before the detected impending failure event exists and otherwise sending the prebuilt distress message to the other nodes in the synchronously; and
assuring the node removes itself from the cluster after the cluster distress signal has been sent to the other nodes in the cluster. - View Dependent Claims (20, 21)
-
-
22. A program product comprising:
-
(A) a cluster node distress signaler including a prebuilt distress message indicating failure of a node on a cluster, the cluster node distress signaler sending the prebuilt distress message from the node to the other nodes in the cluster when a failure event is detected; and
(B) signal bearing media bearing said cluster node distress signaler. - View Dependent Claims (23, 24, 25, 26, 27, 28, 29, 30, 31)
-
-
32. A program product comprising:
-
(A) a cluster node distress signaler including a prebuilt distress message indicating failure of a cluster node apparatus, wherein the cluster node distress signaler includes a method for sending the prebuilt distress message asynchronously that uses a current execution task and a method for sending the prebuilt distress message synchronously method uses a waiting dedicated distress signal execution task, and wherein the cluster node distress signaler sends the prebuilt distress message from the cluster node apparatus to the other nodes in the cluster when a failure event is detected; and
(B) signal bearing media bearing the cluster node distress signaler. - View Dependent Claims (33, 34, 35, 36, 37, 38)
-
Specification