Method for detecting the quick restart of liveness daemons in a distributed multinode data processing system
First Claim
1. A method for detecting the quick restart of liveness daemons in a distributed, multinode data processing system in which nodes communicate liveness indicia in the form of heartbeat signals via adapters coupled to each node, said method comprising the steps of:
- sending, from a first node to other nodes that are not in the sender'"'"'s membership group, a first message which includes at least indicia of occurrence of a quick restart; and
determining, from said indicia of occurence of said quick restart and from locally stored group membership information, the existence of a quick restart at said first node, and responding thereto by sending a second message which indicates that said first node is to be expelled from the group.
1 Assignment
0 Petitions
Accused Products
Abstract
In distributed multinode data processing systems, mechanisms are employed to insure that the nodes are properly informed about the liveness of the other nodes in node groups in the network. In particular, the present invention employs group membership indicia as part of a mechanism for detecting that a node and/or its adapter have failed and have been recently restarted. Having detected this situation, group membership inconsistencies which it can engender are avoided.
42 Citations
5 Claims
-
1. A method for detecting the quick restart of liveness daemons in a distributed, multinode data processing system in which nodes communicate liveness indicia in the form of heartbeat signals via adapters coupled to each node, said method comprising the steps of:
-
sending, from a first node to other nodes that are not in the sender'"'"'s membership group, a first message which includes at least indicia of occurrence of a quick restart; and
determining, from said indicia of occurence of said quick restart and from locally stored group membership information, the existence of a quick restart at said first node, and responding thereto by sending a second message which indicates that said first node is to be expelled from the group. - View Dependent Claims (2, 3)
-
-
4. A multinode data processing system comprising:
-
a plurality of data processing nodes connected in a network capable of transmitting messages between nodes;
storage means within said nodes containing program code for sending, from a first node to other nodes that are not in the sender'"'"'s membership group a first message which includes at least indicia of occurrence of a quick restart and for determining, from said indicia of occurrence of said quick restart and from group membership information in storage at at least one recipient node, the existence of said quick restart at said first node, and responding thereto by sending a second message which indicates that said first node is to be expelled from the group.
-
-
5. A machine readable medium containing program code for use in a multinode data processing system for sending, from a first node to other nodes that are not in the sender'"'"'s membership group a first message which includes at least indicia of occurrence of a quick restart and for determining, from said indicia of occurrence of said quick restart and from group membership information in storage at at least one recipient node, the existence of a quick restart at said first node, and responding thereto by sending a second message which indicates that said first node is to be expelled from the group
Specification