×

Dynamic rate heartbeating for inter-node status updating

  • US 9,219,621 B2
  • Filed: 03/21/2012
  • Issued: 12/22/2015
  • Est. Priority Date: 12/03/2010
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method for determining node operating status among a cluster of nodes of a computer system, the method comprising:

  • transmitting gossip messages directly between pairs of nodes of the cluster, the gossip messages containing an indication of operational status of nodes of the cluster other than the nodes corresponding to the pair of nodes between which the gossip message is communicated, wherein the transmitting is performed periodically according to a heartbeat rate;

    receiving the gossip messages at a receiving one of the corresponding pair of nodes and storing indications of communications delays for the gossip messages;

    computing statistics of the communications delay for the gossip messages, wherein the computing computes a mean and mean deviation of the communications delays, and wherein the mean of the communications delay is a mean round-trip communications time (MRT) computed according to the formula MRT(t)=0.875*MRT(t−

    Δ

    t)+0.125*TMEAS, where Δ

    t is a period corresponding to the heartbeat rate and TMEAS is the most-recently measured round-trip communications time, and wherein the mean deviation of the communications delay D is computed according to D(t)=0.125*D(t−

    Δ

    t)+0.875*ERR, where ERR=|MRT|−

    TMEAS;

    adjusting parameters for node status monitoring according to the computed statistics, wherein the adjusting parameters of the node status monitoring comprises adjusting a threshold maximum number of missed receptions of the receiving used to determine whether a node is operational according to the mean round-trip communications time and the mean deviation of the communications delay; and

    monitoring the operational status of the nodes according to the indications of communications delay, the parameters, and the operational status of the other nodes in the cluster as communicated by the gossip messages.

View all claims
  • 0 Assignments
Timeline View
Assignment View
    ×
    ×