FAST CLUSTER FAILURE DETECTION

  • US 20110219263A1
  • Filed: 03/04/2010
  • Published: 09/08/2011
  • Est. Priority Date: 12/30/2009
  • Status: Active Grant
First Claim

1. A method for fast failure detection in a distributed computer system, comprising:

  • executing a distributed computer system having a plurality of clusters comprising at least a first cluster, a second cluster and a third cluster;

    initializing failure detection by creating a connected cluster list in each of the plurality of clusters, wherein for each one of the plurality of clusters, a respective connected cluster list describes others of the plurality of clusters said each one is communicatively connected with;

    sending a status update message upon a change in connectivity between the plurality of clusters;

    generating an updated connected cluster list in each of the plurality of clusters in accordance with the status update message; and

    determining whether the change in connectivity is a result of a cluster failure by examining the updated connected cluster list in each of the plurality of clusters.
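
The steps of claim 1 can be illustrated with a minimal sketch. It is not the patented implementation: the claim says only that the updated connected cluster lists are examined, so the decision rule used here (a cluster that no longer appears in any other cluster's list is treated as failed, while one that is missing from only some lists is treated as a link failure) is an assumption made for illustration. The names `Cluster`, `initialize`, `apply_status_update` and `classify` are hypothetical, as is the choice to start from a fully connected system.

```python
from dataclasses import dataclass, field


@dataclass
class Cluster:
    name: str
    # The "connected cluster list": names of clusters this one can reach.
    connected: set = field(default_factory=set)


def initialize(names):
    """Create one connected cluster list per cluster.

    Assumption: every cluster starts out connected to every other cluster.
    """
    return {n: Cluster(n, {m for m in names if m != n}) for n in names}


def apply_status_update(clusters, reporter, peer, reachable):
    """Update the reporter's list from a status update message about `peer`."""
    if reachable:
        clusters[reporter].connected.add(peer)
    else:
        clusters[reporter].connected.discard(peer)


def classify(clusters, suspect):
    """Examine every other cluster's updated list to classify `suspect`.

    Assumed rule: absent from all lists -> cluster failure;
    absent from only some lists -> link failure.
    """
    others = [c for c in clusters.values() if c.name != suspect]
    seen_by = [c.name for c in others if suspect in c.connected]
    if not seen_by:
        return "cluster failure"
    if len(seen_by) == len(others):
        return "connected"
    return "link failure"


clusters = initialize(["first", "second", "third"])

# Only the first cluster loses contact with the third.
apply_status_update(clusters, "first", "third", reachable=False)
after_one_report = classify(clusters, "third")

# The second cluster loses contact with the third as well.
apply_status_update(clusters, "second", "third", reachable=False)
after_two_reports = classify(clusters, "third")
```

Under the assumed rule, `after_one_report` is `"link failure"` because the second cluster still lists the third, and `after_two_reports` is `"cluster failure"` because no remaining cluster lists it.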

  • 6 Assignments