Method and System for Coordinated Multiple Cluster Failover
15 Assignments
0 Petitions
Accused Products
Abstract
Hyperclusters are a cluster of clusters. Each cluster has associated with it one or more resource groups, and independent node failures within the clusters are handled by platform specific clustering software. The management of coordinated failovers across dependent or independent resources running on heterogeneous platforms is contemplated. A hypercluster manager running on all of the nodes in a cluster communicates with platform specific clustering software regarding any failure conditions, and utilizing a rule-based decision making system, determines actions to take on the node. A plug-in extends exit points definable in non-hypercluster clustering technologies. The failure notification is passed to other affected resource groups in the hypercluster.
-
Citations
40 Claims
-
1-32. -32. (canceled)
-
33. A method for coordinating availability of data processing resources between a first cluster of nodes each controlled by a respective first cluster manager and multiple other clusters of nodes each controlled by a respective second cluster manager, the method comprising:
-
receiving a disruption signal from an exit program of one of the first cluster managers, the disruption signal being representative of a disruption event associated with a specific one of the nodes of the first cluster, the disruption signal being received by a first hypercluster manager of the specific one of the nodes of the first cluster; deriving a local action code from a hypercluster rules list, the local action code corresponding to the disruption event and containing a cluster activation sequence for regulating the operation of the multiple other clusters of nodes; executing the cluster activation sequence on the nodes of the first cluster; requesting a universal token from the first hypercluster manager and the multiple other clusters of nodes in response to executing the cluster activation sequence on the specific one of the nodes of the first cluster; and transmitting the local action code to the multiple other clusters of nodes upon receipt of the universal token, each of the nodes of the multiple other clusters of nodes including a second hypercluster manager for execution of the cluster activation sequence thereon; wherein the first cluster of nodes and the multiple other clusters of nodes each function autonomously and communicate with each other by the local action code. - View Dependent Claims (34, 35, 36, 37, 38, 39, 40)
-
Specification