Method and System for Coordinated Multiple Cluster Failover

US 20100241896A1
Filed: 05/28/2010
Published: 09/23/2010
Est. Priority Date: 04/04/2007
Status: Active Grant


Abstract

A hypercluster is a cluster of clusters. Each cluster has associated with it one or more resource groups, and independent node failures within the clusters are handled by platform-specific clustering software. The management of coordinated failovers across dependent or independent resources running on heterogeneous platforms is contemplated. A hypercluster manager running on all of the nodes in a cluster communicates with the platform-specific clustering software regarding any failure conditions and, using a rule-based decision-making system, determines actions to take on the node. A plug-in extends exit points definable in non-hypercluster clustering technologies. The failure notification is passed to other affected resource groups in the hypercluster.
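
The rule-based decision described in the abstract can be illustrated with a minimal Python sketch. It is not taken from the patent: the platform labels, event names, action names, resource group names, and the `on_exit_point` function are all hypothetical, chosen only to show a plug-in at an exit point looking up a rule and producing notifications for affected resource groups.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rule:
    action: str            # hypothetical action to take on the local node
    notify_groups: tuple   # resource groups that should hear about the failure


# Hypothetical rules keyed by (platform, platform-specific event name).
RULES = {
    ("platform_a", "NODE_DOWN"): Rule("failover_local", ("db_group", "web_group")),
    ("platform_b", "SERVICE_FAILED"): Rule("restart_service", ("web_group",)),
}

# Fallback when no rule matches: take no failover action, notify nobody.
DEFAULT_RULE = Rule("log_only", ())


def on_exit_point(platform, event):
    """Simulate the plug-in being invoked by platform clustering software.

    Returns the local action plus a list of (resource_group, event)
    notifications to pass on to the rest of the hypercluster.
    """
    rule = RULES.get((platform, event), DEFAULT_RULE)
    notifications = [(group, event) for group in rule.notify_groups]
    return rule.action, notifications
```

Calling `on_exit_point("platform_a", "NODE_DOWN")` selects the first rule and yields one notification per listed resource group; an unknown event falls back to `DEFAULT_RULE`. Keying the table on the platform as well as the event is one way to accommodate heterogeneous platforms, and is an assumption of this sketch.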

40 Claims

1-32. (canceled)

33. A method for coordinating availability of data processing resources between a first cluster of nodes each controlled by a respective first cluster manager and multiple other clusters of nodes each controlled by a respective second cluster manager, the method comprising:
- receiving a disruption signal from an exit program of one of the first cluster managers, the disruption signal being representative of a disruption event associated with a specific one of the nodes of the first cluster, the disruption signal being received by a first hypercluster manager of the specific one of the nodes of the first cluster;
- deriving a local action code from a hypercluster rules list, the local action code corresponding to the disruption event and containing a cluster activation sequence for regulating the operation of the multiple other clusters of nodes;
- executing the cluster activation sequence on the nodes of the first cluster;
- requesting a universal token from the first hypercluster manager and the multiple other clusters of nodes in response to executing the cluster activation sequence on the specific one of the nodes of the first cluster; and
- transmitting the local action code to the multiple other clusters of nodes upon receipt of the universal token, each of the nodes of the multiple other clusters of nodes including a second hypercluster manager for execution of the cluster activation sequence thereon;

wherein the first cluster of nodes and the multiple other clusters of nodes each function autonomously and communicate with each other by the local action code.
34. The method of claim 33, further comprising:
- synchronizing views of the first and the multiple other clusters of nodes amongst each of the first and the multiple other clusters, view synchrony being determined by the universal token.
35. The method of claim 34, wherein deriving the local action code includes:
- translating the disruption event to a universal event code with a translation table, the translation table including a first sequence of disruption events and a second sequence of universal event codes correlated thereto.
36. The method of claim 35, wherein the universal event code is referenced to derive the local action code from the hypercluster rules list.
37. The method of claim 34, wherein the cluster activation sequence includes dependencies therebetween, the dependencies establishing the timing and order of the cluster activation sequence.
38. The method of claim 37, wherein transmitting the local action code to the active cluster manager is in response to receiving a confirmation code representative of completion of one step in the cluster activation sequence as defined by the dependencies.
39. The method of claim 34, wherein the active cluster manager is running on one of the nodes of the multiple other clusters, the cluster activation sequence being executed thereon.
40. The method of claim 33, wherein the local action code is included in a hypercluster heartbeat.
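
The flow recited in claims 33 and 35 to 37 can be sketched in Python as a single-process simulation. This is an illustration under stated assumptions, not the patented implementation: the event names, universal codes, step names, the `Token` class, and `handle_disruption` are hypothetical, the universal token is reduced to a simple holder flag, and "transmitting" the local action code is modeled by recording the steps each remote cluster would run. Dependency ordering uses the standard-library `graphlib.TopologicalSorter` (Python 3.9+).

```python
from dataclasses import dataclass
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Hypothetical translation table: platform event -> universal event code.
TRANSLATION_TABLE = {"NODE_DOWN": "U_NODE_FAIL", "APP_CRASH": "U_APP_FAIL"}


@dataclass
class ActionCode:
    name: str
    # Maps each step to the set of steps that must complete before it.
    activation_sequence: dict


# Hypothetical rules list: universal event code -> local action code.
RULES_LIST = {
    "U_NODE_FAIL": ActionCode(
        "FAILOVER",
        {"stop_app": set(), "switch_db": {"stop_app"}, "start_app": {"switch_db"}},
    ),
}


class Token:
    """Toy stand-in for the universal token: only the holder may transmit."""

    def __init__(self):
        self.holder = None

    def request(self, who):
        if self.holder is None:
            self.holder = who
        return self.holder == who

    def release(self, who):
        if self.holder == who:
            self.holder = None


def ordered_steps(action):
    """Return the activation steps in an order that respects dependencies."""
    return list(TopologicalSorter(action.activation_sequence).static_order())


def handle_disruption(event, node, token, other_clusters, log):
    """Translate, derive, execute locally, then transmit once the token is held."""
    universal = TRANSLATION_TABLE[event]      # translation step (claim 35)
    action = RULES_LIST[universal]            # rules list lookup (claim 36)
    for step in ordered_steps(action):        # local execution, ordered (claim 37)
        log.append((node, step))
    if not token.request(node):               # token needed before transmitting
        return False
    try:
        for cluster in other_clusters:        # simulated transmission and execution
            for step in ordered_steps(action):
                log.append((cluster, step))
    finally:
        token.release(node)
    return True
```

A call such as `handle_disruption("NODE_DOWN", "node1", Token(), ["clusterB"], log)` records the three steps for `node1` first and then for `clusterB`, each in dependency order. If another party already holds the token, the function returns `False` after local execution and transmits nothing; a real system would presumably wait or retry, which this sketch leaves out.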


Current Assignee
Vision Solutions Incorporated (Precisely Software Incorporated)
Original Assignee
Vision Solutions Incorporated (Precisely Software Incorporated)
Inventors
Simpson, Scott; Brown, David E.

Granted Patent

US 8,429,450 B2
US Class Current

714/4
CPC Class Codes

G06F 11/1482   by means of middleware or O...

G06F 11/2033   switching over of hardware ...

G06F 11/2035   without idle spare hardware

G06F 11/2038   with a single idle spare pr...

G06F 11/2041   with more than one idle spa...

G06F 11/2046   where the redundant compone...
