Failover system and method for cluster environment
First Claim
1. A failover method for a computer system having at least three nodes operating as a cluster, said method comprising the steps of:
- following the failure of one of said nodes, determining the weight of at least two surviving nodes;
determining which of said at least two surviving nodes has the lowest weight;
assigning applications running on said failed node to said surviving node having the lowest determined weight.
15 Assignments
0 Petitions
Accused Products
Abstract
A failover method and system is provided for a computer system having at least three nodes operating as a cluster. One method includes the steps of detecting failure of one node, determining the weight of at least two surviving nodes, and assigning a failover node based on the determined weights of the surviving nodes. Another method includes the steps detecting failure of one node and determining the time of failure, and assigning a failover node based in part on the determined time of failure. This method may also include the steps of determining a time period during which nodes in the cluster are heavily utilized, and assigning a failover node that is not heavily utilized during that time period.
156 Citations
11 Claims
-
1. A failover method for a computer system having at least three nodes operating as a cluster, said method comprising the steps of:
-
following the failure of one of said nodes, determining the weight of at least two surviving nodes;
determining which of said at least two surviving nodes has the lowest weight;
assigning applications running on said failed node to said surviving node having the lowest determined weight. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A failover method for a computer system having at least three nodes operating as a cluster, said method comprising the steps of:
-
determining the amount of resources needed by applications running on one of said nodes;
following failure of said one node, for each of surviving nodes n=1 to N until a failover node is assigned;
determining a weight of surviving node n;
determining from said weight of said surviving node n whether said surviving node n has available resources greater than that determined to be needed by said failed node; and
if said surviving node n is determined to have sufficient available resources, then assigning node n as said failover node, and failing over applications running on said failed node to said failover node, or if said surviving node n is determined not to have sufficient available resources, then n=n+1.
-
-
9. A failover method for a computer system having at least three nodes operating as a cluster, said method comprising the steps of:
-
determining a weight of each of said at least three nodes;
ordering said at least three nodes according to their respective increasing weights from lowest to highest;
creating a queue containing said ordered nodes, wherein said first node in said queue has the lowest weight; and
following failure of one of said at least three nodes, assigning said first surviving node in said queue as a failover node, and failing over applications running on said failed node to said failover node.
-
-
10. A failover method for a computer system having at least three nodes operating as a cluster, said method comprising the steps of:
-
detecting the failure of one of said nodes;
determining the weight of at least two surviving nodes, wherein for each of said at least two surviving nodes, said weight is determined by evaluating at least a performance indicator indicating current CPU utilization of the surviving node;
determining which of said at least two surviving nodes has the lowest weight; and
assigning applications running on said failed node to said node determined to have said lowest weight.
-
-
11. A failover method for a computer system having at least three nodes operating as a cluster, said method comprising the steps of:
-
for each node, determining the amount of resources needed by applications running on the node;
following failure of a node, selecting a node from among the group of surviving nodes;
determining a weight of the selected surviving node;
evaluating on the basis of the determined weight of the selected surviving node whether the selected surviving node has more available resources than the resources needed by said failed node;
if the selected surviving node n has more available resources than the resources needed by said failed node, assigning the selected surviving node as the designated failover node and migrating any applications of the failed node to the designated failover node; and
if the selected surviving node does not have more available resources than the resources needed by said failed node, selecting another surviving node from the group of surviving nodes.
-
Specification