Failover system and method for cluster environment
Abstract
A failover method and system are provided for a computer system having at least three nodes operating as a cluster. One method includes the steps of detecting failure of one node, determining the weight of at least two surviving nodes, and assigning a failover node based on the determined weights of the surviving nodes. Another method includes the steps of detecting failure of one node, determining the time of failure, and assigning a failover node based in part on the determined time of failure. This method may also include the steps of determining a time period during which nodes in the cluster are heavily utilized, and assigning a failover node that is not heavily utilized during that time period.
24 Claims
1. A failover method for a computer system having at least three nodes operating as a cluster, said method comprising the steps of:
following failure of one of said nodes, determining the weight of at least two surviving nodes;
determining which of said at least two surviving nodes has the lowest weight;
assigning applications running on said failed node to said surviving node having the lowest determined weight.
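The steps of claim 1 can be illustrated with a minimal sketch. The `Node` class, its `weight` field (assumed here to be a single number where lower means less loaded), and the function name `fail_over_to_lightest` are hypothetical names introduced for illustration only; the claim itself does not prescribe how a weight is represented.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    name: str
    weight: float  # assumed load metric: lower means less loaded
    apps: list = field(default_factory=list)


def fail_over_to_lightest(failed, survivors):
    """Move the failed node's applications to the lowest-weight survivor."""
    if len(survivors) < 2:
        raise ValueError("claim 1 compares at least two surviving nodes")
    # Determine which surviving node has the lowest weight.
    target = min(survivors, key=lambda n: n.weight)
    # Assign the failed node's applications to that node.
    target.apps.extend(failed.apps)
    failed.apps.clear()
    return target
```

In this sketch ties are broken by list order, because `min` returns the first minimal element; the claim does not say how ties should be handled.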
8. A failover method for a computer system having at least three nodes operating as a cluster, said method comprising the steps of:
determining the amount of resources needed by applications running on one of said nodes;
following failure of said one node, for each of surviving nodes n=1 to N until a failover node is assigned:
determining a weight of surviving node n;
determining from said weight of said surviving node n whether said surviving node n has available resources greater than that determined to be needed by said failed node; and
if said surviving node n is determined to have sufficient available resources, then assigning node n as said failover node, and failing over applications running on said failed node to said failover node, or if said surviving node n is determined not to have sufficient available resources, then n=n+1.
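The loop in claim 8 amounts to a first-fit search over the surviving nodes. The sketch below assumes, purely for illustration, that a node's weight is the fraction of its capacity currently in use, so that available resources equal `capacity * (1 - weight)`; the tuple layout and the name `first_fit_failover` are hypothetical.

```python
def first_fit_failover(needed, survivors):
    """Return the first survivor with enough free resources, or None.

    survivors: list of (name, weight, capacity) tuples, visited in order
    n = 1..N. The weight is assumed to be the used fraction of capacity.
    """
    for name, weight, capacity in survivors:
        # Derive available resources from the node's weight.
        available = capacity * (1.0 - weight)
        if available > needed:
            return name  # node n becomes the failover node
        # Otherwise fall through to the next node (n = n + 1).
    return None
```

Returning `None` when no node qualifies is a choice made for this sketch; the claim does not state what happens when the loop runs past node N.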
10. A failover method for a computer system having at least three nodes operating as a cluster, said method comprising the steps of:
determining a weight of each of said at least three nodes;
ordering said at least three nodes according to their respective increasing weights from lowest to highest;
creating a queue containing said ordered nodes, wherein said first node in said queue has the lowest weight; and
following failure of one of said at least three nodes, assigning said first surviving node in said queue as a failover node, and failing over applications running on said failed node to said failover node.
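Claim 10 differs from claim 1 in that the ordering is computed before any failure occurs. A minimal sketch, assuming weights are held in a dictionary keyed by node name (the function names are hypothetical):

```python
from collections import deque


def build_failover_queue(weights):
    """Order node names from lowest to highest weight."""
    return deque(sorted(weights, key=weights.get))


def pick_failover_node(queue, failed):
    """Return the first node in the queue that is not the failed node."""
    for name in queue:
        if name != failed:
            return name
    return None


queue = build_failover_queue({"a": 0.6, "b": 0.1, "c": 0.3})
print(list(queue))                    # ['b', 'c', 'a']
print(pick_failover_node(queue, "b")) # c
```

Because the queue is built ahead of time, selecting a failover node after a failure needs only a scan of the queue, at the cost of using weights that may be stale by the time the failure happens.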
11. A failover method for a computer system having at least three nodes operating as a cluster, said method comprising the steps of:
following failure of one of said at least three nodes, determining the order in which surviving nodes joined said cluster;
assigning a failover node according to the order in which said surviving nodes joined said cluster; and
failing over all applications running on said failed node to said failover node.
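Claim 11 selects by join order rather than by weight. The claim does not say whether the earliest or the most recent joiner is preferred, so the sketch below exposes that as a parameter; `failover_by_join_order` and `oldest_first` are hypothetical names.

```python
def failover_by_join_order(join_order, failed, oldest_first=True):
    """Pick a failover node from the survivors by cluster join order.

    join_order: node names listed in the order they joined the cluster.
    oldest_first: assumed policy switch; the claim leaves the direction open.
    """
    survivors = [name for name in join_order if name != failed]
    if not survivors:
        return None
    return survivors[0] if oldest_first else survivors[-1]
```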
14. A failover method for a computer system having at least three nodes operating as a cluster, said method comprising the steps of:
detecting failure of one of said at least three nodes;
determining a time of failure of said one node;
assigning a failover node taking into consideration said determined time; and
assigning applications running on said failed node to said failover node.
17. A cluster computer system comprising:
at least three nodes, wherein said at least three nodes are computer systems operating as a cluster, wherein said cluster computer system is capable of implementing a failover policy in which, following failure of one of said at least three nodes, the weight of surviving nodes is determined, and a failover node is selected based on said determined weights.
20. A cluster computer system comprising:
at least three nodes, wherein said at least three nodes are computer systems operating as a cluster, wherein said cluster computer system is capable of determining a time of failure of one of said at least three nodes, and implementing a failover policy in which, following failure of said one node, a failover node is selected based in part on said determined time.
23. A failover method for a computer system having at least three nodes operating as a cluster, said method comprising the steps of:
detecting failure of one of said nodes;
determining the weight of at least two surviving nodes, wherein for each of said at least two surviving nodes, said weight is determined by evaluating at least a performance indicator indicating current CPU utilization of the surviving node;
determining which of said at least two surviving nodes has the lowest weight; and
assigning applications running on said failed node to said node determined to have said lowest weight.
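Claim 23 requires only that the weight take current CPU utilization into account. One possible reading, sketched below, treats the weight as CPU utilization plus any other indicators summed in; this formula and the names `node_weight` and `lightest_survivor` are assumptions for illustration, not something the claim specifies.

```python
def node_weight(cpu_utilization, other_indicators=()):
    """Assumed weight formula: CPU utilization plus any extra indicators."""
    return cpu_utilization + sum(other_indicators)


def lightest_survivor(cpu_by_node, failed):
    """Return the surviving node with the lowest CPU-based weight."""
    weights = {
        name: node_weight(cpu)
        for name, cpu in cpu_by_node.items()
        if name != failed
    }
    return min(weights, key=weights.get)
```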
24. A failover method for a computer system having at least three nodes operating as a cluster, said method comprising the steps of:
determining a time period during which each of said at least three nodes are heavily utilized;
detecting failure of one of said at least three nodes;
determining a time of failure of said one node;
preventing failover of applications running on said one node to other of said at least three nodes if said determined time period for said other node includes said determined time of failure.
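The exclusion rule of claim 24 can be sketched as a filter over the surviving nodes. The sketch assumes each node has a single daily busy window given as a `(start, end)` pair of `datetime.time` values, with the end exclusive and windows allowed to wrap past midnight; these representations and the function names are hypothetical.

```python
from datetime import time


def in_window(t, start, end):
    """True if time t falls in [start, end), allowing wrap past midnight."""
    if start <= end:
        return start <= t < end
    return t >= start or t < end


def eligible_failover_nodes(busy_windows, failed, failure_time):
    """List survivors whose busy window does not include the failure time."""
    return [
        name
        for name, (start, end) in busy_windows.items()
        if name != failed and not in_window(failure_time, start, end)
    ]
```

A node that survives this filter could then be chosen by any of the weight-based or order-based policies in the other claims; the claim itself only states which nodes are ruled out.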
Specification