Failover management system
Abstract
A system is provided which includes a plurality of nodes, wherein each node has a processor executable thereon. The system also includes a first failover server group that includes a first server capable of performing a first service and a second server capable of performing the first service. The first server is executable on a first node of the plurality of nodes and the second server is executable on a second node of the plurality of nodes. The system also includes a second failover server group that includes a third server capable of performing a second service and a fourth server capable of performing the second service. The third server is executable on the first node and the fourth server is executable on one of the plurality of nodes other than the first node. The first, second, third and fourth servers can each be in one of a plurality of states including an active state and a standby state. The system also includes a failover management system that, upon determining that a failure has occurred on the first node, instructs the second server to change its state to the active state if the first server was in the active state when the failure determination occurred, and instructs the fourth server to change its state to the active state if the third server was in the active state when the failure determination occurred.
40 Claims
1. A system comprising:
a plurality of nodes, each node having a processor executable thereon;
a first server group, the first server group including a first server capable of performing a first service and a second server capable of performing the first service, the first server being in one of a plurality of states including an active state and a standby state, the second server being in one of the active state and an inactive state, the first server being executable on a first node of the plurality of nodes and the second server being executable on a second node of the plurality of nodes;
a second server group, the second server group including a third server capable of performing a second service and a fourth server capable of performing the second service, the third server being in one of the plurality of states including the active state and the standby state, the fourth server being in one of the active state and an inactive state, the third server being executable on the first node, the fourth server being executable on one of the plurality of nodes other than the first node;
a failover management system, the failover management system, upon determining that a failure has occurred on the first node, instructing the second server to change its state to the active state if the first server was in the active state when the failure determination occurred, and instructing the fourth server to change its state to the active state if the third server was in the active state when the failure determination occurred.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 18, 19, 20, 21, 26, 29, 30, 31, 32, 33, 34, 36, 37, 38, 39
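The node-failure behavior recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation: the `Server` record, the `handle_node_failure` function, and the node and group names are all hypothetical, and the backup servers are modeled as starting in the standby state, which is an assumption.

```python
from dataclasses import dataclass
from enum import Enum


class State(Enum):
    ACTIVE = "active"
    STANDBY = "standby"


@dataclass
class Server:
    name: str    # hypothetical label such as "first" or "second"
    node: str    # node the server is executable on
    group: str   # server group the server belongs to
    state: State


def handle_node_failure(servers, failed_node):
    """For each server that was active on failed_node, activate one
    server of the same group that runs on a different node."""
    promoted = []
    for failed in servers:
        if failed.node != failed_node or failed.state is not State.ACTIVE:
            continue  # only active servers on the failed node trigger failover
        for peer in servers:
            if peer.group == failed.group and peer.node != failed_node:
                peer.state = State.ACTIVE
                promoted.append(peer.name)
                break
    return promoted


servers = [
    Server("first", "node1", "group1", State.ACTIVE),
    Server("second", "node2", "group1", State.STANDBY),
    Server("third", "node1", "group2", State.ACTIVE),
    Server("fourth", "node3", "group2", State.STANDBY),
]
promoted = handle_node_failure(servers, "node1")
```

Because the first and third servers share the first node, a single node failure here promotes a peer in both server groups.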
17. A failover management process executable on a node that includes a first server in a first server group and a second server in a second server group, the failover management process comprising the steps of:
determining a current state of the first server and a current state of the second server, the current state of each server being one of a plurality of states including an active state, a standby state, and a failed state;
monitoring a current state of a third server on a remote node, the third server being one of the servers in the first server group, the current state of the third server being one of the plurality of states including the active state, the standby state, and the failed state;
monitoring a current state of a fourth server on a remote node, the fourth server being one of the servers in the second server group, the current state of the fourth server being one of the plurality of states including the active state, the standby state, and the failed state;
notifying a process on the remote node executing the third server and a process on the remote node executing the fourth server of changes in the current state of the first server and the second server;
if the current state of the first server is the standby state, and the current state of the third server is the failed state, changing the current state of the first server to the active state;
if the current state of the second server is the standby state, and the current state of the fourth server is the failed state, changing the current state of the second server to the active state.
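The steps of claim 17 can be sketched as a small per-node process. This is only an illustration under stated assumptions: the `LocalFailoverProcess` class, its method names, and the `notify` callback (a stand-in for whatever messaging reaches the remote nodes) are hypothetical and do not come from the patent.

```python
from enum import Enum


class State(Enum):
    ACTIVE = "active"
    STANDBY = "standby"
    FAILED = "failed"


class LocalFailoverProcess:
    def __init__(self, local_states, peer_of, notify):
        # local_states: {local server name: State}
        # peer_of: {local server name: remote server name in the same group}
        # notify: callback(server_name, new_state), assumed to reach remote nodes
        self.local_states = dict(local_states)
        self.peer_of = dict(peer_of)
        self.remote_states = {}
        self.notify = notify

    def on_remote_state(self, remote_name, state):
        """Record a monitored remote state, then re-check local servers."""
        self.remote_states[remote_name] = state
        self.evaluate()

    def evaluate(self):
        """Activate each local standby server whose remote peer has failed."""
        for name, state in self.local_states.items():
            peer_state = self.remote_states.get(self.peer_of[name])
            if state is State.STANDBY and peer_state is State.FAILED:
                self.local_states[name] = State.ACTIVE
                self.notify(name, State.ACTIVE)  # report the state change


sent = []
proc = LocalFailoverProcess(
    {"first": State.STANDBY, "second": State.STANDBY},
    {"first": "third", "second": "fourth"},
    lambda name, state: sent.append((name, state)),
)
proc.on_remote_state("third", State.FAILED)
proc.on_remote_state("fourth", State.ACTIVE)
```

In this run only the first server is activated, since its remote peer (the third server) failed while the fourth server remained active.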
22. A system comprising:
a plurality of nodes, each node having a processor executable thereon;
a first server group, the first server group including a first server capable of performing a first service and a second server capable of performing the first service, the first server being in one of a plurality of states including an active state and a standby state, the second server being in one of the active state and an inactive state, the first server being executable on a first node of the plurality of nodes and the second server being executable on a second node of the plurality of nodes;
a second server group, the second server group including a third server capable of performing a second service and a fourth server capable of performing the second service, the third server being in one of the plurality of states including the active state and the standby state, the fourth server being in one of the active state and an inactive state, the third server being executable on the first node, the fourth server being executable on a third node of the plurality of nodes;
a failover management system, the failover management system, upon determining that a failure has occurred on the first server but not on the third server, instructing the second server to change its state to the active state if the first server was in the active state when the failure determination occurred, the fourth server remaining in a standby state if the third server was in the active state when the failure determination occurred.
Dependent claims: 24, 25, 27, 28
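Claim 22 differs from claim 1 in that the failure is confined to one server rather than the whole node, so only that server's group fails over. A minimal sketch of that distinction follows; the `handle_server_failure` function and the group layout are hypothetical, and marking the failed server with a `"failed"` state is an assumption not recited in this claim.

```python
ACTIVE, STANDBY, FAILED = "active", "standby", "failed"


def handle_server_failure(groups, failed_server):
    """groups maps group name -> {server name: state}.
    Promote a standby server only within the failed server's own group."""
    for members in groups.values():
        if failed_server not in members:
            continue  # other groups are untouched, even if they share the node
        was_active = members[failed_server] == ACTIVE
        members[failed_server] = FAILED
        if was_active:
            for name, state in members.items():
                if state == STANDBY:
                    members[name] = ACTIVE
                    return name
    return None


groups = {
    "group1": {"first": ACTIVE, "second": STANDBY},
    "group2": {"third": ACTIVE, "fourth": STANDBY},
}
promoted = handle_server_failure(groups, "first")
```

Here the third server keeps running on the first node, so the fourth server stays in the standby state.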
23. A failover management system, comprising:
a global failover controller executable on a first node of a plurality of nodes;
a first server group, the first server group including a first server capable of performing a first service and a second server capable of performing the first service, the first server being in one of a plurality of states including an active state and a standby state, the second server being in one of the active state and an inactive state, the first server being executable on a second node of the plurality of nodes and the second server being executable on a third node of the plurality of nodes;
a second server group, the second server group including a third server capable of performing a second service and a fourth server capable of performing the second service, the third server being in one of the plurality of states including the active state and the standby state, the fourth server being in one of the active state and an inactive state, the third server being executable on the first node, the fourth server being executable on a node other than the second node and the third node of the plurality of nodes;
a first local failover controller executable on the second node, and a second local failover controller executable on the third node, the first local failover controller notifying the global failover controller of a current state of the first server and the third server, the second local failover controller notifying the global failover controller of a current state of the second server;
the global failover controller notifying the first local failover controller of the current state of the second server and the fourth server and notifying the second local failover controller of a current state of the first server;
the first local failover controller, upon receiving notification that the second server is in an inactive state, instructing the first server to change its state to the active state if the first server was in an inactive state when the notification was received, the second local failover controller, upon receiving notification that the first server is in an inactive state, instructing the second server to change its state to the active state if the second server was in an inactive state when the notification was received, the first local failover controller, upon receiving notification that the fourth server is in an inactive state, instructing the third server to change its state to the active state if the fourth server was in an inactive state when the notification was received.
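The relay pattern in claim 23, where local failover controllers report to a global failover controller that forwards peer states, might be sketched as follows. The class and method names are hypothetical, only the first server group is modeled, and message passing is reduced to direct method calls, so this is a simplification rather than the claimed system.

```python
ACTIVE, INACTIVE = "active", "inactive"


class GlobalController:
    """Relays each reported server state to every other local controller."""

    def __init__(self):
        self.locals = []

    def register(self, local):
        self.locals.append(local)

    def report(self, sender, server, state):
        for local in self.locals:
            if local is not sender:
                local.on_peer_state(server, state)


class LocalController:
    def __init__(self, global_ctrl, servers, peers):
        # servers: {local server name: state}
        # peers: {remote server name: local server name in the same group}
        self.global_ctrl = global_ctrl
        self.servers = dict(servers)
        self.peers = dict(peers)
        global_ctrl.register(self)

    def set_state(self, server, state):
        self.servers[server] = state
        self.global_ctrl.report(self, server, state)

    def on_peer_state(self, remote, state):
        local = self.peers.get(remote)
        # Activate the local server only if both it and its peer are inactive.
        if local is not None and state == INACTIVE and self.servers[local] == INACTIVE:
            self.set_state(local, ACTIVE)


glob = GlobalController()
ctrl2 = LocalController(glob, {"first": INACTIVE}, {"second": "first"})
ctrl3 = LocalController(glob, {"second": ACTIVE}, {"first": "second"})
ctrl3.set_state("second", INACTIVE)  # the second server leaves the active state
```

After the last call the global controller forwards the change to the controller on the second node, which activates the first server.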
35. A computer readable medium, having stored thereon, computer executable process steps that are executable on a node that includes a first server in a first server group and a second server in a second server group, the computer executable process steps comprising:
determining a current state of the first server and a current state of the second server, the current state of each server being one of a plurality of states including an active state, a standby state, and a failed state;
monitoring a current state of a third server on a remote node, the third server being one of the servers in the first server group, the current state of the third server being one of the plurality of states including the active state, the standby state, and the failed state;
monitoring a current state of a fourth server on a remote node, the fourth server being one of the servers in the second server group, the current state of the fourth server being one of the plurality of states including the active state, the standby state, and the failed state;
notifying a process on the remote node executing the third server and a process on the remote node executing the fourth server of changes in the current state of the first server and the second server;
if the current state of the first server is the standby state, and the current state of the third server is the failed state, changing the current state of the first server to the active state;
if the current state of the second server is the standby state, and the current state of the fourth server is the failed state, changing the current state of the second server to the active state.
40. A system comprising:
a plurality of nodes, each node having a processor executable thereon;
a first server group, the first server group including a first server capable of performing a first service and a second server capable of performing the first service, the first server being in one of a plurality of states including an active state, a standby state, an offline state, an initialized state and a failed state, the second server being in one of the active state, the standby state, the offline state, the initialized state, and the failed state, the first server being executable on a first node of the plurality of nodes and the second server being executable on a second node of the plurality of nodes;
a second server group, the second server group including a third server capable of performing a second service and a fourth server capable of performing the second service, the third server being in one of the plurality of states including the active state, the standby state, the offline state, the initialized state and the failed state, the fourth server being in one of the active state, the standby state, the failed state, the initialized state and the offline state, the third server being executable on the first node, the fourth server being executable on one of the plurality of nodes other than the first node;
a failover management system, the failover management system, upon determining that a failure has occurred on the first node, instructing the second server to change its state to the active state if the first server was in the active state when the failure determination occurred and if the second server was not in one of the failed state, the initialized state, and the offline state, and instructing the fourth server to change its state to the active state if the third server was in the active state when the failure determination occurred, and if the fourth server was not in one of the failed state, the initialized state, and the offline state.
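The additional condition in claim 40, that a backup server is activated only if it is not failed, initialized, or offline, reduces to a simple guard. The sketch below uses hypothetical names (`State`, `INELIGIBLE`, `should_promote`) and is meant only to make the two-part test concrete.

```python
from enum import Enum


class State(Enum):
    ACTIVE = "active"
    STANDBY = "standby"
    OFFLINE = "offline"
    INITIALIZED = "initialized"
    FAILED = "failed"


# States in which a backup server is not instructed to become active.
INELIGIBLE = {State.FAILED, State.INITIALIZED, State.OFFLINE}


def should_promote(primary_state_at_failure, backup_state):
    """True when the primary was active at the failure determination
    and the backup is not failed, initialized, or offline."""
    return (
        primary_state_at_failure is State.ACTIVE
        and backup_state not in INELIGIBLE
    )


promote_standby = should_promote(State.ACTIVE, State.STANDBY)
promote_offline = should_promote(State.ACTIVE, State.OFFLINE)
```

The same guard would be evaluated once per server group, for example for the second server against the first and for the fourth server against the third.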
Specification