Two node virtual shared disk cluster recovery
First Claim
1. A method for recovery in a two-node data processing system wherein each node is a primary server for a first disk drive and for which there is provided shared access to a second disk for which the other node is a primary server and wherein each node also includes a direct connection to the shared disk for which the other node is the primary server, said method comprising the steps of:
- receiving, at a first node, notification of communication failure with said second node;
determining if said first node has access to the disk for which said first node is the primary server;
shutting down recovery at said first node if said access is not present, but if it is present, accessing the disk for which said second node is the primary server via said hardware connection and waiting for a period of time sufficient to assure that recovery processes at said other node have completed past the same point as said first node;
determining if said first node still has access to the disk for which it is the primary server and if said first node still has access, taking over control of the second node'"'"'s disk; and
if said first node doesn'"'"'t have said access per said immediately preceding determining step, comparing node numbers to decide which node controls the other node'"'"'s disks.
2 Assignments
0 Petitions
Accused Products
Abstract
A method for recovery in a two-node data processing system is provided wherein each node is a primary server for a first nonvolatile storage device and for which there is provided shared access to a second nonvolatile storage device for which the other node is a primary server and wherein each node also includes a direct connection to the shared nonvolatile storage device for which the other node is the primary server. Upon notification of failure, the method operates by first confirming continued access by each node to the nonvolatile storage device for which it is the primary server and then by attempting to access the shared nonvolatile storage device via the direct connection and by waiting for a time sufficient for the same process to be carried out by the other node. If access to the shared nonvolatile storage device is successful, the node takes control of both nonvolatile storage devices. If the access is not successful a comparison of node numbers is carried out to decide the issue of control. Whenever a node determines that it does not have access to the storage device for which it is the primary server, it shuts down recovery at the node.
-
Citations
11 Claims
-
1. A method for recovery in a two-node data processing system wherein each node is a primary server for a first disk drive and for which there is provided shared access to a second disk for which the other node is a primary server and wherein each node also includes a direct connection to the shared disk for which the other node is the primary server, said method comprising the steps of:
-
receiving, at a first node, notification of communication failure with said second node;
determining if said first node has access to the disk for which said first node is the primary server;
shutting down recovery at said first node if said access is not present, but if it is present, accessing the disk for which said second node is the primary server via said hardware connection and waiting for a period of time sufficient to assure that recovery processes at said other node have completed past the same point as said first node;
determining if said first node still has access to the disk for which it is the primary server and if said first node still has access, taking over control of the second node'"'"'s disk; and
if said first node doesn'"'"'t have said access per said immediately preceding determining step, comparing node numbers to decide which node controls the other node'"'"'s disks. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A method for recovery in a two-node data processing system wherein each node is a primary server for a first disk drive and for which there is provided shared access to a second disk for which the other node is a primary server and wherein each node also includes a direct connection to the shared disk for which the other node is the primary server, said method comprising the steps of:
upon notification of failure, confirming continued access by each node to the disk for which it is the primary server and accessing said shared disk via said direct connection and waiting for a time sufficient for the same process to be carried out by the other node, and if access to said shared disk is successful, taking control of said shared disk via said direct connection. - View Dependent Claims (7, 8)
-
9. A method for recovery in a two-node data processing system wherein each node is a primary server for a first nonvolatile storage device and for which there is provided shared access to a second nonvolatile storage device for which the other node is a primary server and wherein each node also includes a direct connection to the shared nonvolatile storage device for which the other node is the primary server, said method comprising the steps of:
upon notification of failure, confirming continued access by each node to the nonvolatile storage device for which it is the primary server and accessing said shared nonvolatile storage device via said direct connection and waiting for a time sufficient for the same process to be carried out by the other node, and if access to said shared nonvolatile storage device is successful, taking control of said shared nonvolatile storage device via said direct connection.
-
10. A computer apparatus comprising:
-
a two-node data processing system wherein each node is a primary server for a first nonvolatile storage device and for which there is provided shared access to a second nonvolatile storage device for which the other node is a primary server and wherein each node also includes a direct connection to the shared nonvolatile storage device for which the other node is the primary server; and
program means stored within each of said nodes to carry out the steps of, upon notification of failure, confirming continued access by each node to the nonvolatile storage device for which it is the primary server and accessing a shared nonvolatile storage device via a direct connection and waiting for a time sufficient for the same process to be carried out by the other node, and if access to said shared nonvolatile storage device is successful, taking control of said shared nonvolatile storage device via said direct connection.
-
-
11. A computer program product, for recovery operations in the event of node failure in a two node data processing system, stored on a machine readable medium having program means thereon for, upon notification of failure, confirming continued access by each node to a nonvolatile storage device for which it is the primary server and accessing a shared nonvolatile storage device via said direct connection and waiting for a time sufficient for the same process to be carried out by the other node, and if access to said shared nonvolatile storage device is successful, taking control of said shared nonvolatile storage device via said direct connection.
Specification