System and method for handling multi-node failures in a disaster recovery cluster
First Claim
Patent Images
1. A method comprising:
- determining that a candidate node is not available for a switchover operation to takeover for a failed node of a plurality of nodes;
identifying an alternate node for the switchover operation based upon a determination that the alternate node has resources capable of handling a load of the failed node and that an object limit option for the alternate node is disabled;
performing the switchover operation to directly transfer ownership of a storage object of the failed node to the alternate node without transferring ownership of the storage object to the candidate node;
recovering data from a non-volatile memory to the storage object; and
bringing the storage object online.
1 Assignment
0 Petitions
Accused Products
Abstract
A system and method for handling multi-node failures in a disaster recovery cluster is provided. In the event of an error condition, a switchover operation occurs from the failed nodes to one or more surviving nodes. Data stored in non-volatile random access memory is recovered by the surviving nodes to bring storage objects, e.g., disks, aggregates and/or volumes into a consistent state.
-
Citations
20 Claims
-
1. A method comprising:
-
determining that a candidate node is not available for a switchover operation to takeover for a failed node of a plurality of nodes; identifying an alternate node for the switchover operation based upon a determination that the alternate node has resources capable of handling a load of the failed node and that an object limit option for the alternate node is disabled; performing the switchover operation to directly transfer ownership of a storage object of the failed node to the alternate node without transferring ownership of the storage object to the candidate node; recovering data from a non-volatile memory to the storage object; and bringing the storage object online. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A computing device comprising:
-
a memory containing computer readable medium having stored thereon instructions for performing a method; and a processor coupled with the memory, the processor configured to execute the instructions to cause the processor to; determine that a candidate node is not available for a switchover operation to takeover for a failed node of a plurality of nodes; identify an alternate node for the switchover operation based upon a determination that the alternate node has resources capable of handling a load of the failed node and that an object limit option for the alternate node is disabled; perform the switchover operation to directly transfer ownership of a storage object of the failed node to the alternate node without transferring ownership of the storage object to the candidate node; recover data from a non-volatile memory to the storage object; and bring the storage object online. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A computer readable medium comprising program instructions, that when executed by a processor, cause the processor to:
-
determine that a candidate node is not available for a switchover operation to takeover for a failed node of a plurality of nodes; identify an alternate node for the switchover operation based upon a determination that the alternate node has resources capable of handling a load of the failed node and that an object limit option for the alternate node is disabled; perform the switchover operation to directly transfer ownership of a storage object of the failed node to the alternate node without transferring ownership of the storage object to the candidate node; recover data from a non-volatile memory to the storage object; and bring the storage object online. - View Dependent Claims (20)
-
Specification