Failure hierarchy in a cluster filesystem
First Claim
1. A method of responding to node failure in a cluster of computer system nodes sharing direct read/write access to storage devices via a storage area network, comprising:
- defining an order of procedures to be attempted upon detection of a node failure; and
executing one procedure at a time in the order defined, until successful completion of the procedure.
4 Assignments
0 Petitions
Accused Products
Abstract
A cluster of computer system nodes share direct read/write access to storage devices via a storage area network using a cluster filesystem. In response to the failure of a node, a pre-defined order of procedures is attempted, executing one procedure at a time in the order defined, until successful completion of one of the procedures. Preferably the order is based on input from a system administrator, or a default order when no input has been provided by the system administrator. The procedures may include hardware reset of a failed node, disabling access by the failed node to the storage devices shared by the nodes in the cluster, terminating shared filesystem services on the failed node and terminating shared filesystem services on all of the nodes in the cluster.
96 Citations
4 Claims
-
1. A method of responding to node failure in a cluster of computer system nodes sharing direct read/write access to storage devices via a storage area network, comprising:
-
defining an order of procedures to be attempted upon detection of a node failure; and
executing one procedure at a time in the order defined, until successful completion of the procedure. - View Dependent Claims (2, 3, 4)
-
Specification