Negotiated graceful takeover in a node cluster
First Claim
1. A method for use in a negotiated graceful takeover in a computer cluster having a first and second computer, the method comprising the steps of:
- detecting an operational fault at the first computer;
requesting, from the first computer, in response to the operational fault, that the second computer take over for the first computer;
requesting, from the second computer, that the first computer shut down;
completing service requests at the first computer pending at the time the first computer was requested to shut down;
transferring responsibilities of the first computer to the second computer; and
shutting down the first computer.
2 Assignments
0 Petitions
Accused Products
Abstract
The invention is a method for operating a file server system in a cluster mode that provides for relatively rapid and reliable takeover of a failed file server in the cluster by a partner file server when the failed file server has detected a fault that will cause it to shut down. This is done by the failed file server requesting the one file server to take over its file services after detecting the fault in its operations, the one file server letting the failed file server complete existing file service requests from clients, refusing further file service requests addressed to the failed file server, and the one file server taking over by transferring file service requests to the one file server. As part of this takeover the one file server takes on the identity of the failed filer and activates network interfaces and network addresses that replicate the failed server'"'"'s network addresses.
161 Citations
44 Claims
-
1. A method for use in a negotiated graceful takeover in a computer cluster having a first and second computer, the method comprising the steps of:
-
detecting an operational fault at the first computer;
requesting, from the first computer, in response to the operational fault, that the second computer take over for the first computer;
requesting, from the second computer, that the first computer shut down;
completing service requests at the first computer pending at the time the first computer was requested to shut down;
transferring responsibilities of the first computer to the second computer; and
shutting down the first computer. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21)
-
-
22. A storage system capable of performing a negotiated graceful takeover, the storage system comprising:
-
a first computer;
a second computer;
a first processor for the first computer to i) detect an operational fault at the first computer, and ii) request, in response to the operational fault, that the second computer take over for the first computer, and a second processor for the second computer to i) request that the first computer shut down, ii) allow the first computer to complete service requests pending at the time the first computer was requested to shut down, iii) take over any responsibilities of the first computer, and iv) allow the first computer to shut down. - View Dependent Claims (23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42)
-
-
43. A storage system capable of performing a negotiated graceful takeover, the storage system comprising:
-
a first computer;
a second computer;
means for detecting an operational fault at the first computer;
means for requesting, from the first computer, in response to the operational fault, that the second computer take over for the first computer;
means for requesting, from the second computer, that the first computer shut down;
means for completing service requests at the first computer pending at the time the first computer was requested to shut down;
means for transferring responsibilities of the first computer to the second computer; and
means for shutting down the first computer.
-
-
44. A computer readable media, comprising:
- the computer readable media containing instructions for execution in a processor for the method of,
detecting an operational fault at a first computer;
requesting, from the first computer, in response to the operational fault, that a second computer take over for the first computer;
requesting, from the second computer, that the first computer shut down;
completing service requests at the first computer pending at the time the first computer was requested to shut down;
transferring responsibilities of the first computer to the second computer; and
shutting down the first computer.
- the computer readable media containing instructions for execution in a processor for the method of,
Specification