System and method for maintaining data integrity in a cluster network
First Claim
Patent Images
1. A method for failure recovery in a network, comprising the steps of:
- identifying a failed storage enclosure of the network;
identifying a logical storage unit owned by a first server node of the network;
identifying the storage drives of the logical unit that are accessible by the first server node;
determining whether the storage drives of the logical unit that are accessible by the first server node comprise an operational set of data;
if the storage drives accessible by the first storage node do not comprise an operational set of data, identifying the logical unit to an alternate server node;
determining whether the alternate server node can access a set of storage drives of the logical unit that include an operational set of data; and
transferring ownership of the logical unit to the alternate server node if the alternate server node can access a set of storage drives that include an operational set of data of the logical unit.
1 Assignment
0 Petitions
Accused Products
Abstract
A system and method for failure recovery and failure management in a cluster network is disclosed. Following a failure of a storage enclosure or a communication link failure between storage enclosures, each server node of the network determines whether the server node can access the drives of each logical unit owned by the server node. If the server node cannot access a set of drives of the logical that include an operational set of data, an alternate server node is queried to determine if the alternate server node can access the a set of drives of the logical unit that include an operational set of data.
8 Citations
20 Claims
-
1. A method for failure recovery in a network, comprising the steps of:
-
identifying a failed storage enclosure of the network;
identifying a logical storage unit owned by a first server node of the network;
identifying the storage drives of the logical unit that are accessible by the first server node;
determining whether the storage drives of the logical unit that are accessible by the first server node comprise an operational set of data;
if the storage drives accessible by the first storage node do not comprise an operational set of data, identifying the logical unit to an alternate server node;
determining whether the alternate server node can access a set of storage drives of the logical unit that include an operational set of data; and
transferring ownership of the logical unit to the alternate server node if the alternate server node can access a set of storage drives that include an operational set of data of the logical unit. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A network, comprising:
-
a first server node;
a second server node;
a first storage enclosure coupled to the first server node, wherein the first storage enclosure includes a plurality of storage drives;
a second storage enclosure coupled to the second server node, wherein the second storage enclosure includes a plurality of storage drives;
an intermediate storage enclosure positioned communicatively between the first storage enclosure and the second storage enclosure such that the intermediate storage enclosure is communicatively coupled to the first storage enclosure and the second storage enclosure, wherein the intermediate storage enclosure includes a plurality of storage drives and wherein each storage drive is accessible to the first server node and the second server node;
wherein the each of the server nodes have logical ownership over one or more logical units comprised of storage drives of the storage enclosures;
wherein, in the event of a failure of a storage enclosure, each server node is operable to, evaluate, for each logical unit owned by each server node, whether the server node has access to storage drives of the logical unit that comprise an operational set of data; and
for each logical unit owned by the server node, transferring ownership of the logical unit to the other server node if the server node does not have access to storage drives of the logical unit that comprise an operational set of data and if the other server node does have access to storage drives of the logical unit that comprise an operational set of data. - View Dependent Claims (10, 11, 12, 13, 14)
-
-
15. A method for failure recovery in a network, wherein the network comprises first and second server nodes communicatively coupled to a set of multiple storage enclosures, wherein each of the storage enclosures includes multiple storage drives logically organized into logical storage units, comprising the steps of:
-
identifying a failed storage enclosure of the network;
identifying a logical storage unit owned by the first server node of the network;
identifying the storage drives of the logical unit that are accessible by the first server node;
determining whether the storage drives of the logical unit that are accessible by the first server node comprise an operational set of data;
if the storage drives accessible by the first storage node do not comprise an operational set of data, identifying the logical unit to the second server node;
determining whether the second server node can access a set of storage drives of the logical unit that include an operational set of data; and
transferring ownership of the logical unit to the second server node if the second server node can access a set of storage drives of the logical unit that include an operational set of data. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification