Virtual shared disks with application transparent recovery

US 5,668,943 A
Filed: 05/24/1996
Issued: 09/16/1997
Est. Priority Date: 10/31/1994
Status: Expired due to Term

First Claim

Patent Images

1. A clustered multi-processing system comprising:

at least three interconnected nodes wherein less than all nodes are server nodes, each node including a memory;

a multi-ported disk having at least a primary tail physically attached to a primary server node and a secondary tail physically attached to a secondary server node;

a disk access request mechanism, coupled to the nodes, for communicating a disk access request from an originating node to a server node physically attached to the disk along one of at least a primary disk access path and a secondary disk access path defined between the originating node, the server nodes and the disk;

a failure detection mechanism, coupled to the nodes, for detecting failures along one of the primary disk access path and the secondary disk access path; and

,proxy logic stored in the memory on each of the nodes and coupled to the failure detection mechanism, for redirecting subsequent disk access requests along a non-failing disk access path to the disk, when a failure is detected;

said proxy logic comprising a two-phase commit protocol including;

a coordinator node being adapted for broadcasting a suspend message to participant nodes to suspend access to a failed disk access path and waiting for an acknowledge message from all participant nodes;

each participant node receiving the suspend message being adapted for suspending said access to the failed disk access path, sending the acknowledge message to the coordinator node confirming suspension of said access to the failed disk access path, and waiting for a resume message from the coordinator node;

the coordinator node being further adapted for sending the resume message upon receipt of the acknowledge message from said all participant nodes; and

said each participant node being further adapted for redirecting said subsequent disk access requests along the non-failing disk access path to the disk, upon receipt of the resume message.

View all claims

0 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A system and method for recovering from failures in the disk access path of a clustered computing system. Each node of the clustered computing system is provided with proxy software for handling physical disk access requests from applications executing on the node and for directing the disk access requests to an appropriate server to which the disk is physically attached. The proxy software on each node maintains state information for all pending requests originating from that node. In response to detection of a failure along the disk access path, the proxy software on all of the nodes directs all further requests for disk access to a secondary node physically attached to the same disk.

285 Citations

9 Claims

1. A clustered multi-processing system comprising:
- at least three interconnected nodes wherein less than all nodes are server nodes, each node including a memory;
  
  a multi-ported disk having at least a primary tail physically attached to a primary server node and a secondary tail physically attached to a secondary server node;
  
  a disk access request mechanism, coupled to the nodes, for communicating a disk access request from an originating node to a server node physically attached to the disk along one of at least a primary disk access path and a secondary disk access path defined between the originating node, the server nodes and the disk;
  
  a failure detection mechanism, coupled to the nodes, for detecting failures along one of the primary disk access path and the secondary disk access path; and
  
  ,proxy logic stored in the memory on each of the nodes and coupled to the failure detection mechanism, for redirecting subsequent disk access requests along a non-failing disk access path to the disk, when a failure is detected;
  
  said proxy logic comprising a two-phase commit protocol including;
  
  a coordinator node being adapted for broadcasting a suspend message to participant nodes to suspend access to a failed disk access path and waiting for an acknowledge message from all participant nodes;
  
  each participant node receiving the suspend message being adapted for suspending said access to the failed disk access path, sending the acknowledge message to the coordinator node confirming suspension of said access to the failed disk access path, and waiting for a resume message from the coordinator node;
  
  the coordinator node being further adapted for sending the resume message upon receipt of the acknowledge message from said all participant nodes; and
  
  said each participant node being further adapted for redirecting said subsequent disk access requests along the non-failing disk access path to the disk, upon receipt of the resume message.
- View Dependent Claims (2, 3)
- - 2. The system of claim 1 comprising a queue, in each of the nodes, for storing incoming access requests to the disk;
    - and, means for rerouting the requests in the queue to the disk by way of the non-failing disk access path.
  - 3. The system of claim 1 wherein all server nodes include a disk adapter coupled to the disk and wherein the failure detection mechanism includes means for detecting failures in any of the server nodes and in the disk adapter.

4. A method for recovering from failures along a disk access path in a clustered computing system which includes at least three interconnected nodes wherein less than all nodes are server nodes and wherein each node includes a memory, and a multi-ported disk having at least a primary tail physically attached to a primary server node and a secondary tail physically attached to a secondary server node, comprising the steps of:
- detecting a failure in a disk access path in the clustered computing system, wherein a failed access path is associated with the primary tail;
  
  upon detection of the failure, a coordinator node broadcasting a message to all nodes of the system having access to the disk;
  
  each node receiving the message suspending access to the disk and acknowledging suspension of the access to the disk to the coordinator node;
  
  the coordinator node broadcasting a second message to the nodes to resume the access to the disk, responsive to said step of acknowledging suspension of the access to the disk; and
  
  each node receiving the second message, resuming the access to the disk by the secondary tail.
- View Dependent Claims (5, 6, 7, 8)
- - 5. A method for recovering from failures along a disk access path as claimed in claim 4, comprising the steps of:
    - in response to the message, saving pending requests that had been sent to the disk along the failed access path and saving requests that arrive while the access to the disk is suspended; and
      
      re-issuing all of the requests to the secondary server, responsive to said step of resuming the access to the disk at each node.
  - 6. A method as claimed in claim 5 wherein said all requests are saved in a queue, wherein said step of re-issuing further comprises the step of rerouting the requests in the queue to the disk by way of the secondary server node.
  - 7. A method for recovering from failures along a disk access path as claimed in claim 4 wherein all server nodes include a disk adapter and wherein the disk access path includes the disk adapter and the server nodes and wherein said step of detecting a failure comprises the step of detecting the failure in at least one of the nodes or the disk adapters.
  - 8. A method for recovering from failures along a disk access path as claimed in claim 4, comprising the step of designating a backup coordinator node to perform the coordination of recovery if the coordinator node fails.

9. A clustered multi-processing system comprising:
- at least three interconnected nodes wherein less than all nodes are server nodes, each node including a memory;
  
  a multi-ported disk having at least a primary tail physically attached to a primary server node and a secondary tail physically attached to a secondary server node;
  
  a disk access request mechanism, coupled to the nodes, for communicating a disk access request from an originating node to a server node physically attached to the disk along one of at least a primary disk access path and a secondary disk access path defined between the originating node, the server nodes and the disk;
  
  a failure detection mechanism, coupled to the nodes, for detecting failures along one of the primary disk access path and the secondary disk access path; and
  
  ,proxy logic means, coupled to the failure detection mechanism, for redirecting subsequent disk access requests via a two-phase commit protocol along a non-failing disk access path to the disk, when a failure is detected.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
International Business Machines Corporation
Original Assignee
International Business Machines Corporation
Inventors
Polyzois, Christos Alkiviadis, Butrico, Maria Angela, Peterson, James Lyle, Smith, Stephen Edwin, Attanasio, Clement Richard
Primary Examiner(s)
Beausoliel, Jr., Robert W.
Assistant Examiner(s)
Palys, Joseph E.

Application Number

US08/653,098
Time in Patent Office

480 Days
Field of Search

395/182.02, 395/182.03, 395/182.04, 395/182.05, 395/183.18, 395/183.19, 395/182.08, 395/200.08, 395/182.13
US Class Current

714/4.2
CPC Class Codes

G06F 11/1423   by reconfiguration of paths

G06F 11/1479   Generic software techniques...

G06F 11/2033   switching over of hardware ...

G06F 11/2035   without idle spare hardware

G06F 11/2046   where the redundant compone...

G06F 11/2092   Techniques of failing over ...

G06F 2201/815   Virtual middleware or OS fu...

Virtual shared disks with application transparent recovery

First Claim

0 Assignments

0 Petitions

Accused Products

Abstract

285 Citations

9 Claims

Specification

Solutions

Use Cases

Quick Links

Virtual shared disks with application transparent recovery

First Claim

0 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

285 Citations

9 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links