Enhancing reliability and robustness of a cluster
0 Assignments
0 Petitions
Accused Products
Abstract
A cluster having a host connected thereto via a cluster interconnection fabric where a determination is made as to whether an error condition exists in an I/O controller connected to the host via the cluster interconnection fabric. This is achieved by attempting to communicate with the I/O controller a first predetermined time period after an inquiry by an operating system as to whether or not an I/O controller driver stack should be unloaded. Also, the operating system is commanded to unload the I/O controller driver stack upon a determination that the error condition still exists.
-
Citations
38 Claims
-
1-18. -18. (Canceled).
-
19. A method, comprising:
-
receiving a request from an operating system (OS) to unload an Input/Output (I/O) controller driver stack; and
delaying a response to the OS after determining the request is associated with a transient communication error between a host and an I/O controller, which is recoverable after a predetermined period of time. - View Dependent Claims (20, 21, 22, 23, 24, 25)
-
-
26. A system, comprising:
-
an Input/Output (I/O) controller driver stack; and
a control driver, wherein the control driver monitors communication errors between a host and an I/O controller, and if a communication error is transient the control driver delays responding to an Operating System (OS) request to unload the I/O controller driver stack from the host for a predetermined period of time, during which if communication is re-established between the host and the I/O controller the OS request is denied. - View Dependent Claims (27, 28, 29, 30, 31, 32)
-
-
33. A media having instructions thereon, and the instructions when accessed result in a machine which performs a method, the method, comprising:
-
receiving from an Operating System (OS) a request to unload an Input/Output (I/O) controller driver stack from memory of a host;
determining the request is associated with a transient error condition associated with a temporarily loss of communications between the host and an I/O controller; and
delaying a response to the OS for at least a predetermined period of time to determine if the loss of communications is resolved and if resolved, denying the request of the OS. - View Dependent Claims (34, 35, 36, 37, 38)
-
Specification