Enhancing reliability and robustness of a cluster
First Claim
Patent Images
1. A method, comprising:
- receiving a request from an operating system (OS) to unload an Input/Output (I/O) controller driver stack; and
delaying a response to the OS after determining the request is associated with a transient communication error between a host and an I/O controller, which is recoverable after a pre-determined period of time.
0 Assignments
0 Petitions
Accused Products
Abstract
A cluster having a host connected thereto via a cluster interconnection fabric where a determination is made as to whether an error condition exists in an I/O controller connected to the host via the cluster interconnection fabric. This is achieved by attempting to communicate with the I/O controller a first predetermined time period after an inquiry by an operating system as to whether or not an I/O controller driver stack should be unloaded. Also, the operating system is commanded to unload the I/O controller driver stack upon a determination that the error condition still exists.
15 Citations
20 Claims
-
1. A method, comprising:
-
receiving a request from an operating system (OS) to unload an Input/Output (I/O) controller driver stack; and delaying a response to the OS after determining the request is associated with a transient communication error between a host and an I/O controller, which is recoverable after a pre-determined period of time. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system, comprising:
-
an Input/Output (I/O) controller driver stack; and a control driver, wherein the control driver monitors communication errors between a host and an I/O controller, and if a communication error is transient, the control driver delays responding to an Operating System (OS) request to unload the I/O controller driver stack from the host for a predetermined period of time, during which if communication is re-established between the host and the I/O controller, the OS request is denied. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A method, comprising:
-
receiving from an Operating System (OS) a request to unload an Input/Output (I/O) controller driver stack from memory of a host; determining the request is associated with a transient error condition associated with a temporarily loss of communications between the host and an I/O controller; and delaying a response to the OS for at least a predetermined period of time to determine if the loss of communications is resolved and if resolved, denying the request of the OS. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification