Handling zero fault tolerance events in machines where failure likely results in unacceptable loss
First Claim
1. A computer program product for managing I/O requests to a storage array of storage devices in a machine having a processor node and device adaptor, the computer program product comprising a computer readable storage medium having computer readable program code embodied therein that is executable to perform operations, the operations comprising:
- in response to the device adaptor initiating a rebuild of data at the storage devices in the storage array, determining whether a remaining fault tolerance at the storage array comprises a non-zero fault tolerance that permits at least one further storage device of the storage devices to fail and still allow recovery of data stored in the storage array; and
determining, by the device adaptor, whether processor utilization at the device adaptor exceeds a utilization threshold after determining that the remaining fault tolerance is not a zero fault tolerance;
initiating, by the device adaptor, an operation to reduce a rate at which I/O requests to the storage array are processed at the device adaptor in response to determining that the processor utilization at the device adaptor exceeds the utilization threshold.
1 Assignment
0 Petitions
Accused Products
Abstract
Provided are a computer program product, system, and method for managing I/O requests to a storage array of storage devices in a machine having a processor node and device adaptor. In response to initiating a rebuild of data in the storage array, the device adaptor determines whether a remaining fault tolerance at the storage array comprises a non-zero fault tolerance that permits at least one further storage device to fail and still allow recovery of data stored in the storage array. In response to determining that the remaining fault tolerance is a zero fault tolerance that does not permit at least one storage device to fail and allow recovery of data, the device adaptor sends a message to the processor node to cause the processor node to initiate an emergency protocol to terminate a mission critical operation when the processor node is performing the mission critical operation.
-
Citations
20 Claims
-
1. A computer program product for managing I/O requests to a storage array of storage devices in a machine having a processor node and device adaptor, the computer program product comprising a computer readable storage medium having computer readable program code embodied therein that is executable to perform operations, the operations comprising:
-
in response to the device adaptor initiating a rebuild of data at the storage devices in the storage array, determining whether a remaining fault tolerance at the storage array comprises a non-zero fault tolerance that permits at least one further storage device of the storage devices to fail and still allow recovery of data stored in the storage array; and determining, by the device adaptor, whether processor utilization at the device adaptor exceeds a utilization threshold after determining that the remaining fault tolerance is not a zero fault tolerance; initiating, by the device adaptor, an operation to reduce a rate at which I/O requests to the storage array are processed at the device adaptor in response to determining that the processor utilization at the device adaptor exceeds the utilization threshold. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system for managing I/O requests, comprising:
-
a processor node; a storage array of storage devices; a device adaptor managing access to the storage array, wherein the device adaptor performs operations, the operations comprising; in response to the device adaptor initiating a rebuild of data at the storage devices in the storage array, determining whether a remaining fault tolerance at the storage array comprises a non-zero fault tolerance that permits at least one further storage device of the storage devices to fail and still allow recovery of data stored in the storage array; and determining, by the device adaptor, whether processor utilization at the device adaptor exceeds a utilization threshold after determining that the remaining fault tolerance is not a zero fault tolerance; initiating, by the device adaptor, an operation to reduce a rate at which I/O requests to the storage array are processed at the device adaptor in response to determining that the processor utilization at the device adaptor exceeds the utilization threshold. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A method for managing I/O requests to a storage array of storage devices in a machine having a processor node and device adaptor, comprising:
-
in response to the device adaptor initiating a rebuild of data at the storage devices in the storage array, determining whether a remaining fault tolerance at the storage array comprises a non-zero fault tolerance that permits at least one further storage device of the storage devices to fail and still allow recovery of data stored in the storage array; and determining, by the device adaptor, whether processor utilization at the device adaptor exceeds a utilization threshold after determining that the remaining fault tolerance is not a zero fault tolerance; initiating, by the device adaptor, an operation to reduce a rate at which I/O requests to the storage array are processed at the device adaptor in response to determining that the processor utilization at the device adaptor exceeds the utilization threshold. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification