Efficient system for predicting and processing storage subsystem failure
First Claim
1. A method for data recovery in a storage system including a host coupled to a supervising processor coupled to a parity-equipped RAID storage subsystem having multiple head disk assemblies ("HDA") each HDA including an HDA controller and at least one storage medium, said method comprising the steps of:
the supervising processor receiving from said host a Read request for reading target data;
the supervising processor directing a first HDA to read the target data;
the first HDA attempting to read the target data and detecting a data error during the attempt;
the supervising processor executing a recovery process to reconstruct the target data by combining supplementary data and stored parity, said supplementary data comprising data corresponding to the target data and stored elsewhere in the RAID storage system than the first HDA, and said stored parity comprising parity corresponding to the target data and supplementary data and stored in the RAID storage subsystem;
concurrently with the supervising processor executing the recovery process, the first HDA performing a selected predictive failure analysis ("PFA") to predict future failure of the at least one storage medium of the first HDA; and
after completion of the recovery process, providing reconstructed target data from the supervising processor to the host.
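The recovery and concurrent PFA steps recited above can be illustrated with a minimal Python sketch. It assumes byte-wise XOR parity, which the claim does not specify (it says only "parity-equipped"). The names `xor_blocks`, `reconstruct`, `run_pfa`, and `handle_failed_read` are hypothetical, and the PFA routine is a placeholder threshold check rather than any analysis described in the patent.

```python
from concurrent.futures import ThreadPoolExecutor
from functools import reduce


def xor_blocks(blocks):
    """Byte-wise XOR of equal-length blocks (assumed parity scheme)."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*blocks))


def reconstruct(supplementary, parity):
    """Rebuild the unreadable block from the surviving blocks plus parity."""
    return xor_blocks(list(supplementary) + [parity])


def run_pfa(error_log):
    """Hypothetical stand-in for the first HDA's PFA: flag the medium
    when three or more errors have been logged."""
    return len(error_log) >= 3


def handle_failed_read(supplementary, parity, error_log):
    """Run reconstruction and PFA concurrently after a read error.

    For simplicity this sketch waits for both tasks before returning;
    the claim only requires that the data be provided after recovery
    completes.
    """
    with ThreadPoolExecutor(max_workers=2) as pool:
        pfa_future = pool.submit(run_pfa, error_log)          # first HDA
        recovery_future = pool.submit(reconstruct, supplementary, parity)  # supervising processor
        target = recovery_future.result()
        failure_predicted = pfa_future.result()
    return target, failure_predicted
```

In this sketch the two tasks stand in for work done on separate hardware (the supervising processor and the HDA controller); threads are used only to show the overlap in time.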
Abstract
Predictive failure analysis of a storage subsystem is efficiently conducted and data quickly recovered from a failed Read operation. This may be implemented in a storage system including a host coupled to a supervising processor that couples to a parity-equipped RAID storage subsystem having multiple HDAs each including an HDA controller and at least one storage medium. In one embodiment, when an HDA experiences an error during a Read attempt, the HDA transmits a recovery alert signal to the supervising processor; then, the processor and HDA begin remote and local recovery processes in parallel. The first process to complete provides the data to the host, and the second process is aborted. In another embodiment, an HDA's PFA operations are restricted to idle times of the HDA. A different embodiment limits HDA performance of PFA to times when the processor is conducting data reconstruction. Another embodiment monitors HDA errors at the supervisory processor level, initiating an HDA's PFA operations when errors at that HDA have a certain characteristic, such as a predetermined frequency of occurrence.
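The first embodiment in the abstract, in which remote and local recovery run in parallel and the slower one is aborted, can be sketched as a race between two tasks. This is an illustration only: `local_recovery`, `remote_recovery`, and `recover` are hypothetical names, the delays simulate recovery time, and the returned payload is placeholder data.

```python
import asyncio


async def local_recovery(delay):
    """Simulated HDA-level recovery (for example, re-reading the medium)."""
    await asyncio.sleep(delay)
    return "local", b"data"


async def remote_recovery(delay):
    """Simulated supervising-processor recovery from parity."""
    await asyncio.sleep(delay)
    return "remote", b"data"


async def recover(local_delay, remote_delay):
    """Start both recovery processes, return the first result, abort the other."""
    tasks = [
        asyncio.create_task(local_recovery(local_delay)),
        asyncio.create_task(remote_recovery(remote_delay)),
    ]
    done, pending = await asyncio.wait(tasks, return_when=asyncio.FIRST_COMPLETED)
    for task in pending:
        task.cancel()  # abort the process that did not finish first
    # Let the cancellations settle before returning.
    await asyncio.gather(*pending, return_exceptions=True)
    return done.pop().result()
```

Calling `asyncio.run(recover(0.2, 0.01))` returns the remote result in this simulation, because the remote task is given the shorter delay.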
10 Claims
1. A method for data recovery in a storage system including a host coupled to a supervising processor coupled to a parity-equipped RAID storage subsystem having multiple head disk assemblies ("HDA") each HDA including an HDA controller and at least one storage medium, said method comprising the steps of:
the supervising processor receiving from said host a Read request for reading target data;
the supervising processor directing a first HDA to read the target data;
the first HDA attempting to read the target data and detecting a data error during the attempt;
the supervising processor executing a recovery process to reconstruct the target data by combining supplementary data and stored parity, said supplementary data comprising data corresponding to the target data and stored elsewhere in the RAID storage system than the first HDA, and said stored parity comprising parity corresponding to the target data and supplementary data and stored in the RAID storage subsystem;
concurrently with the supervising processor executing the recovery process, the first HDA performing a selected predictive failure analysis ("PFA") to predict future failure of the at least one storage medium of the first HDA; and
after completion of the recovery process, providing reconstructed target data from the supervising processor to the host.
6. A data storage medium tangibly embodying a computer-processor-readable program of instructions for causing a supervising processor and a first one of a multiple of head disk assemblies to perform method steps for data recovery in a storage system, which storage system including a parity-equipped RAID storage subsystem coupled to said supervising processor and having said multiple of head disk assemblies ("HDA"), each HDA including an HDA controller and at least one storage medium, said method steps comprising:
the supervising processor receiving from said host a Read request for reading target data;
the supervising processor directing a first HDA to read the target data;
the first HDA attempting to read the target data and detecting a data error during the attempt;
the supervising processor executing a recovery process to reconstruct the target data by combining supplementary data and stored parity, said supplementary data comprising data corresponding to the target data and stored elsewhere in the RAID storage system than the first HDA, and said stored parity comprising parity corresponding to the target data and supplementary data and stored in the RAID storage subsystem;
concurrently with the supervising processor executing the recovery process, the first HDA performing a selected predictive failure analysis ("PFA") to predict future failure of the at least one storage medium of the first HDA; and
after completion of the recovery process, providing reconstructed target data from the supervising processor to the host.
Specification