Efficient system for predicting and processing storage subsystem failure
First Claim
1. A method for data recovery in a storage system including a supervising processor coupled to a parity-equipped RAID storage subsystem having multiple head disk assemblies ("HDA") each HDA including an HDA controller and at least one storage medium, said method comprising the steps of:
the supervising processor receiving a Read request for reading target data;
the supervising processor directing a first HDA to read the target data;
the first HDA attempting to read the target data and detecting a data error during the attempt;
the first HDA transmitting a recovery alert signal indicative of the data error to the supervising processor;
the first HDA initiating a retry process to provide an output of the target data;
the supervising processor initiating a reconstruction process concurrently with the retry process to provide an output of the target data by combining supplementary data and stored parity, said supplementary data comprising data corresponding to the target data and stored elsewhere in the RAID storage subsystem than the first HDA, and said stored parity comprising parity corresponding to the target data and supplementary data and stored in the RAID storage subsystem; and
determining which of the retry and reconstruction processes first completes;
providing an output of target data from the first completing process in response to the read request; and
aborting the process not completing first.
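The race between the two recovery paths recited above can be sketched as follows. This is only a minimal illustration under stated assumptions, not the patented implementation: `race_recovery`, `slow_retry`, `fast_reconstruct`, and the cooperative `abort` flag are hypothetical names, and two threads stand in for the HDA controller and the supervising processor. Handling of a path that fails rather than completes is omitted.

```python
import threading
import time
from concurrent.futures import FIRST_COMPLETED, ThreadPoolExecutor, wait


def race_recovery(retry, reconstruct):
    """Run both recovery paths concurrently; return (winner_name, data)."""
    abort = threading.Event()  # hypothetical cooperative abort flag
    with ThreadPoolExecutor(max_workers=2) as pool:
        futures = {
            pool.submit(retry, abort): "retry",
            pool.submit(reconstruct, abort): "reconstruction",
        }
        # Block until at least one of the two processes has completed.
        done, _pending = wait(futures, return_when=FIRST_COMPLETED)
        winner = next(iter(done))
        abort.set()  # ask the process that did not complete first to stop
        return futures[winner], winner.result()


def slow_retry(abort):
    # Stand-in for the HDA's local retry: polls the abort flag between attempts.
    for _ in range(50):
        if abort.is_set():
            return None
        time.sleep(0.01)
    return b"target"


def fast_reconstruct(abort):
    # Stand-in for the supervising processor's reconstruction from parity.
    return b"target"


if __name__ == "__main__":
    name, data = race_recovery(slow_retry, fast_reconstruct)
    print(name, data)
```

In this sketch the abort is cooperative: the losing path checks the flag and returns early, which is one plausible way to model "aborting the process not completing first".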
Abstract
Predictive failure analysis of a storage subsystem is efficiently conducted and data quickly recovered from a failed Read operation. This may be implemented in a storage system including a host coupled to a supervising processor that couples to a parity-equipped RAID storage subsystem having multiple HDAs each including an HDA controller and at least one storage medium. In one embodiment, when an HDA experiences an error during a Read attempt, the HDA transmits a recovery alert signal to the supervising processor; then, the processor and HDA begin remote and local recovery processes in parallel. The first process to complete provides the data to the host, and the second process is aborted. In another embodiment, an HDA's PFA operations are restricted to idle times of the HDA. A different embodiment limits HDA performance of PFA to times when the processor is conducting data reconstruction. Another embodiment monitors HDA errors at the supervisory processor level, initiating an HDA's PFA operations when errors at that HDA have a certain characteristic, such as a predetermined frequency of occurrence.
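The last embodiment, in which the supervising processor starts an HDA's PFA once that HDA's errors reach a predetermined frequency, might look roughly like the sketch below. The class name `ErrorMonitor`, the sliding-window policy, and the default threshold and window values are all assumptions chosen for illustration; the abstract does not specify how the frequency is measured.

```python
from collections import defaultdict, deque


class ErrorMonitor:
    """Hypothetical per-HDA error counter kept by the supervising processor."""

    def __init__(self, max_errors=3, window_s=60.0):
        self.max_errors = max_errors      # assumed threshold
        self.window_s = window_s          # assumed sliding window, in seconds
        self._errors = defaultdict(deque)  # hda_id -> recent error timestamps

    def record_error(self, hda_id, now):
        """Log one error; return True if PFA should be initiated for this HDA."""
        recent = self._errors[hda_id]
        recent.append(now)
        # Discard errors that have aged out of the window.
        while recent and now - recent[0] > self.window_s:
            recent.popleft()
        return len(recent) >= self.max_errors


if __name__ == "__main__":
    monitor = ErrorMonitor()
    for t in (0.0, 10.0, 20.0):
        print(t, monitor.record_error("hda0", t))
```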
10 Claims
1. A method for data recovery in a storage system including a supervising processor coupled to a parity-equipped RAID storage subsystem having multiple head disk assemblies ("HDA") each HDA including an HDA controller and at least one storage medium, said method comprising the steps of:
the supervising processor receiving a Read request for reading target data;
the supervising processor directing a first HDA to read the target data;
the first HDA attempting to read the target data and detecting a data error during the attempt;
the first HDA transmitting a recovery alert signal indicative of the data error to the supervising processor;
the first HDA initiating a retry process to provide an output of the target data;
the supervising processor initiating a reconstruction process concurrently with the retry process to provide an output of the target data by combining supplementary data and stored parity, said supplementary data comprising data corresponding to the target data and stored elsewhere in the RAID storage subsystem than the first HDA, and said stored parity comprising parity corresponding to the target data and supplementary data and stored in the RAID storage subsystem; and
determining which of the retry and reconstruction processes first completes;
providing an output of target data from the first completing process in response to the read request; and
aborting the process not completing first.
Dependent claims: 2, 3, 4, 5.
6. A program instructions storage medium tangibly embodying a machine-readable program of instructions to perform method steps for recovery in a storage system that includes a supervising processor coupled to a parity-equipped RAID storage subsystem having multiple head disk assemblies ("HDA") each including an HDA controller and at least one storage medium, said method steps comprising:
the supervising processor receiving a Read request for reading target data;
the supervising processor directing a first HDA to read the target data;
the first HDA attempting to read the target data and detecting a data error during the attempt;
the first HDA transmitting a recovery alert signal indicative of the data error to the supervising processor;
the first HDA initiating a retry process to provide an output of the target data;
the supervising processor initiating a reconstruction process concurrently with the retry process to provide an output of the target data by combining supplementary data and stored parity, said supplementary data comprising data corresponding to the target data and stored elsewhere in the RAID storage subsystem than the first HDA, and said stored parity comprising parity corresponding to the target data and supplementary data and stored in the RAID storage subsystem; and
determining which of the retry and reconstruction processes first completes;
providing an output of target data from the first completing process in response to the Read request; and
aborting the process not completing first.
Dependent claims: 7, 8, 9, 10.
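The reconstruction step in the claims combines supplementary data with stored parity. The claims do not name the parity function; the sketch below assumes bytewise XOR parity, as commonly used in RAID levels with a single parity block per stripe. The function names `xor_blocks` and `reconstruct` are hypothetical.

```python
from functools import reduce


def xor_blocks(blocks):
    """Bytewise XOR of equal-length byte blocks."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*blocks))


def reconstruct(supplementary, parity):
    """Rebuild the unreadable block from the other data blocks plus parity.

    Assumes parity was computed as the XOR of every data block in the stripe,
    so XOR-ing the surviving blocks with parity yields the missing one.
    """
    return xor_blocks([*supplementary, parity])


if __name__ == "__main__":
    d0, d1, d2 = b"\x01\x02", b"\x0f\xf0", b"\xaa\x55"
    parity = xor_blocks([d0, d1, d2])
    # Pretend d0 lives on the first HDA and could not be read.
    print(reconstruct([d1, d2], parity) == d0)
```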
Specification