Method and system for proactive drive replacement for high availability storage systems
Abstract
Methods for preventing the failure of disk drives in storage systems are disclosed. A system and a computer program product for preventing the failure are also disclosed. Factors relating to the aging or early onset of errors in a disk drive are monitored. These factors are then compared to thresholds. If a threshold is exceeded, an indication for the replacement of the disk drive is given. Sudden rises in the factors are also used to indicate the impending failure of disk drives.
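The two checks named in the abstract, a threshold comparison and detection of a sudden rise, can be illustrated with a minimal sketch. The names `check_factor`, `rise_ratio`, and `min_step` are hypothetical and do not come from the patent, and the rule used for a "sudden rise" (latest increase several times larger than the average earlier increase) is an assumption chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass
class FactorCheck:
    exceeded: bool     # latest reading is above the threshold
    sudden_rise: bool  # latest increase is far larger than earlier increases


def check_factor(readings: Sequence[float], threshold: float,
                 rise_ratio: float = 3.0, min_step: float = 1.0) -> FactorCheck:
    """Compare a monitored factor to a threshold and look for a sudden rise.

    Assumption: a rise is "sudden" when the latest increase exceeds
    rise_ratio times the average of the earlier increases (with a floor of
    min_step so that a flat history does not flag every small change).
    """
    if not readings:
        raise ValueError("at least one reading is required")

    exceeded = readings[-1] > threshold

    sudden_rise = False
    if len(readings) >= 3:
        # Differences between consecutive readings, oldest first.
        deltas = [b - a for a, b in zip(readings, readings[1:])]
        history = deltas[:-1]
        baseline = sum(history) / len(history)
        sudden_rise = deltas[-1] > rise_ratio * max(baseline, min_step)

    return FactorCheck(exceeded=exceeded, sudden_rise=sudden_rise)
```

For example, a steadily growing error count such as `[10, 12, 14, 16]` raises neither flag against a threshold of 100, while `[10, 12, 14, 40]` is flagged as a sudden rise even though the threshold has not been exceeded.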
94 Citations
22 Claims
1. A method for disk drive replacement in a power-managed RAID storage system having a plurality of disk drives that are powered-on at a time before data access and are otherwise powered-off, the method comprising:
monitoring a factor relating to powering-on a particular disk drive in the plurality of disk drives;
determining a manufacturer threshold for replacement of the particular disk drive, the manufacturer threshold specified by a manufacturer of the particular disk drive for the factor;
determining a percentage threshold less than the manufacturer threshold specified by the manufacturer;
predicting a time at which the particular disk drive will fail based on the percentage threshold as applied to the factor and based on a power management characteristic of the particular disk drive;
powering on a replacement disk drive if it is powered off; and
transferring data from the particular disk drive to the replacement disk drive if the predicted time is below the percentage threshold before failure of the particular disk drive; and
using the replacement disk drive in place of the particular disk drive.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11
12. An apparatus for disk drive replacement in a power-managed RAID storage system having a plurality of disk drives that are powered-on at a time before data access and are otherwise powered-off, the apparatus comprising:
a processor;
a machine-readable storage medium including instructions executable by the processor for:
monitoring a factor relating to powering-on a particular disk drive in the plurality of disk drives;
determining a manufacturer threshold for replacement of the particular disk drive, the manufacturer threshold specified by a manufacturer of the particular disk drive for the factor;
determining a percentage threshold less than the manufacturer threshold specified by the manufacturer;
predicting a time at which the particular disk drive will fail based on the percentage threshold as applied to the factor and based on a power management characteristic of the particular disk drive;
powering on a replacement disk drive if it is powered off; and
transferring data from the particular disk drive to a replacement disk drive if the predicted time is below the threshold before failure of the particular disk drive; and
using the replacement disk drive in place of the particular disk drive.
13. A machine-readable storage medium including instructions executable by a processor for preventing disk drive failures in a power-managed RAID storage system having a plurality of disk drives that are powered-on at a time before data access and are otherwise powered-off, the machine-readable storage medium comprising one or more instructions for:
monitoring a factor relating to powering-on a particular disk drive in the plurality of disk drives;
determining a manufacturer threshold for replacement of the particular disk drive, the manufacturer threshold specified by a manufacturer of the particular disk drive for the factor;
determining a percentage threshold less than the manufacturer threshold specified by the manufacturer;
predicting a time at which the particular disk drive will fail based on the percentage threshold as applied to the factor and based on a power management characteristic of the particular disk drive;
powering on a replacement disk drive if it is powered off; and
transferring data from the particular disk drive to a replacement disk drive if the predicted time is below at least one threshold before failure of the particular disk drive; and
using the replacement disk drive in place of the particular disk drive.
Dependent claims: 14, 15, 16, 17, 18, 19, 20, 21, 22
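The steps recited in the independent claims can be read as a small decision procedure. The sketch below is one possible interpretation, not the patented implementation. It assumes the monitored factor is a count of power-on cycles, that the percentage threshold is expressed as a whole-number percent of the manufacturer's limit, that the power management characteristic is an average number of power-on cycles per day, and that data transfer starts once the predicted time remaining falls within a chosen lead time. The names `Drive`, `replacement_limit`, `days_until_limit`, `should_replace`, and `lead_days` are hypothetical, and the actual data copy is left out.

```python
from dataclasses import dataclass


@dataclass
class Drive:
    name: str
    power_on_cycles: int      # monitored factor (assumed: power-on cycle count)
    powered_on: bool = False


def replacement_limit(manufacturer_threshold: int, percent: int) -> float:
    """Limit below the manufacturer threshold, e.g. 80 percent of it."""
    if not 0 < percent < 100:
        raise ValueError("percent must be between 1 and 99")
    return manufacturer_threshold * percent / 100


def days_until_limit(drive: Drive, limit: float, cycles_per_day: float) -> float:
    """Predicted days until the drive reaches the limit at the given duty cycle."""
    if cycles_per_day <= 0:
        raise ValueError("cycles_per_day must be positive")
    return max(0.0, (limit - drive.power_on_cycles) / cycles_per_day)


def should_replace(drive: Drive, spare: Drive, manufacturer_threshold: int,
                   percent: int, cycles_per_day: float, lead_days: float) -> bool:
    """Return True and wake the spare if replacement should start now."""
    limit = replacement_limit(manufacturer_threshold, percent)
    remaining = days_until_limit(drive, limit, cycles_per_day)
    if remaining > lead_days:
        return False
    if not spare.powered_on:
        spare.powered_on = True  # power on the replacement drive if it is off
    # Copying data to the spare and swapping it in would follow here.
    return True
```

With a manufacturer threshold of 50,000 cycles and a percentage threshold of 80, the limit is 40,000 cycles. A drive at 39,900 cycles that is powered on 20 times a day is predicted to reach that limit in 5 days, so a 7-day lead time would trigger replacement.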
Specification