Server initiated predictive failure analysis for disk drives
First Claim
1. A method for performing predictive failure analysis (PFA) on one or more disk drives in a server system, the method comprising the steps of:
- monitoring the server system at the system level for a PFA triggering event;
issuing a PFA initiation command to the one or more disk drives if a PFA triggering event is encountered on the server system;
performing PFA on the one or more disk drives; and
reporting a PFA error to the server system if any of the one or more disk drives fails the PFA.
1 Assignment
0 Petitions
Accused Products
Abstract
The present invention provides a method, apparatus and program product for improving reliability in RAID/Server systems by monitoring the RAID/Server system at the system level for predictive failure analysis (PFA) triggering events. Examples of PFA triggering events include: rebuild operations, addition of new disk drives, a change in usage patterns, and suspected handling damage. Once a triggering event is detected, the RAID/Server system issues a command to the disk drives in the system to begin performing PFA. If a PFA error is detected on any of the drives, the error is reported to the RAID/Server system.
44 Citations
21 Claims
-
1. A method for performing predictive failure analysis (PFA) on one or more disk drives in a server system, the method comprising the steps of:
-
monitoring the server system at the system level for a PFA triggering event;
issuing a PFA initiation command to the one or more disk drives if a PFA triggering event is encountered on the server system;
performing PFA on the one or more disk drives; and
reporting a PFA error to the server system if any of the one or more disk drives fails the PFA. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A computer system, comprising:
-
a server;
a Predictive Failure Analysis (PFA) monitor incorporated within the server, the PFA monitor capable of detecting a system level PFA triggering event, and issuing a PFA initiate command if a PFA triggering event is detected;
one or more Redundant Array of Independent Disk (RAID) units operably connected to the server; and
a plurality of disk drives incorporated within each of the one or more RAID units, wherein each of the plurality of disk drives is capable of receiving the PFA initiate command issued by the PFA monitor, the PFA command triggering a PFA measurement within each of the plurality of disk drives. - View Dependent Claims (8, 9, 10, 11, 12, 13)
-
-
14. A Redundant Array of Independent Disks (RAID), comprising:
-
a RAID controller;
a predictive failure analysis (PFA) monitor incorporated within the RAID controller, the PFA monitor capable of detecting a system level triggering event, and issuing a PFA initiate command if the PFA triggering event is detected; and
a plurality of disk drives incorporated within the RAID, wherein each of the plurality of disk drives is capable of receiving the PFA command issued by the PFA monitor, the PFA command triggering a PFA measurement within each of the plurality of disk drives. - View Dependent Claims (15, 16, 17, 18)
-
-
19. A program product, comprising:
-
a predictive failure analysis (PFA) mechanism that monitors a server system at a system level for a PFA triggering event, issues a PFA initiation command to one or more disk drives if a PFA triggering event is encountered on the server system, performs PFA on the one or more disk drives, and reports a PFA error to the server system if any of the one or more disk drives fails the PFA, and computer-readable signal bearing media bearing the PFA mechanism. - View Dependent Claims (20, 21)
-
Specification