System and method for predicting single-disk failures
First Claim
1. A computer-implemented method for predicting disk failures, the method comprising:
receiving first values of a predetermined diagnostic parameter collected from a set of known working disks and a set of known failed disks of a storage system;
generating a quantile distribution graph based on the first values;
determining an optimal threshold of the received first values of the predetermined diagnostic parameter based on a quantile distribution of the first values against a plurality of threshold candidates, wherein the quantile distribution of the first values is derived from the quantile distribution graph; and
determining a potential risk of disk failures of one or more disks of a target storage system based on second values of the predetermined diagnostic parameter collected from the one or more disks of the target storage system in view of the optimal threshold.
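The data behind a quantile distribution graph of the kind recited in claim 1 can be sketched as follows. This is a minimal illustration, not the patented implementation: the function name `quantile_curve`, the choice of deciles, and the sample values are all assumptions made for the example, and the claim does not say how the graph is constructed.

```python
from statistics import quantiles


def quantile_curve(values, n=10):
    """Return (quantile fraction, parameter value) pairs for plotting.

    Hypothetical helper: splits the observed values into n equal-probability
    intervals and pairs each cut point with its quantile fraction.
    """
    cuts = quantiles(values, n=n, method="inclusive")
    return [((i + 1) / n, cut) for i, cut in enumerate(cuts)]


# Made-up diagnostic parameter values, one per disk (assumption for the sketch).
working = [0, 0, 0, 1, 2, 0, 3, 0, 1, 5]
failed = [40, 120, 15, 300, 80, 0, 60, 500, 25, 200]

working_curve = quantile_curve(working)
failed_curve = quantile_curve(failed)

for (q, w), (_, f) in zip(working_curve, failed_curve):
    print(f"q={q:.1f}  working={w:.1f}  failed={f:.1f}")
```

Plotting the two curves on the same axes would show where the failed-disk population separates from the working-disk population, which is the region in which threshold candidates are worth examining.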
Abstract
Techniques for predicting single-disk failures are described herein. According to one embodiment, first values of a predetermined diagnostic parameter collected from a set of known working disks and a set of known failed disks of a storage system are received. An optimal threshold of the received first values of the predetermined diagnostic parameter is determined based on a quantile distribution of the first values against a plurality of threshold candidates. A potential risk of disk failures of one or more disks of a target storage system is determined based on second values of the predetermined diagnostic parameter collected from the one or more disks of the target storage system in view of the optimal threshold.
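The threshold selection and risk determination summarized in the abstract can be sketched as below. The sketch rests on several assumptions that the abstract does not state: that larger parameter values indicate worse disk health, that the "optimal" candidate is the one with the largest gap between the fraction of failed disks and the fraction of working disks at or above it, and that a target disk is at risk when its value meets the threshold. All function names, disk names, and sample values are hypothetical.

```python
def fraction_at_or_above(values, threshold):
    """Fraction of disks whose parameter value is >= threshold."""
    return sum(v >= threshold for v in values) / len(values)


def select_threshold(failed, working, candidates):
    """Pick the candidate that best separates failed from working disks.

    Assumed criterion: maximize (failed fraction - working fraction).
    Ties keep the earliest candidate.
    """
    best, best_gap = None, float("-inf")
    for t in candidates:
        # One data point per population for this candidate, compared as a pair.
        gap = fraction_at_or_above(failed, t) - fraction_at_or_above(working, t)
        if gap > best_gap:
            best, best_gap = t, gap
    return best


def at_risk(target_values, threshold):
    """Return the target disks whose value meets or exceeds the threshold."""
    return [disk for disk, v in target_values.items() if v >= threshold]


# Made-up first values from known working and known failed disks.
working = [0, 0, 0, 1, 2, 0, 3, 0, 1, 5]
failed = [40, 120, 15, 300, 80, 0, 60, 500, 25, 200]
candidates = [1, 5, 10, 50, 100]

threshold = select_threshold(failed, working, candidates)

# Made-up second values collected from a target storage system.
target = {"disk-a": 0, "disk-b": 12, "disk-c": 250}
print(threshold, at_risk(target, threshold))
```

With this sample data, the candidate 10 is selected because it covers nine of the ten failed disks and none of the working disks; other separation criteria (for example, weighting false alarms more heavily) would fit the same structure.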
21 Claims
1. A computer-implemented method for predicting disk failures, the method comprising:
receiving first values of a predetermined diagnostic parameter collected from a set of known working disks and a set of known failed disks of a storage system;
generating a quantile distribution graph based on the first values;
determining an optimal threshold of the received first values of the predetermined diagnostic parameter based on a quantile distribution of the first values against a plurality of threshold candidates, wherein the quantile distribution of the first values is derived from the quantile distribution graph; and
determining a potential risk of disk failures of one or more disks of a target storage system based on second values of the predetermined diagnostic parameter collected from the one or more disks of the target storage system in view of the optimal threshold.
Dependent claims: 2, 3
4. A computer-implemented method for predicting disk failures, the method comprising:
receiving first values of a predetermined diagnostic parameter collected from a set of known working disks and a set of known failed disks of a storage system;
determining an optimal threshold of the received first values of the predetermined diagnostic parameter based on a quantile distribution of the first values against a plurality of threshold candidates, wherein determining an optimal threshold comprises:
generating a first set of data points based on values of the predetermined diagnostic parameter of the known failed disks in view of the threshold candidates,
generating a second set of data points based on values of the predetermined diagnostic parameter of the known working disks in view of the threshold candidates, and
comparing corresponding pairs of the first and second sets of data points associated with each of the threshold candidates to select one of the threshold candidates as the optimal threshold; and
determining a potential risk of disk failures of one or more disks of a target storage system based on second values of the predetermined diagnostic parameter collected from the one or more disks of the target storage system in view of the optimal threshold.
Dependent claims: 5, 6, 7
8. A non-transitory machine-readable medium having instructions stored therein, which when executed by a processor, cause the processor to perform operations for predicting disk failures, the operations comprising:
receiving first values of a predetermined diagnostic parameter collected from a set of known working disks and a set of known failed disks of a storage system;
generating a quantile distribution graph based on the first values;
determining an optimal threshold of the received first values of the predetermined diagnostic parameter based on a quantile distribution of the first values against a plurality of threshold candidates, wherein the quantile distribution of the first values is derived from the quantile distribution graph; and
determining a potential risk of disk failures of one or more disks of a target storage system based on second values of the predetermined diagnostic parameter collected from the one or more disks of the target storage system in view of the optimal threshold.
Dependent claims: 9, 10
11. A non-transitory machine-readable medium having instructions stored therein, which when executed by a processor, cause the processor to perform operations for predicting disk failures, the operations comprising:
receiving first values of a predetermined diagnostic parameter collected from a set of known working disks and a set of known failed disks of a storage system;
determining an optimal threshold of the received first values of the predetermined diagnostic parameter based on a quantile distribution of the first values against a plurality of threshold candidates, wherein determining an optimal threshold comprises:
generating a first set of data points based on values of the predetermined diagnostic parameter of the known failed disks in view of the threshold candidates,
generating a second set of data points based on values of the predetermined diagnostic parameter of the known working disks in view of the threshold candidates, and
comparing corresponding pairs of the first and second sets of data points associated with each of the threshold candidates to select one of the threshold candidates as the optimal threshold; and
determining a potential risk of disk failures of one or more disks of a target storage system based on second values of the predetermined diagnostic parameter collected from the one or more disks of the target storage system in view of the optimal threshold.
Dependent claims: 12, 13, 14
15. A data processing system, comprising:
a processor; and
a memory coupled to the processor for storing instructions, which when executed from the memory, cause the processor to perform operations, the operations including
receiving first values of a predetermined diagnostic parameter collected from a set of known working disks and a set of known failed disks of a storage system,
generating a quantile distribution graph based on the first values,
determining an optimal threshold of the received first values of the predetermined diagnostic parameter based on a quantile distribution of the first values against a plurality of threshold candidates, wherein the quantile distribution of the first values is derived from the quantile distribution graph, and
determining a potential risk of disk failures of one or more disks of a target storage system based on second values of the predetermined diagnostic parameter collected from the one or more disks of the target storage system in view of the optimal threshold.
Dependent claims: 16, 17
18. A data processing system, comprising:
a processor; and
a memory coupled to the processor for storing instructions, which when executed from the memory, cause the processor to perform operations, the operations including
receiving first values of a predetermined diagnostic parameter collected from a set of known working disks and a set of known failed disks of a storage system,
determining an optimal threshold of the received first values of the predetermined diagnostic parameter based on a quantile distribution of the first values against a plurality of threshold candidates, wherein determining an optimal threshold comprises:
generating a first set of data points based on values of the predetermined diagnostic parameter of the known failed disks in view of the threshold candidates,
generating a second set of data points based on values of the predetermined diagnostic parameter of the known working disks in view of the threshold candidates, and
comparing corresponding pairs of the first and second sets of data points associated with each of the threshold candidates to select one of the threshold candidates as the optimal threshold, and
determining a potential risk of disk failures of one or more disks of a target storage system based on second values of the predetermined diagnostic parameter collected from the one or more disks of the target storage system in view of the optimal threshold.
Dependent claims: 19, 20, 21
Specification