Method and apparatus of a profiling algorithm to quickly detect faulty disks/HBA to avoid application disruptions and higher latencies
First Claim
Patent Images
1. A method for determining a faulty hardware component within a data storage system, comprising:
- collecting, by a processor, data relating to a plurality of input/output (IO) errors associated with a first storage processor within the data storage system, wherein the data storage system includes a plurality of disk array enclosures (DAEs), each DAE having one or more disk drives;
compiling, by the processor, IO error statistics based on the data relating to the plurality of IO errors, the IO error statistics being related to a first one of the DAEs of the data storage system; and
determining, by the processor, a faulty hardware component based on the IO error statistics, wherein the determining of the faulty hardware component comprises utilizing a second storage processor of the data storage system independent from the first storage processor, including examining IO access statistics of the second storage processor for accessing the first DAE through a different path, andwherein the plurality of DAEs are connected to the first storage processor and the second storage processor through an independent first path and an independent second path, and each of the one or more disk drives has a first port connected to the first storage processor through the first path and a second port connected to the second storage processor through the second path.
8 Assignments
0 Petitions
Accused Products
Abstract
One embodiment is related to a method for determining a faulty hardware component within a data storage system, comprising: collecting data relating to a plurality of input/output (IO) errors associated with a first storage processor within the data storage system; compiling IO error statistics based on the data relating to the plurality of IO errors; and determining a faulty hardware component based on the IO error statistics, wherein the determining of the faulty hardware component comprises utilizing a second storage processor of the data storage system independent from the first storage processor.
13 Citations
21 Claims
-
1. A method for determining a faulty hardware component within a data storage system, comprising:
-
collecting, by a processor, data relating to a plurality of input/output (IO) errors associated with a first storage processor within the data storage system, wherein the data storage system includes a plurality of disk array enclosures (DAEs), each DAE having one or more disk drives; compiling, by the processor, IO error statistics based on the data relating to the plurality of IO errors, the IO error statistics being related to a first one of the DAEs of the data storage system; and determining, by the processor, a faulty hardware component based on the IO error statistics, wherein the determining of the faulty hardware component comprises utilizing a second storage processor of the data storage system independent from the first storage processor, including examining IO access statistics of the second storage processor for accessing the first DAE through a different path, and wherein the plurality of DAEs are connected to the first storage processor and the second storage processor through an independent first path and an independent second path, and each of the one or more disk drives has a first port connected to the first storage processor through the first path and a second port connected to the second storage processor through the second path. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A non-transitory machine-readable medium having instructions stored therein which, when executed by a processor, cause the processor to perform troubleshooting operations, the operations comprising:
-
collecting data relating to a plurality of input/output (IO) errors associated with a first storage processor within a data storage system, wherein the data storage system includes a plurality of disk array enclosures (DAEs), each DAE having one or more disk drives; compiling IO error statistics based on the data relating to the plurality of IO errors, the IO error statistics being related to a first one of the DAEs of the data storage system; and determining a faulty hardware component based on the IO error statistics, wherein the determining of the faulty hardware component comprises utilizing a second storage processor of the data storage system independent from the first storage processor, including examining IO access statistics of the second storage processor for accessing the first DAE through a different path, and wherein the plurality of DAEs are connected to the first storage processor and the second storage processor through an independent first path and an independent second path, and each of the one or more disk drives has a first port connected to the first storage processor through the first path and a second port connected to the second storage processor through the second path. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A data processing system, comprising:
-
a processor; and a memory coupled to the processor storing instructions which, when executed by the processor, cause the processor to perform troubleshooting operations, the operations including collecting data relating to a plurality of input/output (IO) errors associated with a first storage processor within the data processing system, wherein the data storage system includes a plurality of disk array enclosures (DAEs), each DAE having one or more disk drives; compiling IO error statistics based on the data relating to the plurality of IO errors, the IO error statistics being related to a first one of the DAEs of the data storage system; and determining a faulty hardware component based on the IO error statistics, wherein the determining of the faulty hardware component comprises utilizing a second storage processor of the data processing system independent from the first storage processor, including examining IO access statistics of the second storage processor for accessing the first DAE through a path, and wherein the plurality of DAEs are connected to the first storage processor and the second storage processor through an independent first path and an independent second path, and each of the one or more disk drives has a first port connected to the first storage processor through the first path and a second port connected to the second storage processor through the second path. - View Dependent Claims (16, 17, 18, 19, 20, 21)
-
Specification