×

Recovery from transitory storage area network component failures

  • US 8,051,335 B1
  • Filed: 06/02/2008
  • Issued: 11/01/2011
  • Est. Priority Date: 06/02/2008
  • Status: Active Grant
First Claim
Patent Images

1. A mass data storage system comprising a storage server computer, a data storage subsystem, and a plurality of communication paths connecting the storage server computer to the data storage subsystem over which communications are conducted between the storage server computer and the data storage subsystem, each communication path subject to multiple different types of errors each having a specific severity which negatively affects the communications over that communication path;

  • and wherein the storage server computer is operative to;

    detect each type of error which occurs on each communication path;

    count each detected error for each communication path on a continuous ongoing basis to establish an accumulated count value equal to the number of each type of detected error for each communication path;

    decrement the accumulated count value for each type of detected error for each communication path by a predetermined amount at periodic intervals to establish a decremented accumulated count value until the decremented accumulated count value for each type of error for each communication path reaches a zero value;

    attribute a weight value for each type of detected error related to the severity of the type of detected error;

    calculate a weighted error value for each communication path by multiplying the weight value for each type of error by the decremented accumulated count value for each type of error and adding the results of such multiplications for each communication path;

    establish a first fault threshold of weighted error values indicative of unreliable communications over each communication path;

    establish a second fault threshold of weighted error values indicative of reliable communications over each communication path, the second fault threshold being less than the first fault threshold;

    cease use of any one communication path for communications when the weighted error value for that one communication path exceeds or is equal to the first fault threshold; and

    resume use of the one communication path when the weighted error value for that one communication path is less than or equal to the second fault threshold.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×