System and method for isolating faults in a network
First Claim
1. An error isolation system adapted for use in a computer network in communication with a storage device configured to store error information relating to at least one detected error that occurs in the network, the system having a plurality of program modules configured to execute on at least one processor, the program modules including:
- an error identification module configured to determine whether the detected error is a write error or a read error; and
a failure region detection module configured to identify a failure region on the network, the failure region being between a first location on the network at which a write error is detected and a second location on the network at which a read error is detected.
2 Assignments
0 Petitions
Accused Products
Abstract
A fault isolation system in a network is disclosed, particularly suited for use in a unidirectional fibre channel arbitrated loop. Information relating to read and write errors occurring on the loop is stored, and fault regions are located by determining areas on the loop downstream of write errors and upstream of read errors. The system may be extended to networks with bidirectional communications by storing directionality information with the detected errors. Command and response error information is not needed to deterministically locate the fault regions. When a given fault region is identified, loop and device diagnostics are executed for that region of the loop to specifically identify the failed components.
-
Citations
20 Claims
-
1. An error isolation system adapted for use in a computer network in communication with a storage device configured to store error information relating to at least one detected error that occurs in the network, the system having a plurality of program modules configured to execute on at least one processor, the program modules including:
-
an error identification module configured to determine whether the detected error is a write error or a read error; and a failure region detection module configured to identify a failure region on the network, the failure region being between a first location on the network at which a write error is detected and a second location on the network at which a read error is detected. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A method for locating a fault region in a network in communication with a storage device configured to store error information relating to detected errors that occurs in the network, the method including the steps of:
-
determining whether each of a plurality of detected errors is a read error or a write error; and identifying a failure region on the network between a first location on the network where a first read error is detected and a second location on the network where a first write error is detected. - View Dependent Claims (13, 14, 15, 16, 17)
-
-
18. A computer program product stored on a computer-usable medium, comprising a computer-readable program configured to cause a computer to control execution of an application to determine a fault region associated with a plurality of detected errors in a network, the computer-readable program including:
-
an error identification module configured to determine whether each of the plurality of detected errors is a write error or a read error; and a failure region detection module configured to identify a failure region on the network, the failure region being between a first location on the network at which a write error is detected and a second location on the network at which a read error is detected. - View Dependent Claims (19)
-
-
20. A computer network, including:
-
a host including a processor and a host bus adapter; a loop coupled to the host bus adapter and configured to carry packets from the host bus adapter to at least one device on the loop and from the device to the host bus adapter; first logic coupled to the host and configured to detect a plurality of errors in the packets on the loop, the errors including at least one detected read error and one detected write error; and second logic coupled to the host and configured to locate a failure region on the loop downstream of the detected read error and upstream of the detected write error.
-
Specification