Fault isolation in a network
Abstract
A system to isolate a fault to a particular port from among multiple ports in a network. The network typically has a plurality of devices including hosts, storage units, and switch groups that intercommunicate via transceivers. A fault indication is received from one or more of the devices in the network. The fault indication is then processed with a chain of fault indication rules that have been linked together into a binary decision path based on a set of device rules and a data flow model for the network. This permits determining the particular port responsible for the fault, and reporting that port to a user of the network.
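The idea of rules chained into a binary decision path can be illustrated with a minimal Python sketch. Everything named here is an assumption made for illustration, not taken from the patent: the `Port` record, the `rx_errors` counter, the two-rule chain, and the verdict strings. Each rule is a yes/no test whose outcome is either another rule or a final verdict, and the data flow model is reduced to a single `upstream` pointer per port.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Optional, Tuple, Union


@dataclass
class Port:
    """Hypothetical port record; field names are illustrative only."""
    name: str
    rx_errors: int           # information-loss errors seen on data received here
    upstream: Optional[str]  # port feeding this one in the (assumed) data flow model


@dataclass
class Rule:
    """One yes/no test; each outcome is another Rule or a final verdict string."""
    test: Callable[[Port, Dict[str, Port]], bool]
    if_true: Union["Rule", str]
    if_false: Union["Rule", str]


def evaluate(rule: Union[Rule, str], port: Port, ports: Dict[str, Port]) -> str:
    """Follow the binary decision path until a verdict string is reached."""
    node = rule
    while isinstance(node, Rule):
        node = node.if_true if node.test(port, ports) else node.if_false
    return node


def upstream_has_errors(port: Port, ports: Dict[str, Port]) -> bool:
    return port.upstream is not None and ports[port.upstream].rx_errors > 0


# Assumed two-rule chain: "does this port see errors?" then "does its feeder too?"
UPSTREAM_RULE = Rule(upstream_has_errors, "follow_upstream", "faulty_link")
CHAIN = Rule(lambda port, ports: port.rx_errors > 0, UPSTREAM_RULE, "no_fault")


def isolate(start: str, ports: Dict[str, Port]) -> Optional[Tuple[Optional[str], str]]:
    """Walk against the data flow to the first link where errors appear."""
    current = ports[start]
    visited = set()
    while current.name not in visited:
        visited.add(current.name)
        verdict = evaluate(CHAIN, current, ports)
        if verdict == "faulty_link":
            return (current.upstream, current.name)
        if verdict == "no_fault":
            return None
        current = ports[current.upstream]  # verdict was "follow_upstream"
    return None  # the upstream pointers form a loop, so no verdict is given


# Illustrative data path: host_hba -> sw_p1 -> sw_p2 -> storage_p0
PORTS = {
    "host_hba": Port("host_hba", 0, None),
    "sw_p1": Port("sw_p1", 0, "host_hba"),
    "sw_p2": Port("sw_p2", 7, "sw_p1"),
    "storage_p0": Port("storage_p0", 5, "sw_p2"),
}

print(isolate("storage_p0", PORTS))  # ('sw_p1', 'sw_p2')
```

In this toy data both `sw_p2` and `storage_p0` report errors, but the sketch attributes the fault to the link feeding `sw_p2`, since that is the first point on the path where loss is observed. A real rule set would presumably draw on the error classification, propagation, and correlation attributes that the claims mention; those are not modeled here.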
19 Claims
1. A method to isolate a fault in a network, the method comprising:
- receiving multiple correlated fault indications from devices in the network, wherein fault indication is a loss of a portion of transmitted information while maintaining routing of data to said device;
- processing said correlated fault indications with a chain of fault indication rules linked together into a binary decision path based on a set of device rules and a data flow model for the network to determine a root cause of said fault indications including using attribute data in said device rules to look up port information selected from the group consisting of; error classification, error propagation, correlation between said ports, and topology data provided by a device provider embodied in said device rules; and
- reporting said root cause to a user of the network, wherein said root cause identifies a faulty link where initial information loss occurred.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
11. A system to isolate a fault in a network including one or more hosts, comprising:
- a processor in one said host to receive multiple correlated fault indications from devices in the network, wherein fault indication includes loss of information while maintaining routing of data to said device;
- said processor further to determine a faulty link where initial information loss occurred, by processing instances of said correlated fault indications with a chain of fault indication rules linked together into a binary decision path based on a set of device rules and a data flow model for the network, wherein said data flow model is based upon information about instances of ports selected from the group consisting of; error classification, error propagation, correlation between said ports, topology data embodied in said device rules, and combination thereof; and
- said processor to report said faulty link to a user of the network.
Dependent claims: 12, 13, 14, 15, 16, 17, 18
19. A method to isolate a fault to a particular link among a plurality of links in a storage area network (SAN), wherein the SAN has a plurality of devices including hosts, storage units, and switch groups that intercommunicate via optical transceivers, the method comprising:
- receiving multiple correlated recorded fault indications from at least one said device in the SAN, wherein said fault indications are associated with loss of information while maintaining routing of data to said device in receipt of said fault; wherein said fault indications are provided only through device port counters and are absent from an error log;
- processing said correlated fault indications to determine a faulty link where initial information loss occurred based on a chain of fault indication rules linked together into a binary decision path, wherein said fault indication rules are based on a set of device rules and a data flow model for the SAN, including using attribute data in said rules to look up port information instances selected from the group consisting of; error classification, error propagation, correlation between said ports, topology data embodied in said device rules, and combinations thereof; and
- reporting said faulty link port to a user of the SAN.
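Claim 19 refers to fault indications that appear only in device port counters and never in an error log. One way such indications might be derived is sketched below in Python, purely as an illustration: two counter snapshots are compared, and any counter that increased on a port with no error-log entry is reported. The snapshot layout, the counter name `crc_errors`, and the function name are all hypothetical.

```python
from typing import Dict, FrozenSet, List, Tuple

# Hypothetical snapshot layout: {port name: {counter name: cumulative value}}
Snapshot = Dict[str, Dict[str, int]]


def counter_fault_indications(
    previous: Snapshot,
    current: Snapshot,
    logged_ports: FrozenSet[str] = frozenset(),
) -> List[Tuple[str, str, int]]:
    """Return (port, counter, increase) for counters that grew between snapshots.

    Ports listed in logged_ports already have an error-log entry and are
    skipped, so only counter-only indications remain.
    """
    indications = []
    for port, counters in current.items():
        if port in logged_ports:
            continue
        old = previous.get(port, {})
        for counter, value in counters.items():
            delta = value - old.get(counter, 0)
            if delta > 0:  # a negative delta (counter reset) is ignored here
                indications.append((port, counter, delta))
    return sorted(indications)


before = {"sw_p1": {"crc_errors": 0}, "sw_p2": {"crc_errors": 3}}
after = {"sw_p1": {"crc_errors": 0}, "sw_p2": {"crc_errors": 10}}

print(counter_fault_indications(before, after))  # [('sw_p2', 'crc_errors', 7)]
```

Working from counter deltas rather than absolute values keeps long-standing error totals from being reported as new faults; handling counter resets properly is left out of this sketch.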
Specification