Method and apparatus for isolating faults in a network having serially connected links
First Claim
1. In a system having a plurality of processing units and at least one control unit and at least one intermediate unit for coupling the other units via fiber optic links between each of the units, and whereineach of said units has a fiber optic link adapter interface which may be actuated to form either a static or dynamic connection between the link adapter interfaces at the ends of each link units for establishing bidirectional communication between the units over the links, and wherein a processor or a control unit can function either as an originating unit or a destination unit in bidirectional communication between units, and in which an originating unit transmits data to a destination unit via an intermediate unit, said originating unit being coupled to said intermediate unit via a first link, said intermediate unit being coupled to said destination unit via a second link, said intermediate unit propagating at least some errors originating on said first link to said second link, with either said first link or said second link possibly having a fault and where faults generate error fault signals which may originate from several link adapters of the system from the plural links of the system simultaneously, and where errors occurring on a source link rating below a threshold are propagated to the destination link such that from the standpoint of the destination unit they are indistinguishable from errors arising on the destination link, and wherein said system has a fault analyzer for receiving a fault signal originating from said link adapters for isolating a link on which a fault originates, said fault analyzer providing a correct attribution of errors to a failing link when errors may occur either on a source link or a destination link without mistaking detected threshold crossings caused by cumulative errors occurring over a link, by a method of isolating errors originating on said links including the steps of:
- (a) detecting the occurrence of errors on said first link in excess of a first threshold;
(b) detecting the occurrence of errors on said second link in excess of a second threshold greater than said first threshold; and
(c) attributing said errors to said links on the basis of the detection of said errors in excess of said thresholds, by(1) attributing errors to said first link when errors have been detected in both of said links in excess of the thresholds on both of said links, and(2) attributing errors to the link having an error detected when errors have been detected in only one of said links in excess of the threshold for the link.
0 Assignments
0 Petitions
Accused Products
Abstract
A fault isolation system for use in a network in which bit errors resulting in detected code violation errors or disparity errors may be propagated from a first link to a second link of a network in which multiple serially coupled links may couple two devices. Errors in excess of a first threshold are detected at the receiving end of the first link, while errors in excess of a second threshold greater than the first threshold are detected at the receiving end of the second link. Errors are attributed to the first link in response to the detection of errors in excess of the thresholds on both links and are attributed to the link for which the threshold was exceeded in response to the detection of errors in excess of the threshold on only one of the links. If a substantial number of the bit errors resulting in the threshold being exceeded on the second link originated on the first link, the first threshold will also be exceeded with a high degree of probability. This prevents the erroneous identification of the second link as being the source of the fault where the fault actually originated from a condition on the first link.
88 Citations
20 Claims
-
1. In a system having a plurality of processing units and at least one control unit and at least one intermediate unit for coupling the other units via fiber optic links between each of the units, and wherein
each of said units has a fiber optic link adapter interface which may be actuated to form either a static or dynamic connection between the link adapter interfaces at the ends of each link units for establishing bidirectional communication between the units over the links, and wherein a processor or a control unit can function either as an originating unit or a destination unit in bidirectional communication between units, and in which an originating unit transmits data to a destination unit via an intermediate unit, said originating unit being coupled to said intermediate unit via a first link, said intermediate unit being coupled to said destination unit via a second link, said intermediate unit propagating at least some errors originating on said first link to said second link, with either said first link or said second link possibly having a fault and where faults generate error fault signals which may originate from several link adapters of the system from the plural links of the system simultaneously, and where errors occurring on a source link rating below a threshold are propagated to the destination link such that from the standpoint of the destination unit they are indistinguishable from errors arising on the destination link, and wherein said system has a fault analyzer for receiving a fault signal originating from said link adapters for isolating a link on which a fault originates, said fault analyzer providing a correct attribution of errors to a failing link when errors may occur either on a source link or a destination link without mistaking detected threshold crossings caused by cumulative errors occurring over a link, by a method of isolating errors originating on said links including the steps of: -
(a) detecting the occurrence of errors on said first link in excess of a first threshold; (b) detecting the occurrence of errors on said second link in excess of a second threshold greater than said first threshold; and (c) attributing said errors to said links on the basis of the detection of said errors in excess of said thresholds, by (1) attributing errors to said first link when errors have been detected in both of said links in excess of the thresholds on both of said links, and (2) attributing errors to the link having an error detected when errors have been detected in only one of said links in excess of the threshold for the link. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. In a system for bidirectional communication among digital units, comprising:
-
a plurality of processing units and at least one control unit and at least one intermediate unit for coupling the other units via fiber optic links between each of the units, each of said units having a fiber optic link adapter interface which may be actuated to form either a static or dynamic connection between the link adapter interfaces at the ends of each link units for establishing bidirectional communication between the units over the links, and wherein a processor or a control unit can function either as an originating unit or a destination unit in bidirectional communication between units, and in which an originating unit transmits data to a destination unit via an intermediate unit, said originating unit being coupled to said intermediate unit via a first link, said intermediate unit being coupled to said destination unit via a second link, said intermediate unit propagating at least some errors originating on said first link to said second link, with either said first link or said second link possibly having a fault and where faults generate error fault signals which may originate from several link adapters of the system from the plural links of the system simultaneously, and where errors occurring on a source link rating below a threshold are propagated to the destination link such that from the standpoint of the destination unit they are indistinguishable from errors arising on the destination link, and wherein said system has a fault analyzer for receiving a fault signal originating form said link adapters for isolating a link on which a fault originates, said fault analyzer providing a correct attribution of errors to a failing link when errors may occur either on a source link or a destination link without mistaking detected threshold crossings caused by cumulative errors occurring over a link, said system including for isolating errors originating on said links; (a) means for detecting the occurrence of errors on said first link in excess of a first threshold; (b) means for detecting the occurrence of errors on said second link in excess of a second threshold greater than said first threshold; and (c) means responsive to said detecting means for attributing said errors to said links after the detection of said errors in excess of said thresholds, by (1) attributing errors to said first link when errors have been detected in both of said links in excess of the thresholds on both of said links, and (b
2) attributing errors to the link having an error detected when errors have been detected in only one of said links in excess of the threshold for the link. - View Dependent Claims (12, 13, 14, 15)
-
-
16. In a system having a plurality of processing units and at least one control unit and at least one intermediate unit coupling the other units via an interconnection switch of the intermediate unit and fiber optic links between each of the units, and wherein each of said units has a fiber optic link adapter interface for coupling the unit to a link, and wherein said intermediate unit includes an interconnection switch which may be actuated to from either a static or dynamic connection between the link adapter interfaces at the ends of each link units for establishing bidirectional communication between the units over the links, and in which an originating unit transmits data a destination unit via an intermediate unit, said originating unit being coupled to said intermediate unit via a first link, said intermediate unit being coupled to said destination unit via a second link, said intermediate unit propagating at least some errors originating on said first link to said second link, a method of isolating errors originating on said links including the steps of:
-
(a) detecting the occurrence of errors on said first link in excess of a first threshold; (b) detecting the occurrence of errors on said second link in excess of a second threshold; (c) attributing said errors to said first link in response to the detecting of errors in excess of the thresholds on both of said links; and (d) attributing said errors to the link for which the threshold was exceeded in response to the detecting of errors in excess of the threshold on only one of said links. - View Dependent Claims (17, 18, 19, 20)
-
Specification