System and Method for Providing Reliability of Communication Between Supernodes of a Multi-Tiered Full-Graph Interconnect Architecture
First Claim
1. A method, in a data processing system, for providing reliability of communication, the method comprising:
determining a current state of links coupled to ports of a first processor of the data processing system, wherein each port of the first processor comprises a plurality of links to a corresponding port on a second processor of the data processing system, and wherein the current state of the links indicates a level of error associated with each link;
determining, for each link, if the level of error associated with the link exceeds a threshold;
for each link whose level of error exceeds the threshold, tagging the link with an error identifier in a switch associated with the ports of the first processor; and
reducing a level of usage for transmitting data on ports associated with links tagged with the error identifier.
Abstract
A method, computer program product, and system are provided for improving the reliability of communication. A first processor determines a current state of links coupled to ports of the first processor of a data processing system. Each port of the first processor comprises a plurality of links to a corresponding port on a second processor of the data processing system. The current state of the links indicates a level of error associated with each link. The first processor determines, for each link, whether the level of error associated with the link exceeds a threshold. For each link whose level of error exceeds the threshold, the first processor tags the link with an error identifier in a switch associated with the ports of the first processor. The first processor then reduces the level of usage for transmitting data on ports associated with links tagged with the error identifier.
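The steps summarized in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the `Link` and `Port` classes, the `error_level` metric, the `usage_weight` field, and the `reduction_factor` parameter are all hypothetical names chosen for illustration, and the switch's error identifier is modeled as a simple boolean flag on each link.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Link:
    link_id: str
    error_level: float          # hypothetical metric, e.g. an observed error rate
    error_tagged: bool = False  # stands in for the error identifier held in the switch


@dataclass
class Port:
    port_id: str
    links: List[Link] = field(default_factory=list)
    usage_weight: float = 1.0   # hypothetical relative share of traffic sent on this port


def tag_and_throttle(ports, threshold, reduction_factor=0.5):
    """Tag links whose error level exceeds the threshold, then lower port usage.

    Returns the IDs of all links tagged during this pass.
    """
    tagged = []
    for port in ports:
        for link in port.links:
            # Step 1 and 2: read the link state and compare it to the threshold.
            if link.error_level > threshold:
                # Step 3: tag the link with the error identifier.
                link.error_tagged = True
                tagged.append(link.link_id)
        # Step 4: reduce usage on any port that has at least one tagged link.
        # The weight is set, not multiplied, so repeated passes do not compound.
        has_tagged_link = any(link.error_tagged for link in port.links)
        port.usage_weight = reduction_factor if has_tagged_link else 1.0
    return tagged


# Usage: two ports, each with two links to a peer processor.
ports = [
    Port("p0", [Link("p0-a", 0.001), Link("p0-b", 0.20)]),
    Port("p1", [Link("p1-a", 0.002), Link("p1-b", 0.003)]),
]
tagged = tag_and_throttle(ports, threshold=0.05)
```

In this sketch only link `p0-b` exceeds the threshold, so port `p0` is the only port whose usage is reduced. How usage is actually reduced (for example, by routing weights or by excluding the port) is an assumption here; the source states only that the level of usage is reduced.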
158 Citations
20 Claims
1. A method, in a data processing system, for providing reliability of communication, the method comprising:
determining a current state of links coupled to ports of a first processor of the data processing system, wherein each port of the first processor comprises a plurality of links to a corresponding port on a second processor of the data processing system, and wherein the current state of the links indicates a level of error associated with each link;
determining, for each link, if the level of error associated with the link exceeds a threshold;
for each link whose level of error exceeds the threshold, tagging the link with an error identifier in a switch associated with the ports of the first processor; and
reducing a level of usage for transmitting data on ports associated with links tagged with the error identifier.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
9. A computer program product, for providing reliability of communication, comprising a computer useable medium having a computer readable program, wherein the computer readable program, when executed in a first processor of a data processing system, causes the first processor to:
determine a current state of links coupled to ports of the first processor, wherein each port of the first processor comprises a plurality of links to a corresponding port on a second processor of the data processing system, and wherein the current state of the links indicates a level of error associated with each link;
determine, for each link, if the level of error associated with the link exceeds a threshold;
for each link whose level of error exceeds the threshold, tag the link with an error identifier in a switch associated with the ports of the first processor; and
reduce a level of usage for transmitting data on ports associated with links tagged with the error identifier.
Dependent claims: 10, 11, 12, 13, 14
15. A data processing system for providing reliability of communication, comprising:
a first processor; and
a memory coupled to the first processor, wherein the memory comprises instructions which, when executed by the first processor, cause the first processor to:
determine a current state of links coupled to ports of a first processor of the data processing system, wherein each port of the first processor comprises a plurality of links to a corresponding port on a second processor of the data processing system, and wherein the current state of the links indicates a level of error associated with each link;
determine, for each link, if the level of error associated with the link exceeds a threshold;
for each link whose level of error exceeds the threshold, tag the link with an error identifier in a switch associated with the ports of the first processor; and
reduce a level of usage for transmitting data on ports associated with links tagged with the error identifier.
Dependent claims: 16, 17, 18, 19, 20
Specification