Synchronized communication between multi-processor clusters of multi-cluster computer systems
First Claim
1. A computer system, comprising:
- a first cluster including a first plurality of processors and a first interconnection controller, the first plurality of processors and the first interconnection controller interconnected by first point-to-point intra-cluster links; and
a second cluster including a second plurality of processors and a second interconnection controller, the second plurality of processors and the second interconnection controller interconnected by second point-to-point intra-cluster links, the first interconnection controller coupled to the second interconnection controller by point-to-point inter-cluster links, the first and second interconnection controllers configured to;
perform an initialization sequence that establishes a characteristic skew pattern between data lanes of the point-to-point inter-cluster links;
encode clock data in each symbol transmitted on the point-to-point inter-cluster links;
recover clock data from each symbol received on the point-to-point inter-cluster links; and
apply the characteristic skew pattern to correct for skew between data lanes of the point-to-point inter-cluster links.
9 Assignments
0 Petitions
Accused Products
Abstract
Improved techniques are provided for detecting and correcting errors and skew in inter-cluster communications within computer systems having a plurality of multi-processor clusters. The local nodes of each cluster include a plurality of processors and an interconnection controller. Intra-cluster links are formed between the local nodes, including the interconnection controller, within a cluster. Inter-cluster links are formed between interconnection controllers of different clusters. Intra-cluster packets may be serialized and encapsulated as inter-cluster packets for transmission on inter-cluster links, preferably with link-layer encapsulation. Each inter-cluster packet may include a sequence identifier and error information computed for that packet. Clock data may be embedded in symbols sent on each bit lane of the inter-cluster links. Copies of transmitted inter-cluster packets may be stored until an acknowledgement is received. The use of inter-cluster packets on an inter-cluster link is preferably transparent to other links and to the protocol layer.
57 Citations
16 Claims
-
1. A computer system, comprising:
-
a first cluster including a first plurality of processors and a first interconnection controller, the first plurality of processors and the first interconnection controller interconnected by first point-to-point intra-cluster links; and
a second cluster including a second plurality of processors and a second interconnection controller, the second plurality of processors and the second interconnection controller interconnected by second point-to-point intra-cluster links, the first interconnection controller coupled to the second interconnection controller by point-to-point inter-cluster links, the first and second interconnection controllers configured to;
perform an initialization sequence that establishes a characteristic skew pattern between data lanes of the point-to-point inter-cluster links;
encode clock data in each symbol transmitted on the point-to-point inter-cluster links;
recover clock data from each symbol received on the point-to-point inter-cluster links; and
apply the characteristic skew pattern to correct for skew between data lanes of the point-to-point inter-cluster links. - View Dependent Claims (2, 3, 4, 5)
-
-
6. An interconnection controller, comprising:
-
an intra-cluster interface configured for coupling with intra-cluster links to a plurality of local processors arranged in a point-to-point architecture in a local cluster;
an inter-cluster interface configured for coupling with an inter-cluster link to a non-local interconnection controller in a non-local cluster; and
a transceiver configured to;
perform an initialization sequence with the non-local interconnection controller that establishes a characteristic skew pattern between data lanes of the point-to-point inter-cluster links;
recover clock data from symbols received on the point-to-point inter-cluster links; and
apply the characteristic skew pattern to correct for skew between data lanes of the point-to-point inter-cluster links. - View Dependent Claims (7, 8, 9, 10, 11, 12, 13, 14, 15, 16)
-
Specification