Method and Apparatus for Routing Data in an Inter-Nodal Communications Lattice of a Massively Parallel Computer System by Dynamically Adjusting Local Routing Strategies
First Claim
1. A computer-implemented method for inter-nodal packet routing in a parallel computer system, said parallel computer system comprising a plurality of nodes arranged in a network of node-to-node communications links for inter-nodal communications, each node comprising at least one processor for executing a respective application sub-process of at least one application and a nodal memory, said method comprising the steps of:
- at each of a plurality of said nodes of said parallel computer system, automatically routing each of a plurality of packets from a respective source node to a respective destination node along said node-to-node communications links of said network using a respective local automated routing strategy, wherein at least some of said packets must traverse one or more respective intermediate nodes between the respective source node and the respective destination node;
altering said respective local automated routing strategy of at least some nodes of said plurality of nodes during execution of said at least one application;
responsive to said altering step, at each of said at least some nodes for which said respective local automated routing strategy was altered, thereafter routing each of a plurality of packets from a respective source node to a respective destination node along said node-to-node communications links of said network using a respective local automated routing strategy as altered by said altering step.
1 Assignment
0 Petitions
Accused Products
Abstract
A massively parallel computer system contains an inter-nodal communications network of node-to-node links. Each node implements a respective routing strategy for routing data through the network, the routing strategies not necessarily being the same in every node. The routing strategies implemented in the nodes are dynamically adjusted during application execution to shift network workload as required. Preferably, adjustment of routing policies in selective nodes is performed at synchronization points. The network may be dynamically monitored, and routing strategies adjusted according to detected network conditions.
34 Citations
20 Claims
-
1. A computer-implemented method for inter-nodal packet routing in a parallel computer system, said parallel computer system comprising a plurality of nodes arranged in a network of node-to-node communications links for inter-nodal communications, each node comprising at least one processor for executing a respective application sub-process of at least one application and a nodal memory, said method comprising the steps of:
-
at each of a plurality of said nodes of said parallel computer system, automatically routing each of a plurality of packets from a respective source node to a respective destination node along said node-to-node communications links of said network using a respective local automated routing strategy, wherein at least some of said packets must traverse one or more respective intermediate nodes between the respective source node and the respective destination node; altering said respective local automated routing strategy of at least some nodes of said plurality of nodes during execution of said at least one application; responsive to said altering step, at each of said at least some nodes for which said respective local automated routing strategy was altered, thereafter routing each of a plurality of packets from a respective source node to a respective destination node along said node-to-node communications links of said network using a respective local automated routing strategy as altered by said altering step. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A parallel computer system, comprising:
-
a plurality of nodes, each node having at least one processor for executing a respective application sub-process and a memory accessible by the at least one processor; a plurality of node-to-node communication links forming a process network for inter-nodal communications of application data; a respective routing mechanism in each node of said plurality of nodes for routing packets along said process network, each said routing mechanism a respective routing mechanism in each node of said plurality of nodes, each said routing mechanism automatically routing each of a plurality of packets from a respective source node to a respective destination node along said node-to-node communications links of said network using a respective automated local routing strategy, wherein at least some of said packets must traverse one or more respective intermediate nodes between the respective source node and the respective destination node, wherein, in at least some said nodes, the respective routing mechanism alters the respective automated local routing strategy of the node during execution of a distributed application in the at least some nodes, and thereafter routes each of a plurality of packets from a respective source node to a respective destination node along said node-to-node communications links of said network using a respective local automated routing strategy as altered. - View Dependent Claims (12, 13, 14, 15, 16)
-
-
17. A program product for inter-nodal packet routing in a parallel computer system, said parallel computer system comprising a plurality of nodes arranged in a network of node-to-node communications links for inter-nodal communications, each node comprising at least one processor for executing a respective application sub-process and a nodal memory, the program product comprising:
-
a plurality of computer executable instructions recorded on signal-bearing media, wherein said instructions, when executed by at least one node of said computer system, cause the at least one node to perform the steps of; at each node of said at least one node, automatically routing each of a plurality of packets from a respective source node to a respective destination node along said node-to-node communications links of said network using a respective local automated routing strategy, wherein at least some of said packets must traverse one or more respective intermediate nodes between the respective source node and the respective destination node; at each node of said at least one node, altering said respective local automated routing strategy during execution of said at least one application; responsive to said altering step, at each node of said at least one node, thereafter routing each of a plurality of packets from a respective source node to a respective destination node along said node-to-node communications links of said network using a respective local automated routing strategy as altered by said altering step. - View Dependent Claims (18, 19)
-
-
20. The program product of claim 20, wherein each said local automated routing strategy is constrained to route packets along a path containing a minimal number of links of said network.
Specification