Cluster of workstations for solving compute-intensive applications by exchanging interim computation results using a two phase communication protocol
First Claim
1. A computer network architecture system comprising:
- a first segment;
a second segment;
a first computer having two network interface cards associated therewith;
said first computer being connected to said first network segment through a first of said two network interface cards, and said first computer being connected to said second segment through a second of said two network interface cards;
a second computer having two network interface cards associated therewith;
said second computer being connected to said first network segment through a first of said two network interface cards, and said second computer being connected to said second segment through a second of said two network interface cards;
a third computer having one network interface card associated therewith, said third computer only being connected to said first segment through said one network interface cards; and
a fourth computer having one network interface card associated therewith, said fourth computer only being connected to said second segment through said one network interface cards;
wherein said first, second, third, and fourth computers cooperate to solve an application problem by each calculating interim results and then exchanging messages including the interim results using a two phase message passing protocol in which;
during a first phase of said two-phase message passing protocol, said first computer and said third computer exchange messages on said first network segment and said second and said fourth computer exchange messages on said second network segment;
during a second phase of said two-phase message passing protocol, said first computer and said fourth computer exchange messages on said second network segment and said second computer and said third computer exchange messages on said first network segment.
1 Assignment
0 Petitions
Accused Products
Abstract
A system and method for connecting computer workstations in clusters to perform parallel-distributed processing with respect to compute-intensive applications are provided. Commodity computers/workstations and commodity network hardware are arranged to form unique-architecture building blocks (clusters) which may then act as supernodes in larger scale cluster systems. An integrated ROCC+ (Reduced Overhead Cluster Communication) message passing software system provides unique communication logic for efficient implementation of collective message passing operations between each node and supernode. According to a preferred embodiment, each building block comprises two ethernet segments and four nodes (e.g., workstations), two of which are connected by Network Interface Cards (NICs) to both segments with the remaining nodes each connected to a respective one of the segments. The ROCC+ software establishes communication links within and between the basic building blocks so as to eliminate collisions (contention) on segments while allowing for parallel (concurrent) message passing between nodes and supernodes.
110 Citations
25 Claims
-
1. A computer network architecture system comprising:
-
a first segment; a second segment; a first computer having two network interface cards associated therewith;
said first computer being connected to said first network segment through a first of said two network interface cards, and said first computer being connected to said second segment through a second of said two network interface cards;a second computer having two network interface cards associated therewith;
said second computer being connected to said first network segment through a first of said two network interface cards, and said second computer being connected to said second segment through a second of said two network interface cards;a third computer having one network interface card associated therewith, said third computer only being connected to said first segment through said one network interface cards; and a fourth computer having one network interface card associated therewith, said fourth computer only being connected to said second segment through said one network interface cards; wherein said first, second, third, and fourth computers cooperate to solve an application problem by each calculating interim results and then exchanging messages including the interim results using a two phase message passing protocol in which; during a first phase of said two-phase message passing protocol, said first computer and said third computer exchange messages on said first network segment and said second and said fourth computer exchange messages on said second network segment; during a second phase of said two-phase message passing protocol, said first computer and said fourth computer exchange messages on said second network segment and said second computer and said third computer exchange messages on said first network segment. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A computer network architecture system comprising m supernodes, wherein m is an integer greater than one, each of said m supernodes further comprising:
-
a first segment; a second segment; a first computer having at least two network interface cards associated therewith, said first computer being connected to said first segment through a first of said at least two network interface cards, and said first computer being connected to said second segment through a second of said at least two network interface cards; a second computer having at least two network interface cards associated therewith, said second computer being connected to said first segment through a first of said at least two network interface cards, and said second computer being connected to said second segment through a second of said at least two network interface cards; a third computer having at least one network interface card associated therewith, said third computer being connected to said first segment through a first of said at least one network interface card; and
,a fourth computer having at least one network interface card associated therewith, said fourth computer being connected to said second segment through a first of said at least one network interface card; means for interconnecting said m supernodes; and
wherein said means for interconnecting comprise a third network interface card associated with each of said first and second computers of each of said m supernodes for use in connecting said m supernodes in a daisy-chain. - View Dependent Claims (7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17)
-
-
18. In a computer network architecture system comprising at least one supernode comprising first and second segments, first and second nodes each connected to said first and second segments, a third node connected to said first segment, and a fourth node connected to said second segment, a method for parallel-distributed processing comprising the steps of:
-
simultaneously processing data in each of said first, second, third and fourth nodes, respectively; in a first phase, passing data on said first segment from said first node to said third node and then from said third node to said first node while simultaneously passing data on said second segment from said second node to said fourth node and then from said fourth node to said second node; and
,in a second phase;
passing data on said second segment from said first node to said fourth node and then from said fourth node to said first node while simultaneously passing data on said first segment from said second node to said third node and then from said third node to said second node. - View Dependent Claims (19, 20, 21)
-
-
22. In a computer network architecture system comprising at least one supermode consisting essentially of:
- first and second segments, first and second nodes each connected to said first and second segments, a third node connected to said first segment, and a fourth node connected to said second segment, a method for parallel-distributed processing comprising the steps of;
processing data within each of said first, second, third and fourth nodes in parallel; and
,systematically passing messages comprising intermediate results of calculations between pairs of nodes in such manner as to provide all messages to all nodes.
- first and second segments, first and second nodes each connected to said first and second segments, a third node connected to said first segment, and a fourth node connected to said second segment, a method for parallel-distributed processing comprising the steps of;
-
23. A system for parallel-distributed processing made up of plurality of supernodes, each consisting essentially of:
-
first and second segments; first and second nodes each connected to said first and second segments, a third node connected to said first segment; a fourth node connected to said second segment; and
,means for establishing communication links and passing messages between each of said first, second, third and fourth nodes, said means providing for concurrent message passing between pairs of said nodes while avoiding contention on said first and second segments during message passing, and said means resulting in communication of processing results of each of said first, second, third and fourth nodes with the other three of said nodes, respectively. - View Dependent Claims (24)
-
-
25. A computer network architecture system made up of a plurality of computers arranged in supernodes, wherein each supernode consists essentially of:
-
a first segment; a second segment; a first computer having two network interface cards associated therewith, said first computer being connected to said first segment through a first of said two network interface cards, and said first computer being connected to said second segment through a second of said two network interface cards; a second computer having two network interface cards associated therewith, said second computer being connected to said first segment through a first of said two network interface cards, and said second computer being connected to said second segment through a second of said network interface cards; a third computer having one network interface card associated therewith, said third computer only being connected to said first segment through a first of said network interface cards; and
,a fourth computer having one network interface card associated therewith, said fourth computer only being connected to said second segment through a first of said one network interface cards.
-
Specification