APAP I/O programmable router
First Claim
1. A parallel array computer system, comprising:
- a plurality of nodes interconnected by a plurality of inter-node communication links in an inter-node array topology, each node having a plurality of processor memory elements, each processor memory element of each node including;
a processor and a local memory coupled to said processor;
an internal input and output port coupled to a parallel internal communication path directly connected to another processor memory element in the same node for bidirectional inter-processor communication therebetween;
an external input and output port coupled to a parallel external communication path directly connected to a processor memory element of another node for bidirectional inter-processor communication therebetween, wherein the parallel external communication path is one of said plurality of inter-node communication links; and
wherein the local memory includes stored instructions selectively executed by the processor for controlling routing data and control information with another processor memory element of the same node along said parallel internal communication path, and between the node containing the processing memory element and said another node along said parallel external communication path, independent of a centralized router providing routing for each of the processor memory elements within the same node, thereby providing for a router that is fully distributed among each of the processing memory elements of each of the nodes of the parallel array computer system; and
wherein said inter-node array links are exclusively provided by each parallel external communication path of said nodes, and said inter-node array topology is not dependent on said parallel internal communication path of each of said plurality of nodes.
0 Assignments
0 Petitions
Accused Products
Abstract
A parallel array processor for massively parallel applications is formed with low power CMOS with DRAM processing while incorporating processing elements on a single chip. Eight processor memory elements on a single chip have their own associated processing element, significant memory, and I/O and are interconnected with a hypercube based, but modified, topology. These nodes are then interconnected, either by a hypercube, modified hypercube, or ring, or ring within ring network topology. The architecture uses all the pins for networking. Each chip has eight 16 bit processors, and eight respective 32K memories. I/O has three internal ports and one external port shared by the plural processors on the chip. Significant software flexibility is provided to enable quick implementation of existing programs written in common languages. The scalable chip has internal and external connections for broadcast and asynchronous SIMD, MIMD and SIMIMD (SIMD/MIMD) with dynamic switching of modes. A fully distributed programmable router is provided by the processing memory elements that form a node. There is program compatibility for the fully scalable system.
493 Citations
14 Claims
-
1. A parallel array computer system, comprising:
-
a plurality of nodes interconnected by a plurality of inter-node communication links in an inter-node array topology, each node having a plurality of processor memory elements, each processor memory element of each node including; a processor and a local memory coupled to said processor; an internal input and output port coupled to a parallel internal communication path directly connected to another processor memory element in the same node for bidirectional inter-processor communication therebetween; an external input and output port coupled to a parallel external communication path directly connected to a processor memory element of another node for bidirectional inter-processor communication therebetween, wherein the parallel external communication path is one of said plurality of inter-node communication links; and wherein the local memory includes stored instructions selectively executed by the processor for controlling routing data and control information with another processor memory element of the same node along said parallel internal communication path, and between the node containing the processing memory element and said another node along said parallel external communication path, independent of a centralized router providing routing for each of the processor memory elements within the same node, thereby providing for a router that is fully distributed among each of the processing memory elements of each of the nodes of the parallel array computer system; and wherein said inter-node array links are exclusively provided by each parallel external communication path of said nodes, and said inter-node array topology is not dependent on said parallel internal communication path of each of said plurality of nodes. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
Specification