Modular array processor architecture having a plurality of interconnected load-balanced parallel processing nodes

US 5,701,482 A
Filed: 11/06/1995
Issued: 12/23/1997
Est. Priority Date: 09/03/1993
Status: Expired due to Fees

Abstract

A modular array processor architecture (10) comprising a plurality of interconnected parallel processing nodes (11) that each comprise a control processor (12), an arithmetic processor (13) having an input port (22) for receiving data from an external source that is to be processed, a node memory (14) that also comprises a portion of a distributed global memory, and a network interface (15) coupled between the control processor (12), the arithmetic processor (13), and the node memory (14). Data and control buses (17, 18) are coupled between the arithmetic processors (13) and network interfaces (15) of each of the processing nodes (11). Respective network interfaces (15) link each of the arithmetic processors (13), node memories (14) and control processors (12) together to provide for communication throughout the architecture (10) and permit each node to communicate with the node memories (14) of all other processing nodes (11). This linking, along with the use of a heuristic scheduling algorithm, provides for load balancing between the processing nodes (11). Data queues are segmented and distributed across the architecture (10) in a way that the source and destination nodes (11) process data locally in the memory (14), while overflow is kept in distributed bulk memories (14). The network interfaces (15) buffer data transferred over the data and control buses (17, 18) to a respective node (11). Also, the network interfaces (15) operate as high-speed DMA controllers to transfer data between the arithmetic processor (13) and node memory (14) of a processing node (11) independent of the operation of the control processor (12) in that node (11). The control bus (17) is used to keep track of available resources throughout the architecture (10) under control of a heuristic scheduling algorithm that reallocates tasks to available arithmetic processors (13) based on a set of heuristic rules to achieve the load balancing. The data bus (18) is used to transfer data between the node memories (14) so that reallocated tasks are performed by selected arithmetic and control processors (13, 12) using data that is stored locally.
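
The abstract does not disclose the heuristic rules themselves, so the following Python sketch only illustrates the general idea of rule-based task reallocation between nodes. Everything in it is an assumption made for illustration: the `Node` class, the `queued_work` load metric, the `threshold` rule, and the `pick_target` and `submit` helpers are hypothetical and are not taken from the patent.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """One processing node; `queued_work` is a hypothetical load metric."""
    node_id: int
    queued_work: int = 0
    tasks: list = field(default_factory=list)


def pick_target(nodes, origin, threshold=2):
    """Assumed rule: move a task only if the least-loaded node has at least
    `threshold` fewer queued work units than the origin node."""
    # Ties on load are broken by the lower node id.
    best = min(nodes, key=lambda n: (n.queued_work, n.node_id))
    if best is not origin and origin.queued_work - best.queued_work >= threshold:
        return best
    return origin


def submit(nodes, origin, name, cost):
    """Place a task that arrived at `origin` and return the chosen node id."""
    target = pick_target(nodes, origin)
    target.tasks.append(name)
    target.queued_work += cost
    return target.node_id


if __name__ == "__main__":
    nodes = [Node(0, queued_work=5), Node(1, queued_work=1), Node(2, queued_work=4)]
    print(submit(nodes, nodes[0], "fft", 3))     # node 0 is busy, node 1 is idle
    print(submit(nodes, nodes[2], "filter", 1))  # loads are now close, task stays put
```

In this sketch the threshold keeps a task on its origin node unless the imbalance is large enough to justify a move, which loosely mirrors the stated goal of running reallocated tasks against data that is stored locally.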

180 Citations

7 Claims

1. An expandable modular array processor architecture comprising:
- a plurality of processing nodes, wherein at least one of said plurality of processing nodes is operable to perform system startup and, wherein each processing node comprises:
  - an arithmetic processor having an input/output port for high speed receiving of data or transmitting of data from an external source that is to be processed, and dedicated local memories, said arithmetic processor further operable to execute signal processing primitive functions;
  - a control processor for controlling processing activity for all processors contained in the plurality of processing nodes and reallocate tasks assigned for processing in its node to available processors in other nodes based on a predetermined set of rules that are implemented by means of a heuristic task scheduling program, said control processors operable upon system startup to perform self tests and then tests of said processing nodes;
  - a large capacity node memory that also comprises a portion of a distributed global memory, said large capacity node memory operable to store intermediate data and results; and
  - a network interface coupled between the control processor, the arithmetic processor and the node memory;
- a data bus coupled between respective arithmetic processors and network interfaces of each of the plurality of processing nodes; and
- a control bus coupled between the respective arithmetic processors and network interfaces of each of the plurality of processing nodes;
- wherein respective network interfaces link the respective arithmetic processors, node memories and control processors together to provide for communication therebetween and permit each node to communicate with respective node memories of all other processing nodes to provide for load balancing therebetween, and to buffer data transferred over the data and control buses to a respective node, and to operate as high-speed DMA controllers to transfer data between the arithmetic processor and node memory of a processing node independent of the control processor in that node.
- 2. The modular array processor architecture of claim 1 wherein data is segmented among various processors in a manner that minimizes data copying and allows data operations in source and destination processors to be done locally.
- 3. The modular array processor architecture of claim 1 wherein the control bus keeps track of available processors of the architecture under control of a heuristic scheduler for reallocating tasks to available processors based on a set of heuristic rules, and wherein the data bus is used to transfer data between the node memories so that reallocated tasks are performed by selected arithmetic and control processors using data that is stored locally.
- 4. The modular array processor architecture of claim 1 comprising a multiple-instruction, multiple-data (MIMD) architecture, having multiple scalar and vector processors interconnected with a distributed global memory.
- 5. The modular array processor architecture of claim 1 wherein each processing node comprises a plurality of arithmetic processors and a plurality of control processors.
- 6. The modular array processor architecture of claim 1 wherein the data bus comprises a VME bus.
- 7. The modular array processor architecture of claim 1 wherein the network interfaces operate as high-speed DMA controllers to transfer data between the arithmetic processor and node memory of a processing node independent of the operation of the control processor in that node.
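
The segmented data queues mentioned in the abstract and in claim 2 can be pictured with a small Python sketch. It is one possible reading rather than the patented mechanism: the `SegmentedQueue` class, its `capacity` parameter, and the three-segment layout (a tail local to the source, a head local to the destination, and an overflow segment standing in for bulk memory) are assumptions introduced here, and all three segments are ordinary in-process deques instead of separate node memories.

```python
from collections import deque


class SegmentedQueue:
    """FIFO split into a source-side tail, a destination-side head, and overflow.

    Hypothetical illustration: in a real system each segment would live in a
    different node memory; here they are plain deques in one process.
    """

    def __init__(self, capacity):
        self.capacity = capacity   # assumed per-segment limit
        self.tail = deque()        # newest items, local to the producing node
        self.overflow = deque()    # middle of the queue, held in bulk memory
        self.head = deque()        # oldest items, local to the consuming node

    def push(self, item):
        """Producer side: append locally, spill the oldest tail item if full."""
        self.tail.append(item)
        if len(self.tail) > self.capacity:
            self.overflow.append(self.tail.popleft())

    def _refill(self):
        """Move items toward the consumer, oldest first (overflow before tail)."""
        while len(self.head) < self.capacity and (self.overflow or self.tail):
            source = self.overflow if self.overflow else self.tail
            self.head.append(source.popleft())

    def pop(self):
        """Consumer side: read locally, refilling the head segment when empty."""
        if not self.head:
            self._refill()
        if not self.head:
            raise IndexError("pop from empty SegmentedQueue")
        return self.head.popleft()

    def occupancy(self):
        """Report how many items each segment currently holds."""
        return {
            "head": len(self.head),
            "overflow": len(self.overflow),
            "tail": len(self.tail),
        }
```

With this layout the producer and the consumer each touch only their own segment on most operations, and items are moved between segments in batches, which is one way to read "minimizes data copying" in claim 2.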

Current Assignee: Hughes Aircraft Company (Rtx Corporation)
Original Assignee: Hughes Aircraft Company (Rtx Corporation)
Inventors: Davies, Steven P.; Harrison, R. Loyd
Primary Examiner(s): Bowler, Alyssa H.
Assistant Examiner(s): Davis Jr, Walter D
Application Number: US08/553,963
Time in Patent Office: 778 Days
Field of Search: 395/800, 395/670, 395/672, 395/675, 395/674, 364/DIG. 1
US Class Current: 718/105
CPC Class Codes: G06F 9/5088 involving task migration
