Multiprocessing computer system employing a cluster communication error reporting mechanism
Abstract
In one embodiment, a multiprocessing computer system includes a plurality of nodes. The plurality of nodes may be interconnected through a global interconnect network which supports cluster communications. An initiating node may launch a request to a remote node's memory. In the event of an error, an error status register of a system interface of the initiating cluster node is set to indicate the occurrence of an error. The error may be the result of an access violation, or the result of a time-out occurrence in either the remote node or the initiating node. Various other errors may alternatively be reported. The system interface advantageously includes a plurality of error status registers, with a separate error status register provided for each processor included in the node. A process running on any of the processors of the node reads an error by issuing a transaction to a unique address, wherein the unique address is independent of the processor upon which the process is running. The unique address, together with the transaction ID indicative of the processor which is attempting to read an error status, is used by the system interface to determine which of the plurality of error status registers to access.
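The register selection described in the abstract can be illustrated with a small software model. This is only a sketch under stated assumptions, not the patented hardware: the address value `ESR_ADDRESS`, the class `SystemInterface`, and the encoding of the processor number in the upper bits of the transaction ID are all hypothetical choices made for illustration.

```python
from dataclasses import dataclass, field

# Hypothetical constants; the source gives no concrete values.
ESR_ADDRESS = 0xFFFF0000   # the single "unique address" used by every processor
SEQUENCE_BITS = 4          # assumed: low bits of a transaction ID are a sequence number
NO_ERROR = 0


@dataclass
class SystemInterface:
    """Toy model of a node's system interface with one error status register per processor."""
    num_processors: int
    esr: list = field(init=False)

    def __post_init__(self):
        self.esr = [NO_ERROR] * self.num_processors

    def processor_of(self, transaction_id: int) -> int:
        # Assumed encoding: the upper bits of the transaction ID name the issuing processor.
        processor_id = transaction_id >> SEQUENCE_BITS
        if not 0 <= processor_id < self.num_processors:
            raise ValueError("transaction ID does not name a processor in this node")
        return processor_id

    def record_error(self, transaction_id: int, code: int) -> None:
        # An error is stored in the register of the processor that initiated the transaction.
        self.esr[self.processor_of(transaction_id)] = code

    def read(self, address: int, transaction_id: int) -> int:
        # Every processor reads the same address; the transaction ID picks the register.
        if address != ESR_ADDRESS:
            raise ValueError("not the error status register address")
        return self.esr[self.processor_of(transaction_id)]
```

In this model, a process does not need to know which processor it is running on: it always reads `ESR_ADDRESS`, and the interface resolves the per-processor register from the transaction ID that accompanies the read.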
21 Claims
1. A multiprocessing computer system comprising a plurality of processing nodes and a global bus network interconnecting said plurality of processing nodes, wherein a first node includes:
a plurality of processors;
a memory coupled to said plurality of processors through a local bus; and
a system interface for receiving transactions initiated by said plurality of processors on said local bus which are destined to remote nodes, wherein said system interface includes a plurality of error status registers configured to store information regarding errors associated with transactions conveyed upon said global bus network, wherein a separate error status register is provided for each of said plurality of processors, and wherein said system interface includes a request agent coupled to said plurality of error status registers.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11
12. A method for communicating errors in a multiprocessing system operating in a cluster mode including a plurality of processing nodes, said method comprising:
a first cluster node sending a request to a second cluster node;
said second cluster node replying to said first cluster node with an error; and
said first cluster node storing an indication of said error within a particular error status register, wherein said first cluster node includes a plurality of error status registers wherein a separate error status register is provided for each processor within said first cluster node, and wherein said system interface includes a request agent coupled to said plurality of error status registers, and wherein said particular error status register is dependent upon a particular processor within said first cluster node which initiated said request.
Dependent claims: 13, 14, 15, 16
a process executing on a first processor of said first cluster node initiating a read operation to a unique error status register address to determine the existence of an error; and
using said unique address in addition to a transaction ID which identifies said first processor as issuing said read operation to determine said particular error status register to read.
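The method steps above (send a request, receive an error reply, store the error for the initiating processor, then read it back) can be walked through in a minimal simulation. It is a sketch only: the classes `RemoteNode` and `InitiatingNode`, the `ClusterError` codes, and the permission check are hypothetical stand-ins, and the processor number is passed directly in place of a decoded transaction ID.

```python
from enum import Enum


class ClusterError(Enum):
    # Error kinds named in the abstract; the numeric codes are invented here.
    NONE = 0
    ACCESS_VIOLATION = 1
    TIME_OUT = 2


class RemoteNode:
    """Second cluster node: replies with an error if the access is not permitted."""

    def __init__(self, allowed_addresses):
        self.allowed = set(allowed_addresses)

    def handle(self, address):
        if address in self.allowed:
            return ClusterError.NONE
        return ClusterError.ACCESS_VIOLATION


class InitiatingNode:
    """First cluster node: keeps one error status register per local processor."""

    def __init__(self, num_processors):
        self.esr = [ClusterError.NONE] * num_processors

    def send_request(self, processor_id, remote, address):
        # Step 1: send the request. Step 2: the remote node replies.
        reply = remote.handle(address)
        # Step 3: store an error indication in the initiating processor's register.
        if reply is not ClusterError.NONE:
            self.esr[processor_id] = reply
        return reply

    def read_error_status(self, processor_id):
        # Stands in for the read to the unique address qualified by transaction ID.
        return self.esr[processor_id]
```

Because each processor has its own register, an error caused by a request from one processor is not visible when a process on another processor checks its status.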
17. A multiprocessing system including a plurality of processing nodes which operate in a cluster mode, said multiprocessing system comprising:
a first cluster node including means for sending a request to a second cluster node;
said second cluster node including means for replying to said first cluster node with an error; and
means for storing an indication of said error within a particular error status register;
wherein said first cluster node further includes a plurality of error status registers wherein a separate error status register is provided for each processor within said first cluster node, and wherein said first cluster node includes a request agent coupled to said plurality of error status registers, and wherein said particular error status register is dependent upon a particular processor within said first cluster node which initiated said request.
Dependent claims: 18, 19, 20, 21
a process executing on a first processor of said first cluster node, wherein said process initiates a read operation to a unique error status register address to determine the existence of an error; and
means for selecting a particular error status register to read depending upon said unique address in addition to a transaction ID which identifies said first processor as initiating said read operation.
Specification