Fault containment system for multiprocessor with shared memory
First Claim
1. In a digital data processing apparatus of the type havinga plurality of processor means for processing data, each processing unit including request means for generating a request signal representative of a request for access to a datum and for transmitting that request signal on a network means, a processor means generating such a request signal being referred to as a requestor,at least one memory means for storing data,said network means being coupled to said processor means and said memory means for transferring signals therebetween,the improvement whereinA. said memory means includes response means for responding to at least a selected request signal for accessing the requested datum and, absent fault, transmitting a response signal including that datum on the network means at least to the requestor,B. said response means includes fault containment means for responding to a fault in connection with at least one of accessing and transmitting the requested datum for generating said response signal to include a fault signal for preventing access by at least said requester to that datum.
1 Assignment
0 Petitions
Accused Products
Abstract
A multiprocessor system has a plurality of processing cells, each including a processor and memory, interconnected via a network. The memories respond to requests by the processors for accessing data and, absent fault, transmitting it in response packets to at least to the requesting processors. A fault containment element responds to at least certain faults during access or transmission of a datum for including within the respective response packet a fault signal that prevents the requestor from accessing the datum. If a fault is detected in a datum not previously detected as faulty, a marking element can include a "marked fault" signal in the response packet. Whereas, it can include an "unmarked fault" signal when it detects a fault associated with a requested datum, but not specifically isolated to that datum. When a request is made for a datum which had previously been detected as faulty, the marking element can include in the response packet a "descriptor fault" signal. This facilitates identification of a particular source of an error and prevents that error from propagating to other processing cells.
128 Citations
35 Claims
-
1. In a digital data processing apparatus of the type having
a plurality of processor means for processing data, each processing unit including request means for generating a request signal representative of a request for access to a datum and for transmitting that request signal on a network means, a processor means generating such a request signal being referred to as a requestor, at least one memory means for storing data, said network means being coupled to said processor means and said memory means for transferring signals therebetween, the improvement wherein A. said memory means includes response means for responding to at least a selected request signal for accessing the requested datum and, absent fault, transmitting a response signal including that datum on the network means at least to the requestor, B. said response means includes fault containment means for responding to a fault in connection with at least one of accessing and transmitting the requested datum for generating said response signal to include a fault signal for preventing access by at least said requester to that datum.
-
7. A digital data processing apparatus comprising
A. one or more processor means for generating requests for access to data stored in a plurality of memory means, each said memory means for storing data, B. at least a responding one of which memory means comprises means responsive to a selected request for accessing the requested datum in that memory means and for generating a digital signal response packet inclusive thereof, C. network means coupled to said plural memory means for transferring said digital signal packet therebetween, D. at least a target one of which memory means includes means normally responsive to a selected response packet for storing the datum contained therein, E. fault-detection means coupled to said memory means and to said network means for detecting at least a selected fault in connection with the access of said datum and the transfer thereof in said digital signal packet, and F. said fault-detection means including marking means responsive to such fault condition for including in that digital packet a signal representative thereof.
-
19. A method of operating a digital data processing apparatus of the type having
a plurality of processor means for processing data, each processing unit including request means for generating a request signal representative of a request for access to a datum and for transmitting that request signal on a network means, a processor means generating such a request signal being referred to as a requestor, at least one memory means for storing data, said network means being coupled to said processor means and said memory means for transferring signals therebetween, the method comprising the steps of A. responding to at least a selected request signal for accessing the requested datum in said memory means and, absent fault, transmitting a response signal including that datum on the network means at least to the requester, B. responding to a fault in connection with at least one of accessing and transmitting the requested datum for generating said response signal to include a fault signal for preventing access by at least said requester to that datum.
-
25. A method of operation a digital data processing apparatus comprising
A. providing one or more processor means for generating requests for access to data stored in a plurality of memory means, each said memory means for storing data, B. responding to a selected request for accessing a requested datum in a memory means and generating a digital signal response packet inclusive of that datum, C. transferring said digital signal packet between said plural memory means, D. normally responding to a selected response packet for storing the datum contained therein in another memory means, E. detecting at least a selected fault in connection with the access of said datum and the transfer thereof in said digital signal packet, and F. responding to such fault condition for including in that digital packet a signal representative thereof.
Specification