Computer system for preventing inter-node fault propagation
First Claim
1. A computer system in which a plurality of computers, each including a plurality of processors, are connected to each other, said computer system comprising:
- a system controller in each said computer for, at a time of a failure within the computer system, disconnecting its own computer from another computer in which said failure has occurred, without informing said its own processors of such failure, wherein said system controller includes an inter-computer read failure detector, and wherein the inter-computer read failure detector comprises:
an inter-computer read register registering an identification of an inter-computer read issued by the system controller for any of the computers;
an inter-computer read timer measuring an elapse of a pre-specified period of time from when an inter-computer read is issued;
a dummy data reply generator for, after a timeout condition upon the elapse of the pre-determined period of time, generating a pre-defined fixed value for the computer issuing an inter-computer read for use as a temporary reply to the read; and
a dummy reply timeout setting register for registering a time elapsed before the temporary reply to the read is returned.
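The detector elements recited above can be pictured with a small software model. The following is only a minimal sketch and not the claimed hardware: the class name `ReadFailureDetector`, the tick-based clock, and the value `0xDEADBEEF` are assumptions introduced for illustration. It shows a read being registered, timed against a configurable timeout, and answered with a fixed dummy value once the timeout elapses.

```python
from dataclasses import dataclass, field

DUMMY_VALUE = 0xDEADBEEF  # hypothetical pre-defined fixed value


@dataclass
class ReadFailureDetector:
    """Toy model of an inter-computer read failure detector (illustrative only)."""
    timeout_ticks: int                            # plays the role of the dummy reply timeout setting register
    pending: dict = field(default_factory=dict)   # plays the role of the read register: read id -> issue tick

    def issue_read(self, read_id: int, now: int) -> None:
        # Register the identification of the outstanding inter-computer read.
        self.pending[read_id] = now

    def poll(self, now: int) -> dict:
        """Return {read_id: dummy value} for every read whose timeout has elapsed."""
        expired = [rid for rid, issued in self.pending.items()
                   if now - issued >= self.timeout_ticks]
        replies = {}
        for rid in expired:
            del self.pending[rid]        # the read is no longer outstanding
            replies[rid] = DUMMY_VALUE   # temporary reply handed to the issuer
        return replies


detector = ReadFailureDetector(timeout_ticks=100)
detector.issue_read(7, now=0)
early = detector.poll(now=50)    # nothing has timed out yet
late = detector.poll(now=100)    # read 7 receives the dummy value
```

Because the issuer receives a value rather than an error in this model, its processors can keep running, which mirrors the stated aim of not informing them of the failure.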
Abstract
In a computer system in which computers, each having a plurality of processors, are connected to each other, each computer comprises a system controller that, at the time of a failure within the computer system, disconnects its own computer from the other computer in which the failure has occurred, without informing its own processors of the failure.
17 Claims
Claims 2 to 16 depend on claim 1.
17. A computer system in which computers, each including a plurality of processors, are connected to each other, said computer system comprising:
- in each computer, a system controller for, at a time of a failure within the computer system, disconnecting its own computer from another computer in which said failure has occurred, without informing its own processors of such failure, thereby allowing said its own processors to continue processing, wherein said system controller includes an inter-computer read failure detection means for detecting read failures between computers, the inter-computer read failure detection means comprising:
an inter-computer read registering means for registering an identification of an inter-computer read issued by the system controller;
an inter-computer read timer means for measuring an elapse of a pre-specified period of time from when an inter-computer read is issued;
a dummy data reply generating means for, after a timeout condition upon the elapse of the pre-determined period of time, generating a pre-defined fixed value for the computer issuing an inter-computer read for use as a temporary reply to the read;
a dummy reply timeout setting register means for registering a time elapsed before the temporary reply to the read is returned;
an inter-computer reply detection means for detecting that a read reply data has been returned successfully from any of the computers and instructing the inter-computer read registering means to remove the registration; and
a selector means for outputting selectively, one of the read reply data from any of the computers and the temporary reply to the read data, to the computer issuing the inter-computer read.
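The reply detection and selector elements of this claim can be sketched in the same spirit. This is an illustrative model under stated assumptions, not the patented circuit: `ReplyTracker`, `select_reply`, and the value `0xFFFFFFFF` are hypothetical names and constants, and the choice to drop a real reply that arrives after the temporary reply has been sent is an assumption the claim does not state.

```python
from typing import Optional

DUMMY_VALUE = 0xFFFFFFFF  # hypothetical pre-defined fixed value


def select_reply(real_data: Optional[int], timed_out: bool) -> int:
    """Selector: forward either the real reply data or the temporary reply."""
    if timed_out or real_data is None:
        return DUMMY_VALUE
    return real_data


class ReplyTracker:
    """Toy model of the registering and reply detection elements (illustrative only)."""

    def __init__(self) -> None:
        self.registered = set()   # ids of outstanding inter-computer reads

    def register(self, read_id: int) -> None:
        self.registered.add(read_id)

    def on_reply(self, read_id: int, data: int) -> Optional[int]:
        # Reply detection: a successful reply removes the registration.
        if read_id not in self.registered:
            return None   # assumption: a late reply is dropped
        self.registered.remove(read_id)
        return select_reply(data, timed_out=False)

    def on_timeout(self, read_id: int) -> Optional[int]:
        # Timeout: the issuer is answered with the temporary reply instead.
        if read_id not in self.registered:
            return None   # already answered by a real reply
        self.registered.remove(read_id)
        return select_reply(None, timed_out=True)


tracker = ReplyTracker()
tracker.register(1)
tracker.register(2)
good = tracker.on_reply(1, 42)    # real data is forwarded to the issuer
dummy = tracker.on_timeout(2)     # no reply arrived, so the dummy value is forwarded
```

Removing the registration on whichever event comes first means each read is answered exactly once in this model, either with real data or with the temporary reply.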
Specification