System and method for detecting errors using CPU signature
First Claim
1. A fault tolerant computer comprising:
- two or more cpusets, wherein each said cpuset has a microprocessor configured to perform one or more operations, wherein in normal operation, said microprocessors are configured to perform said one or more operations identically and synchronously, and wherein each said microprocessor is configured to generate a signature signal which corresponds to said one or more operations;
wherein each said signature signal is representative of a start of said one or more operations; and
one or more logic circuits coupled to said cpusets, wherein said one or more logic circuits are configured to receive said signature signals of said microprocessors and to determine when said signature signals differ, and wherein when said one or more logic circuits determine that said signature signals differ, said one or more logic circuits assert one or more error signals that cause one or more of said cpusets to enter an error state.
0 Assignments
0 Petitions
Accused Products
Abstract
A fault-tolerant computer architecture is described wherein the effect of hardware faults is diminished. The architecture employs a main data bus having a plurality of interface slots for interconnecting conventional computer sub-systems. The number and type of sub-systems may vary considerably, however, a central processor sub-system which encompasses the inventive elements of the invention is always included. The central processor sub-system employs a plurality of central processing modules operating in parallel in a substantially synchronized manner. One of the central processing modules operates as a master central processing module, and is the only module capable of reading data from and writing data to the main data bus. The master central processing module is initially chosen arbitrarily from among the central processing modules. Each central processing module contains a means by which the module can compare data on the main data bus with data on a secondary bus within each module in order to determine if there is an inconsistency indicating a hardware fault. If such an inconsistency is detected, each module generates state outputs which reflect the probability that a particular module is the source of the fault. A synchronization bus which is separate from the main data bus interconnects the central processing modules and transmits the state outputs from each module to every other central processing module.
-
Citations
22 Claims
-
1. A fault tolerant computer comprising:
-
two or more cpusets, wherein each said cpuset has a microprocessor configured to perform one or more operations, wherein in normal operation, said microprocessors are configured to perform said one or more operations identically and synchronously, and wherein each said microprocessor is configured to generate a signature signal which corresponds to said one or more operations;
wherein each said signature signal is representative of a start of said one or more operations; and
one or more logic circuits coupled to said cpusets, wherein said one or more logic circuits are configured to receive said signature signals of said microprocessors and to determine when said signature signals differ, and wherein when said one or more logic circuits determine that said signature signals differ, said one or more logic circuits assert one or more error signals that cause one or more of said cpusets to enter an error state. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method for determining whether two or more components in a computer system are operating synchronously, the method comprising:
-
generating a signal in each of said two or more components, said signal being representative of a start of an operation of said component;
comparing said signals generated by each of said components;
determining whether said signals generated by each of said components are identical; and
generating a response if said signals generated by each of said components are not identical.- View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A system comprising:
-
a first component configured to perform a function in said system and configured to generate a first signature signal representative of a start of said function by said first component;
a second component configured to perform said function and configured to generate a second signature signal representative of a start of said function by said second component; and
one or more logic circuits, wherein said one or more logic circuits are coupled to said first component and said second component, wherein said one or more logic circuits are configured to determine whether said first signature signal and said second signature signal are identical, and wherein said one or more logic circuits are configured to generate an error signal if said first signature signal and said second signature signal are not identical. - View Dependent Claims (16, 17, 18, 19, 20, 21, 22)
-
Specification