Fault resilient/fault tolerant computing
First Claim
Patent Images
1. A fault tolerant/fault resilient computer system, comprising:
- at least two compute elements; and
at least one controller connected to the at least two compute elements;
wherein;
each of the compute elements has clocks that operate asynchronously to clocks of the other compute elements, the compute elements each execute a stream of instructions, upon issuing an I/O request instruction from the stream of instructions, each compute element enters a halted state, and each compute element is removed from the halted state by an interrupt generated upon completion of the I/O request instruction.
0 Assignments
0 Petitions
Accused Products
Abstract
A fault tolerant/fault resilient computer system includes at least two compute elements connected to at least one controller. Each compute element has clocks that operate asynchronously to clocks of the other compute elements. The compute elements operate in a first mode in which the compute elements each execute a first stream of instructions in emulated clock lockstep, and in a second mode in which the compute elements each execute a second stream of instructions in instruction lockstep. Each compute element may be a multi-processor compute element.
-
Citations
14 Claims
-
1. A fault tolerant/fault resilient computer system, comprising:
-
at least two compute elements; and
at least one controller connected to the at least two compute elements;
wherein;
each of the compute elements has clocks that operate asynchronously to clocks of the other compute elements, the compute elements each execute a stream of instructions, upon issuing an I/O request instruction from the stream of instructions, each compute element enters a halted state, and each compute element is removed from the halted state by an interrupt generated upon completion of the I/O request instruction. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
copying contents of the memory of a first compute element to the memory of a second compute element; and
resetting the processors of the first and second compute elements without affecting the memories of the compute elements.
-
-
10. The computer system of claim 1, wherein the system is configured to redirect I/O operations by the compute elements to the controller.
-
11. The computer system of claim 1, further comprising a second controller connected to the first controller and to the at least two compute elements.
-
12. The computer system of claim 11, wherein the first controller and a first one of the compute elements are located in a first location and the second controller and a second one of the compute elements are located in a second location, and further comprising a communications link connecting the first controller to the second controller, the first controller to the second one of the compute elements, and the second controller to the first one of the compute elements.
-
13. The computer system of claim 12, wherein the first location is spaced from the second location by more than 5 meters.
-
14. The computer system of claim 12, wherein the first location is spaced from the second location by more than 100 meters.
Specification