Coordinated recalibration of high bandwidth memories in a multiprocessor computer
First Claim
1. A method for providing synchronized recalibration of hardware devices in electrical communication with a memory bus in a fault-tolerant computing environment, said method comprising the steps:
- (a) providing a computer having a first and second synchronized central processing units (CPUs) and first and second hardware devices having a recalibration procedure, wherein said first hardware device is associated with said first CPU and said second hardware device is associated with said second CPU;
(b) initiating said recalibration procedure in said first and second hardware devices after the passage of a deterministically-computed delay; and
(c) generating a maintenance clock signal with a period substantially equal to the duration between iterations of said recalibration procedures, wherein the maintenance clock signal is used to initiate the deterministically-computed delay of step (b).
20 Assignments
0 Petitions
Accused Products
Abstract
Methods and apparatus for implementing high-bandwidth memory subsystems in a multiprocessor computing environment. Each component in the memory subsystem has a recalibration procedure. The computer provides a low-frequency clock signal with a period substantially equal to the duration between recalibration cycles of the components of the memory subsystem. Transitions in the low-frequency clock signal initiate a deterministically-determined delay. Lapse of the delay in turn triggers the recalibration of the components of the memory subsystem, ensuring synchronous recalibration. Synchronizing the recalibration procedures minimizes the unavailability of the memory subsystems, consequently reducing voting errors between CPUs.
-
Citations
13 Claims
-
1. A method for providing synchronized recalibration of hardware devices in electrical communication with a memory bus in a fault-tolerant computing environment, said method comprising the steps:
-
(a) providing a computer having a first and second synchronized central processing units (CPUs) and first and second hardware devices having a recalibration procedure, wherein said first hardware device is associated with said first CPU and said second hardware device is associated with said second CPU;
(b) initiating said recalibration procedure in said first and second hardware devices after the passage of a deterministically-computed delay; and
(c) generating a maintenance clock signal with a period substantially equal to the duration between iterations of said recalibration procedures, wherein the maintenance clock signal is used to initiate the deterministically-computed delay of step (b). - View Dependent Claims (2, 3, 4, 5)
-
-
6. A method for providing synchronized recalibration of hardware devices in electrical communication with a memory bus in a fault-tolerant computing environment, said method comprising the steps:
-
(a) providing a computer having a first and second synchronized central processing units (CPUs) and first and second hardware devices having a recalibration procedure, wherein said first hardware device is associated with said first CPU and said second hardware device is associated with said second CPU;
(b) initiating said recalibration procedure in said first and second hardware devices after the passage of a deterministically-computed delay; and
(c) generating a system clock having a system clock period, wherein said deterministically-computed delay is an integer multiple of said system clock period.
-
-
7. A method for providing synchronized recalibration of hardware devices in electrical communication with a memory bus in a fault-tolerant computing environment, said method comprising the steps:
-
(a) providing a computer having a first and second synchronized central processing units (CPUs) and first and second hardware devices having a recalibration procedure, wherein said first hardware device is a first memory device connected to said first CPU through the memory bus and said second hardware device is a second memory device connected to said second CPU through the memory bust; and
(b) initiating said recalibration procedure in said first and second hardware devices after the passage of a deterministically-computed delay, wherein step (a) further comprises providing hardware devices that are RAMBUS memory controller hubs (MCH).
-
-
8. A method for providing synchronized recalibration of hardware devices in electrical communication with a memory bus in a fault-tolerant computing environment, said method comprising the steps:
-
(a) providing a computer having a first and second synchronized central processing units (CPUs) and first and second hardware devices having a recalibration procedure, wherein said first hardware device is a first memory device connected to said first CPU through the memory bus and said second hardware device is a second memory device connected to said second CPU through the memory bus; and
(b) initiating said recalibration procedure in said first and second hardware devices after the passage of a deterministically-computed delay, wherein step (a) further comprises providing hardware devices that are memory repeater hubs (MRH).
-
-
9. A fault-tolerant computer with synchronized memory recalibration, comprising:
-
a first central processing unit (CPU), a second CPU, in synchronized operation with said first CPU;
a first hardware device, in electrical communication with said first CPU through a memory bus and having a recalibration procedure;
a second hardware device, in electrical communication with said second CPU through a memory bus and having a recalibration procedure, wherein each of said first and second hardware devices is an integrated memory subsystem; and
a synchronizer in electrical communication with said hardware devices, operating to synchronize the execution of the recalibration procedure in said first hardware device with the execution of the recalibration procedure in said second hardware device, wherein said hardware devices are RAMBUS memory controller hubs (MCH).
-
-
10. A fault-tolerant computer with synchronized memory recalibration, comprising:
-
a first central processing unit (CPU), a second CPU, in synchronized operation with said first CPU;
a first hardware device, in electrical communication with said first CPU through a memory bus and having a recalibration procedure;
a second hardware device, in electrical communication with said second CPU through a memory bus and having a recalibration procedure, wherein each of said first and second hardware devices is an integrated memory subsystem; and
a synchronizer in electrical communication with said hardware devices, operating to synchronize the execution of the recalibration procedure in said first hardware device with the execution of the recalibration procedure in said second hardware device, wherein said hardware devices are memory repeater hubs (MRH).
-
-
11. A fault-tolerant computer with synchronized memory recalibration, comprising:
-
a first central processing unit (CPU), a second CPU, in synchronized operation with said first CPU;
a first hardware device, in electrical communication with said first CPU through a memory bus and having a recalibration procedure;
a second hardware device, in electrical communication with said second CPU through a memory bus and having a recalibration procedure;
a synchronizer in electrical communication with said hardware devices, operating to synchronize the execution of the recalibration procedure in said first hardware device with the execution of the recalibration procedure in said second hardware device; and
a clock generator in electrical communication with said synchronizer, receiving a system clock signal and generating a maintenance clock signal to initiate the recalibration procedure in said hardware devices.
-
-
12. A fault-tolerant computer with synchronized memory recalibration, comprising:
-
a first central processing unit (CPU), a second CPU, in synchronized operation with said first CPU;
a first hardware device, in electrical communication with said first CPU through a memory bus and having a recalibration procedure;
a second hardware device, in electrical communication with said second CPU through a memory bus and having a recalibration procedure;
a synchronizer in electrical communication with said hardware devices, operating to synchronize the execution of the recalibration procedure in said first hardware device with the execution of the recalibration procedure in said second hardware device; and
a temperature sensor in electrical communication with said synchronizer and thermally connected to first and second hardware devices, measuring the temperature of said hardware devices.
-
-
13. A fault-tolerant computer with synchronized memory recalibration, comprising:
-
a first central processing unit (CPU), a second CPU, in synchronized operation with said first CPU;
a first hardware device, in electrical communication with said first CPU through a memory bus and having a recalibration procedure;
a second hardware device, in electrical communication with said second CPU through a memory bus and having a recalibration procedure;
a synchronizer in electrical communication with said hardware devices, operating to synchronize the execution of the recalibration procedure in said first hardware device with the execution of the recalibration procedure in said second hardware device; and
a current sensor in electrical communication with said synchronizer and first and second hardware devices, measuring the output current of said hardware devices.
-
Specification