SYSTEM AND METHOD FOR PROVIDING A HIGH FAULT TOLERANT MEMORY SYSTEM
First Claim
1. A memory system comprising:
- a memory controller;
a plurality of memory modules in communication with the memory controller and with a plurality of memory devices, the plurality of memory devices including at least one spare memory device for providing memory device sparing capability; and
a mechanism for detecting that one of the memory modules has failed possibly coincident with a memory device failure on an other of the memory modules, and for allowing the memory system to continue to run unimpaired in the presence of the memory module failure and the possible memory device failure.
2 Assignments
0 Petitions
Accused Products
Abstract
A system and method for providing a high fault tolerant memory system. The system includes a memory system having a memory controller, a plurality of memory modules and a mechanism. The plurality of memory modules are in communication with the memory controller and with a plurality of memory devices. The plurality of memory devices include at least one spare memory device for providing memory device sparing capability. The mechanism is for detecting that one of the memory modules has failed possibly coincident with a memory device failure on an other of the memory modules. The mechanism allows the memory system to continue to run unimpaired in the presence of the memory module failure and the possible memory device failure.
-
Citations
20 Claims
-
1. A memory system comprising:
-
a memory controller; a plurality of memory modules in communication with the memory controller and with a plurality of memory devices, the plurality of memory devices including at least one spare memory device for providing memory device sparing capability; and a mechanism for detecting that one of the memory modules has failed possibly coincident with a memory device failure on an other of the memory modules, and for allowing the memory system to continue to run unimpaired in the presence of the memory module failure and the possible memory device failure. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A memory controller comprising:
-
an interface to a plurality of memory modules, the modules in communication with a plurality of memory devices, the plurality of memory devices including at least one spare memory device for providing memory device sparing capability; and a mechanism for detecting that one of the memory modules has failed possibly coincident with a memory device failure on an other of the memory modules, and for allowing the memory system to continue to run unimpaired in the presence of the memory module failure and the possible memory device failure. - View Dependent Claims (11, 12, 13, 14, 15)
-
-
16. A method for detecting and correcting errors in a memory system, the method comprising:
-
determining if only a single symbol error exists in the memory system, the memory system including a plurality of memory modules having memory devices including at least one spare memory device for providing memory device sparing capability, the memory modules accessed in unison in response to memory commands; correcting the single symbol error in response to determining that the single symbol error is the only error that exists in the memory system; correcting for a possible memory module failure using the RAID-3 error correction code in response to determining that the error is not a single symbol error; and correcting for errors remaining in the memory system after the correcting for the possible memory module failure. - View Dependent Claims (17, 18, 19, 20)
-
Specification