Homogeneous recovery in a redundant memory system
First Claim
1. A computer implemented method for performing recovery, the method comprising:
- receiving a notification that a memory channel has failed, the memory channel one of a plurality of memory channels in a memory system;
preventing new operations from starting on the memory channels based on the notification;
shutting down the failing memory channel and ignoring any pending stores and fetches in the failing memory channel;
based on shutting down the failing memory channel, completing any in-progress fetches only on a portion of the memory channels, the portion of the memory channels consisting of non-failing channels;
performing a recovery operation on the memory channels based on the completing of any in-progress fetches, the recovery operation comprising resetting the memory channels and performing data calibration on at least a second subset of the memory channels in the memory system, wherein any other memory channels in the memory system that are not in the second subset are idle during the performing of the data calibration;
retrying any pending stores that were issued prior to receiving notification of the failing memory channel after performing the recovery operation; and
allowing the new operations to start on at least a first subset of the memory channels based on completion of the recovery operation, the memory system configured to operate with the first subset of the memory channels.
7 Assignments
0 Petitions
Accused Products
Abstract
A computer implemented method for providing homogeneous recovery in a redundant memory system. The method includes receiving a notification that a memory channel has failed, where the memory channel is one of a plurality of memory channels in a memory system. New operations are blocked from starting on the memory channels in response to the notification, and any pending operations on the memory channels are completed in response to the notification. A recovery operation is performed on the memory channels in response to the completing. The new operations are started on at least a first subset of the memory channels in response to the recovery operation completing. The memory system is configured to operate with the first subset of the memory channels.
-
Citations
6 Claims
-
1. A computer implemented method for performing recovery, the method comprising:
-
receiving a notification that a memory channel has failed, the memory channel one of a plurality of memory channels in a memory system; preventing new operations from starting on the memory channels based on the notification; shutting down the failing memory channel and ignoring any pending stores and fetches in the failing memory channel; based on shutting down the failing memory channel, completing any in-progress fetches only on a portion of the memory channels, the portion of the memory channels consisting of non-failing channels; performing a recovery operation on the memory channels based on the completing of any in-progress fetches, the recovery operation comprising resetting the memory channels and performing data calibration on at least a second subset of the memory channels in the memory system, wherein any other memory channels in the memory system that are not in the second subset are idle during the performing of the data calibration; retrying any pending stores that were issued prior to receiving notification of the failing memory channel after performing the recovery operation; and allowing the new operations to start on at least a first subset of the memory channels based on completion of the recovery operation, the memory system configured to operate with the first subset of the memory channels. - View Dependent Claims (2, 3, 4, 5, 6)
-
Specification