NUMA system with redundant main memory architecture
First Claim
1. A method of managing data in a multi-node data processing system having N nodes, wherein each node includes a corresponding main memory, comprising:
- storing new data in a first node main memory of the data processing system;
calculating parity information based on the new data and data stored in the main memory of at least one other node;
storing the parity information in the main memory of a node other than the first node.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and system for managing data in a data processing system are disclosed. Initially, data is stored in a first portion of the main memory of the system. Responsive to storing the data in the first portion of main memory, information is then stored in a second portion of the main memory. The information stored in the second portion of main memory is indicative of the data stored in the first portion. In an embodiment in which the data processing system is implemented as a multi-node system such as a NUMA system, the first portion of the main memory is in the main memory of a first node of system and the second portion of the main memory is in the main memory of a second node of the system. In one embodiment, storing information in the second portion of the main memory is achieved by storing a copy of the data in the second portion. If a fault in the first portion of the main memory is detected, the information in the second main memory portion is retrieved and stored to a persistent storage device. In another embodiment, storing information in the second portion of the main memory includes calculating a value based on the corresponding contents of other portions of the main memory using an algorithm such as checksum, parity, or ECC, and storing the calculated value in the second portion. In one embodiment, the main memory of at least one of the nodes is connectable to a persistent source of power, such as a battery, such that the main memory contents may be preserved if system power is disabled.
1400 Citations
13 Claims
-
1. A method of managing data in a multi-node data processing system having N nodes, wherein each node includes a corresponding main memory, comprising:
-
storing new data in a first node main memory of the data processing system;
calculating parity information based on the new data and data stored in the main memory of at least one other node;
storing the parity information in the main memory of a node other than the first node. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A data processing system comprising:
-
a first node including a first node main memory and a first processor;
a second node including a second node main memory and a second processor, a third node including a third node main memory and a third processor;
a fabric connecting the first, second, and third nodes; and
a persistent storage device;
wherein the system is configured, upon storing new data in the first node main memory, to store error correction data in the main memory of at least one of the other nodes wherein the error correction data is determined based, at least in part, upon the new data and the data stored in the main memory of the remaining node. - View Dependent Claims (9, 10, 11)
-
-
12. A multi-node, non-uniform memory architecture (NUMA) data processing system, wherein each node includes a processor and a corresponding main memory, comprising:
-
means for dividing the main memory of each node into N segments, where N is the number of nodes in to system;
means for allocating one segment of each node for storing error correction data, wherein the error correction data is indicative of data in the corresponding segments of the other main memories;
means, responsive to storing new data in a first segment of a first node main memory for determining error correction information based on the new data; and
means for staring the determined error correction information in the first segment of another node. - View Dependent Claims (13)
-
Specification