Affinity-aware parallel zeroing of memory for initialization of large pages in non-uniform memory access (NUMA) servers
Abstract
A method for zeroing memory in computing systems where access to memory is non-uniform includes receiving, via a system call, a request for a pool of memory, determining a size of the requested pool of memory, and creating a dummy memory segment. The size of the dummy memory segment is larger than the size of the requested pool of memory. The dummy memory segment is filled with one or more pages, based on the determined size of the requested pool of memory, and the dummy memory segment is deleted.
10 Claims
1. A method, comprising:
receiving, via a system call, a request for a pool of memory;
determining a size of the requested pool of memory;
creating a dummy memory segment, wherein a size of the dummy memory segment is larger than the size of the requested pool of memory;
filling the dummy memory segment with one or more pages, based on the determined size of the requested pool of memory; and
deleting the dummy memory segment, wherein deleting the dummy memory segment comprises:
    sorting the one or more pages of the dummy memory segment according to each associated affinity domain of each page, wherein each affinity domain comprises a cluster of processors and memory local to the cluster of processors;
    sending requests to zero the sorted one or more pages to one or more software threads that are attached to the respective affinity domain; and
    waiting for a message, from the software threads in each affinity domain, indicating that all requests to zero have been processed before returning to a system caller of the system call.
Dependent claims: 2, 3, 4, 5
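The deletion step of claim 1 (sort pages by affinity domain, hand zeroing requests to per-domain threads, wait for every domain to report completion) can be illustrated with a minimal user-space sketch. It is not the patented implementation: `Page`, `domain_worker`, `delete_segment`, and `PAGE_SIZE` are hypothetical names, the pages are ordinary `bytearray` buffers, and Python threads here are not actually bound to NUMA domains, so only the control flow is modeled.

```python
import threading
from collections import defaultdict
from dataclasses import dataclass, field
from queue import Queue

PAGE_SIZE = 4096  # illustrative only; real large pages are far bigger


@dataclass
class Page:
    domain: int  # affinity domain assumed to own this page's memory
    data: bytearray = field(
        default_factory=lambda: bytearray(b"\xff" * PAGE_SIZE)
    )


def domain_worker(requests: Queue, done: Queue, domain: int) -> None:
    """Stand-in for a software thread attached to one affinity domain."""
    while True:
        page = requests.get()
        if page is None:  # sentinel: no more zeroing requests
            break
        page.data[:] = bytes(len(page.data))  # zero the page in place
    done.put(domain)  # message: all requests for this domain are processed


def delete_segment(pages: list[Page]) -> None:
    """Zero every page, returning only after all domains report completion."""
    # Sort the pages and group them by affinity domain.
    by_domain: dict[int, list[Page]] = defaultdict(list)
    for page in sorted(pages, key=lambda p: p.domain):
        by_domain[page.domain].append(page)

    done: Queue = Queue()
    workers = []
    for domain, group in by_domain.items():
        requests: Queue = Queue()
        worker = threading.Thread(
            target=domain_worker, args=(requests, done, domain)
        )
        worker.start()
        workers.append(worker)
        for page in group:  # send one zeroing request per page
            requests.put(page)
        requests.put(None)

    # Wait for one completion message per affinity domain before returning.
    for _ in by_domain:
        done.get()
    for worker in workers:
        worker.join()
```

In this sketch the caller blocks inside `delete_segment` until every domain has reported, mirroring the claim's requirement that control returns to the system caller only after all zeroing requests are processed.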
6. A method, comprising:
receiving, via a system call, a request for a pool of memory;
determining a size of the requested pool of memory;
creating a dummy memory segment, wherein a size of the dummy memory segment is larger than the size of the requested pool of memory;
filling the dummy memory segment with one or more pages, based on the determined size of the requested pool of memory; and
deleting the dummy memory segment, wherein deleting the dummy memory segment comprises:
    transmitting a request to zero the one or more pages in the dummy memory segment to an intermediate software thread;
    using the intermediate software thread to perform the request as a background process, comprising:
        sorting the one or more pages of the dummy memory segment according to each associated affinity domain of each page, each affinity domain comprising a cluster of processors and memory local to the cluster of processors;
        sending requests to zero the sorted one or more pages to one or more additional software threads that are attached to the respective affinity domain; and
        receiving, from each of the one or more additional software threads, notifications that the requests to zero the sorted one or more pages have been processed; and
    upon receiving a message from the intermediate software thread, returning to a system caller, while performing the request, via the intermediate software thread, continues in the background.
Dependent claims: 7, 8, 9, 10
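Claim 6 differs from claim 1 in that an intermediate thread takes over the sorting and dispatching, so the caller can return as soon as that thread sends a message. The following is a rough sketch of that control flow only, under several assumptions: all function names are hypothetical, pages are `(domain, bytearray)` pairs, the "message" is modeled as an acknowledgement placed on a queue, and the returned `threading.Event` is an added convenience for observing completion that the claim does not describe.

```python
import threading
from collections import defaultdict
from queue import Queue

PAGE_SIZE = 4096  # illustrative page size


def zero_domain(pages: list, notify: Queue, domain: int) -> None:
    """Stand-in for an additional thread attached to one affinity domain."""
    for page in pages:
        page[:] = bytes(len(page))  # zero the page in place
    notify.put(domain)  # notification: this domain's requests are processed


def intermediate(request: Queue, ack: Queue, finished: threading.Event) -> None:
    """Intermediate thread: acknowledge the request, then zero in background."""
    tagged_pages = request.get()  # list of (domain, page) pairs
    ack.put("accepted")           # message that lets the caller return

    # Sort the pages and group them by affinity domain.
    by_domain = defaultdict(list)
    for domain, page in sorted(tagged_pages, key=lambda dp: dp[0]):
        by_domain[domain].append(page)

    notify: Queue = Queue()
    threads = [
        threading.Thread(target=zero_domain, args=(group, notify, domain))
        for domain, group in by_domain.items()
    ]
    for thread in threads:
        thread.start()
    for _ in threads:
        notify.get()  # collect one notification per domain
    for thread in threads:
        thread.join()
    finished.set()  # assumption: completion hook, not part of the claim


def delete_segment_async(tagged_pages: list) -> threading.Event:
    """Hand the zeroing request off and return once it is acknowledged."""
    request, ack = Queue(), Queue()
    finished = threading.Event()
    threading.Thread(
        target=intermediate, args=(request, ack, finished), daemon=True
    ).start()
    request.put(tagged_pages)
    ack.get()        # wait only for the intermediate thread's message
    return finished  # zeroing may still be in progress at this point
```

A caller that later needs the pages to be clean could call `finished.wait()` on the returned event; in the sketch, the original caller is free to continue as soon as `delete_segment_async` returns.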
Specification