Managing sparse directory evictions in multiprocessor systems via memory locking
First Claim
1. A computer system, comprising a plurality of processor clusters, each cluster including a plurality of local nodes and a cache coherence controller interconnected by a local point-to-point architecture, the computer system further comprising memory corresponding to a global memory space, each cluster corresponding to a contiguous portion of the global memory space, selected ones of the plurality of local nodes in each cluster having a memory controller associated therewith, each memory controller in each cluster being responsible for a memory range within the corresponding contiguous portion of the global memory space, the cache coherence controller in each cluster having a cache coherence directory associated therewith, entries in the cache coherence directory in each cluster corresponding to memory lines within the corresponding contiguous portion of the global memory space that are cached in remote clusters, the cache coherence controller being operable to facilitate an eviction of a first one of the entries corresponding to a copy of a first memory line by sending a lock request identifying the first memory line to a first one of the memory controllers corresponding to the first memory line.
11 Assignments
0 Petitions
Accused Products
Abstract
Cache coherence directory eviction mechanisms are described for use in computer systems having a plurality of multiprocessor clusters. Interaction among the clusters is facilitated by a cache coherence controller in each cluster. A cache coherence directory is associated with each cache coherence controller identifying memory lines associated with the local cluster that are cached in remote clusters. Techniques are provided for managing eviction of entries in the cache coherence directory by locking memory lines in a home cluster without causing a memory controller to generate probes to processors in the home cluster.
60 Citations
30 Claims
- 1. A computer system, comprising a plurality of processor clusters, each cluster including a plurality of local nodes and a cache coherence controller interconnected by a local point-to-point architecture, the computer system further comprising memory corresponding to a global memory space, each cluster corresponding to a contiguous portion of the global memory space, selected ones of the plurality of local nodes in each cluster having a memory controller associated therewith, each memory controller in each cluster being responsible for a memory range within the corresponding contiguous portion of the global memory space, the cache coherence controller in each cluster having a cache coherence directory associated therewith, entries in the cache coherence directory in each cluster corresponding to memory lines within the corresponding contiguous portion of the global memory space that are cached in remote clusters, the cache coherence controller being operable to facilitate an eviction of a first one of the entries corresponding to a copy of a first memory line by sending a lock request identifying the first memory line to a first one of the memory controllers corresponding to the first memory line.
- 15. A cache coherence controller for use in a computer system comprising a plurality of processor clusters, each cluster including a plurality of local nodes and an instance of the cache coherence controller interconnected by a local point-to-point architecture, the computer system further comprising memory corresponding to a global memory space, each cluster corresponding to a contiguous portion of the global memory space, selected ones of the plurality of local nodes in each cluster having a memory controller associated therewith, each memory controller in each cluster being responsible for a memory range within the corresponding contiguous portion of the global memory space, the cache coherence controller including a cache coherence directory, entries in the cache coherence directory in each cluster corresponding to memory lines within the corresponding contiguous portion of the global memory space which are cached in remote clusters, the cache coherence controller being operable to facilitate an eviction of a first one of the entries corresponding to a copy of a first memory line by sending a lock request identifying the first memory line to a first one of the memory controllers corresponding to the first memory line.
-
24. A computer-implemented method for evicting entries in a cache coherence directory, the cache coherence directory being associated with a computer system comprising a plurality of processor clusters, each cluster including a plurality of local nodes and a cache coherence controller interconnected by a local point-to-point architecture, the computer system further comprising memory corresponding to a global memory space, each cluster corresponding to a contiguous portion of the global memory space, selected ones of the plurality of local nodes in each cluster having a memory controller associated therewith, each memory controller in each cluster being responsible for a memory range within the corresponding contiguous portion of the global memory space, the cache coherence controller in a first cluster having the cache coherence directory associated therewith, entries in the cache coherence directory corresponding to memory lines within the contiguous portion of the global memory space corresponding to the first cluster which are cached in remote clusters, the method comprising:
-
determining that a first one of the entries in the cache coherence directory should be evicted, the first entry corresponding to a copy of a first memory line;
generating a lock request identifying the first memory line directed to a first one of the memory controllers corresponding to the first memory line;
in response to the lock request, locking the first memory line and generating a locked response to the cache coherence controller in the first cluster;
invalidating the first entry in the cache coherence directory in response to the locked response received by the cache coherence controller;
sending a probe to a remote cluster having at least one copy of the first memory line; and
invalidating all copies of the first memory line in cache memories of nodes in the remote cluster. - View Dependent Claims (25, 26, 27, 28, 29, 30)
-
Specification