Multiprocessor system having distributed shared memory and instruction scheduling method used in the same system

US 6,892,280 B2
Filed: 06/18/2002
Issued: 05/10/2005
Est. Priority Date: 11/07/2001
Status: Expired due to Term

First Claim

Patent Images

1. A multiprocessor system having distributed shared memory, comprising a plurality of nodes connected via an inter-node interface, each of the nodes including at least one processor, a main storage controller, and a main storage belonging to the distributed shared memory with an individual memory address, whereineach processor comprises instruction executing means for executing an NUMA prefetch instruction, the instruction executing means includes an address determiner which terminates the processing of the NUMA prefect instruction without conducting a prefetch operation if an address specified in the NUMA prefetch instruction is associated with a local node, to which the processor belongs, and which issues, only if the address is associated with a remote node other than the local node, a prefetch request to a main storage controller of the local node.

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

In multiprocessing system executing processing called NUMA prefetch, when a prefetch instruction is issued to a prefetch unit, an address converter converts an address specified by an operand of the instruction into a physical address. A prefetch type determiner determines whether the instruction is an NUMA prefetch instruction or a conventional perfect prefetch instruction. If the instruction is an NUMA prefetch instruction, an address determiner determines whether the physical address is a local address or a remote address. If the address is a local address, the processing of the prefetch instruction is terminated. If the address is a remote address, a cache tag checker checks a cache. When cache hit occurs, the processing is terminated. When cache mishit occurs, a prefetch request is issued to a main storage controller. As a result, data is prefetched from a remote main storage to a cache in a local main storage.

Citations

4 Claims

1. A multiprocessor system having distributed shared memory, comprising a plurality of nodes connected via an inter-node interface, each of the nodes including at least one processor, a main storage controller, and a main storage belonging to the distributed shared memory with an individual memory address, whereineach processor comprises instruction executing means for executing an NUMA prefetch instruction, the instruction executing means includes an address determiner which terminates the processing of the NUMA prefect instruction without conducting a prefetch operation if an address specified in the NUMA prefetch instruction is associated with a local node, to which the processor belongs, and which issues, only if the address is associated with a remote node other than the local node, a prefetch request to a main storage controller of the local node.
- View Dependent Claims (2)
- - 2. A multiprocessor system having distributed shared memory according to claim 1, wherein the main storage controller of the local node comprises means for reading, when the prefetch request is received, data via a main storage controller of the remote node from a main storage of the remote node, the data being at the address specified in the NUMA prefetch instruction and for storing the data in a cache disposed in a main storage of the local node.

3. An instruction scheduling method for use in a multiprocessor system having distributed shared memory and comprising a plurality of nodes connected via an inter-node interface, each of the nodes including at least one processor each of which includes a first level cache, a main storage controller, a main storage belonging to the distributed shared memory with an individual memory address, and a second level cache, the method being used in each of the processor, whereineach of the processor issues a first prefetch instruction specifying an address block containing data to be used by the processor, the first prefetch instruction indicating that when the address specifies a local node to which the processor belongs, the processing of the first prefetch instruction is terminated without conducting a prefetch operation, and only when the address specifies a remote node other than the local note, the data is prefetched from the remote node to the second level cache of the local node, and the processor then issues a second prefetch instruction indicating an address block equal to the address block specified in the first prefetch instruction, the second prefetch instruction indicating that the data is prefetched to the first level cache.

4. An instruction scheduling method for use in a multiprocessor system having distributed shared memory and comprising a plurality of nodes connected via an inter-node interface, the node including one or more processors each of which includes a first level cache, a main storage controller, a main storage, and a second level cache, the method being used in each of the processors,wherein each of the processors issues a first prefetch instruction specifying an address block containing data to be used by the processor, the first prefetch instruction indicating that when the address specifies a local node, a prefetch operation is not conducted, and only when the address specifies a remote node, the data is prefetched from the remote node to the second level cache of the local node, wherein the processor then issues a second prefetch instruction indicating an address block equal to the address block specified in the first prefetch instruction, the second prefetch instruction indicating that the data is prefetched to the first level cache, and wherein the processor issues, after the first prefetch instruction for the address block, the second prefetch instruction for the address block when a period of time fully lapses for termination of execution the first prefetch instruction.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Hitachi, Ltd.
Original Assignee
Hitachi, Ltd.
Inventors
Nakamura, Takaki
Primary Examiner(s)
Padmanabhan, Mano
Assistant Examiner(s)
HO, THANG H

Application Number

US10/173,105
Publication Number

US 20030088636A1
Time in Patent Office

1,057 Days
Field of Search

711/137, 711/145, 711/119, 711147-148, 707/207
US Class Current

711/137
CPC Class Codes

G06F 12/0813   with a network or matrix co...

G06F 12/0862   with prefetch

G06F 2212/2542   Non-uniform memory access [...

G06F 2212/6028   Prefetching based on hints ...

G06F 9/383   Operand prefetching cache p...

Multiprocessor system having distributed shared memory and instruction scheduling method used in the same system

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

Citations

4 Claims

Specification

Solutions

Use Cases

Quick Links

Multiprocessor system having distributed shared memory and instruction scheduling method used in the same system

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

4 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links