
Apparatus and method for memory-hierarchy aware producer-consumer instruction

  • US 9,990,287 B2
  • Filed: 12/21/2011
  • Issued: 06/05/2018
  • Est. Priority Date: 12/21/2011
  • Status: Active Grant
First Claim

1. A method for transferring a chunk of data from a core of a central processing unit (CPU) to a graphics processing unit (GPU), comprising:

  • executing a first instruction, the first instruction being a single instruction, wherein the first instruction comprises a MovNonAllocate store instruction, the executing comprising:

    responsive to the first instruction, writing data, without caching the data, to a buffer within the core of the CPU until a designated amount of data has been written, wherein the buffer combines multiple stores until the designated amount of data has been written, and

    upon detecting that the designated amount of data has been written, responsively generating an eviction cycle, the eviction cycle causing the data to be transferred from the buffer to a cache shared by both the core and the GPU, wherein the cache is a level 3 cache;

    setting an indication to indicate to the GPU that data is available in the cache; and

    upon the GPU detecting the indication, providing the data to the GPU from the cache upon receipt of a read signal from the GPU.
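
The data flow recited in claim 1 can be illustrated with a small software model. The following is only a sketch under stated assumptions, not the patented hardware or any real instruction set interface: the class and function names (`SharedL3Cache`, `CoreCombiningBuffer`, `mov_non_allocate`, `produce`, `gpu_read`), the 64-byte chunk size, and the 8-byte store width are all hypothetical, since the claim only refers to "a designated amount of data".

```python
from dataclasses import dataclass, field

# Assumed "designated amount" in bytes; the claim does not state a size.
CHUNK_SIZE = 64


@dataclass
class SharedL3Cache:
    """Stand-in for the level 3 cache shared by the CPU core and the GPU."""
    lines: list = field(default_factory=list)
    data_ready: bool = False  # the "indication" the GPU checks


@dataclass
class CoreCombiningBuffer:
    """Stand-in for the buffer inside the core that combines stores."""
    l3: SharedL3Cache
    chunk_size: int = CHUNK_SIZE
    pending: bytearray = field(default_factory=bytearray)

    def mov_non_allocate(self, data: bytes) -> None:
        """Model one store: data goes to the buffer, never to a core-level cache."""
        self.pending.extend(data)
        # Once the designated amount has accumulated, generate an eviction.
        while len(self.pending) >= self.chunk_size:
            self._evict()

    def _evict(self) -> None:
        """Model the eviction cycle: move one full chunk into the shared L3."""
        chunk = bytes(self.pending[:self.chunk_size])
        del self.pending[:self.chunk_size]
        self.l3.lines.append(chunk)


def produce(buffer: CoreCombiningBuffer, payload: bytes, store_width: int = 8) -> None:
    """Producer side: issue stores, then set the indication if data reached the L3."""
    for offset in range(0, len(payload), store_width):
        buffer.mov_non_allocate(payload[offset:offset + store_width])
    if buffer.l3.lines:
        buffer.l3.data_ready = True


def gpu_read(l3: SharedL3Cache):
    """Consumer side: return one chunk from the L3 only if the indication is set."""
    if not l3.data_ready:
        return None
    chunk = l3.lines.pop(0)
    l3.data_ready = bool(l3.lines)
    return chunk
```

In this model, a payload shorter than `CHUNK_SIZE` stays in the combining buffer and never becomes visible to `gpu_read`, which mirrors the claim's condition that the eviction cycle is generated only after the designated amount of data has been written.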
