×

Method and system for converting a single-threaded software program into an application-specific supercomputer

  • US 10,642,588 B2
  • Filed: 10/22/2018
  • Issued: 05/05/2020
  • Est. Priority Date: 11/15/2011
  • Status: Active Grant
First Claim
Patent Images

1. A directory-mapped coherent cache memory hierarchy with a write-update protocol of a partitioned application-specific supercomputer comprising a memory including a plurality of ports and a cache, where:

  • a. whenever a store instruction is executed to a shared line in the cache, all copies of the shared line in other caches are automatically updated with the same store instruction;

    b. the automatic update is accomplished with message communication, over a plurality of scalable networks, only among caches having the shared line, and a directory unit responsible for the shared line in the caches, therefore reducing communication overhead; and

    c. a compiler guarantees, by introducing synchronization actions, that any two load/store instructions, one of which is a store instruction, which can simultaneously arrive at a cache line, are independent, therefore allowing re-ordering of load/store instructions in the plurality of scalable networks, and simplifying cache hardware;

    where the compiler automatically translates a single-threaded software program code fragment into the partitioned application-specific supercomputer functionally equivalent to the single-threaded software program code fragment, in part by creating one or more customized coherent cache memory hierarchies with the write-update protocol, and where each among the one or more customized coherent cache memory hierarchies with the write-update protocol has a minimum number of ports and data width per port for reducing memory area and power consumption.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×