Processors, methods, and systems with a configurable spatial accelerator

  • US 10,515,046 B2
  • Filed: 07/01/2017
  • Issued: 12/24/2019
  • Est. Priority Date: 07/01/2017
  • Status: Active Grant
First Claim

1. An apparatus comprising:

  • a first tile and a second tile, each comprising a plurality of processing elements and an interconnect network between the plurality of processing elements to receive an input of a dataflow graph comprising a plurality of nodes, wherein the dataflow graph is to be overlaid into the interconnect network and the plurality of processing elements of the first tile and the second tile with each node represented as a dataflow operator in the interconnect network and the plurality of processing elements of the first tile or the second tile, and the plurality of processing elements of the first tile and the second tile are to perform an operation when an incoming operand set arrives at the plurality of processing elements of the first tile and the second tile;

    a synchronizer circuit coupled between the interconnect network of the first tile and the interconnect network of the second tile and comprising storage to store data to be sent between the interconnect network of the first tile and the interconnect network of the second tile, the synchronizer circuit to convert the data from the storage between a first voltage or a first frequency of the first tile and a second voltage or a second frequency of the second tile to generate converted data, and send the converted data between the interconnect network of the first tile and the interconnect network of the second tile; and

    one of:

    a second synchronizer circuit coupled between the interconnect network of the first tile and the interconnect network of the second tile and comprising storage to store second data to be sent from the interconnect network of the second tile into the interconnect network of the first tile, the second synchronizer circuit to convert the second data from the storage from the second voltage or the second frequency of the second tile to the first voltage or the first frequency of the first tile to generate second converted data, and send the second converted data into the interconnect network of the first tile, wherein the synchronizer circuit is coupled between the interconnect network of the first tile and the interconnect network of the second tile and comprises storage to store data to be sent from the interconnect network of the first tile into the interconnect network of the second tile, the synchronizer circuit to convert the data from the storage from the first voltage or the first frequency of the first tile to the second voltage or the second frequency of the second tile to generate the converted data, and send the converted data into the interconnect network of the second tile, or

    wherein the synchronizer circuit is to send a backpressure signal from a downstream processing element of the second tile to a processing element of the first tile to stall execution of the processing element of the first tile, wherein the backpressure signal indicates that storage in the downstream processing element is not available for an output of the processing element.
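The interaction between the synchronizer storage and the backpressure signal recited in the claim can be illustrated with a small software model. This is only a sketch under stated assumptions and is not taken from the patent: the two tiles' frequencies are approximated by integer tick periods, voltage conversion is not modeled, and the names `SynchronizerFifo`, `simulate`, `producer_period`, and `consumer_period` are hypothetical.

```python
from collections import deque


class SynchronizerFifo:
    """Bounded buffer standing in for the synchronizer circuit's storage (hypothetical model)."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.slots = deque()

    def has_space(self):
        # When this is False, the model treats it as backpressure toward the first tile.
        return len(self.slots) < self.capacity

    def is_empty(self):
        return not self.slots

    def push(self, value):
        if not self.has_space():
            raise RuntimeError("push into full synchronizer storage")
        self.slots.append(value)

    def pop(self):
        return self.slots.popleft()


def simulate(values, capacity, producer_period, consumer_period, max_ticks=1000):
    """Move values from a first tile to a second tile through the buffer.

    The first tile tries to send one value every producer_period ticks and the
    second tile accepts one value every consumer_period ticks. Returns the
    values received, in order, and the number of ticks the producer stalled.
    """
    fifo = SynchronizerFifo(capacity)
    pending = deque(values)
    received = []
    stalls = 0
    for tick in range(max_ticks):
        if len(received) == len(values):
            break
        # Second tile's clock domain: take data out of the storage if any is present.
        if tick % consumer_period == 0 and not fifo.is_empty():
            received.append(fifo.pop())
        # First tile's clock domain: send only when storage is available, else stall.
        if pending and tick % producer_period == 0:
            if fifo.has_space():
                fifo.push(pending.popleft())
            else:
                stalls += 1
    return received, stalls


if __name__ == "__main__":
    data, stall_count = simulate([1, 2, 3, 4, 5], capacity=2,
                                 producer_period=1, consumer_period=3)
    print(data, stall_count)
```

In this model a fast producer paired with a slow consumer fills the storage and stalls, while the reverse pairing never stalls. In both cases the values arrive in the order they were sent.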
