PROCESSORS, METHODS, AND SYSTEMS WITH A CONFIGURABLE SPATIAL ACCELERATOR
First Claim
1. An apparatus comprising:
- a first tile and a second tile, each comprising a plurality of processing elements and an interconnect network between the plurality of processing elements to receive an input of a dataflow graph comprising a plurality of nodes, wherein the dataflow graph is to be overlaid into the interconnect network and the plurality of processing elements of the first tile and the second tile with each node represented as a dataflow operator in the interconnect network and the plurality of processing elements of the first tile or the second tile, and the plurality of processing elements of the first tile and the second tile are to perform an operation when an incoming operand set arrives at the plurality of processing elements of the first tile and the second tile; and
a synchronizer circuit coupled between the interconnect network of the first tile and the interconnect network of the second tile and comprising storage to store data to be sent between the interconnect network of the first tile and the interconnect network of the second tile, the synchronizer circuit to convert the data from the storage between a first voltage or a first frequency of the first tile and a second voltage or a second frequency of the second tile to generate converted data, and send the converted data between the interconnect network of the first tile and the interconnect network of the second tile.
1 Assignment
0 Petitions
Accused Products
Abstract
Systems, methods, and apparatuses relating to a configurable spatial accelerator are described. In one embodiment, a processor includes a synchronizer circuit coupled between an interconnect network of a first tile and an interconnect network of a second tile and comprising storage to store data to be sent between the interconnect network of the first tile and the interconnect network of the second tile, the synchronizer circuit to convert the data from the storage between a first voltage or a first frequency of the first tile and a second voltage or a second frequency of the second tile to generate converted data, and send the converted data between the interconnect network of the first tile and the interconnect network of the second tile
73 Citations
24 Claims
-
1. An apparatus comprising:
-
a first tile and a second tile, each comprising a plurality of processing elements and an interconnect network between the plurality of processing elements to receive an input of a dataflow graph comprising a plurality of nodes, wherein the dataflow graph is to be overlaid into the interconnect network and the plurality of processing elements of the first tile and the second tile with each node represented as a dataflow operator in the interconnect network and the plurality of processing elements of the first tile or the second tile, and the plurality of processing elements of the first tile and the second tile are to perform an operation when an incoming operand set arrives at the plurality of processing elements of the first tile and the second tile; and a synchronizer circuit coupled between the interconnect network of the first tile and the interconnect network of the second tile and comprising storage to store data to be sent between the interconnect network of the first tile and the interconnect network of the second tile, the synchronizer circuit to convert the data from the storage between a first voltage or a first frequency of the first tile and a second voltage or a second frequency of the second tile to generate converted data, and send the converted data between the interconnect network of the first tile and the interconnect network of the second tile. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A method comprising:
-
providing a first tile and a second tile, each comprising a plurality of processing elements and an interconnect network between the plurality of processing elements, having a dataflow graph comprising a plurality of nodes overlaid into the first tile and the second tile, with each node represented as a dataflow operator in the interconnect network and the plurality of processing elements of the first tile or the second tile; storing data to be sent between the interconnect network of the first tile and the interconnect network of the second tile in storage with a synchronizer circuit coupled between the interconnect network of the first tile and the interconnect network of the second tile; converting the data from the storage between a first voltage or a first frequency of the first tile and a second voltage or a second frequency of the second tile to generate converted data with the synchronizer circuit; and sending the converted data with the synchronizer circuit between the interconnect network of the first tile and the interconnect network of the second tile. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. An apparatus comprising:
-
a first data path network between a plurality of processing elements in a first tile; a second data path network between a plurality of processing elements in a second tile; a first flow control path network between the plurality of processing elements of the first tile; a second flow control path network between the plurality of processing elements of the second tile, the first data path network, the second data path network, the first flow control path network, and the second flow control path network are to receive an input of a dataflow graph comprising a plurality of nodes, the dataflow graph is to be overlaid into the first data path network, the second data path network, the first flow control path network, the second flow control path network, the plurality of processing elements of the first tile, and the plurality of processing elements of the second tile with each node represented as a dataflow operator in the plurality of processing elements of the first tile or the plurality of processing elements of the second tile to perform an operation by a respective, incoming operand set arriving at each of the dataflow operators of the plurality of processing elements of the first tile, and the plurality of processing elements of the second tile; and a synchronizer circuit coupled between the first data path network of the first tile and the second data path network of the second tile, and comprising storage to store data to be sent between the first data path network of the first tile and the second data path network of the second tile, the synchronizer circuit to convert the data from the storage between a first voltage or a first frequency of the first tile and a second voltage or a second frequency of the second tile to generate converted data, and send the converted data between the first data path network of the first tile and the second data path network of the second tile. - View Dependent Claims (14, 15, 16, 17, 18)
-
-
19. A method comprising:
-
providing a first tile and a second tile having a dataflow graph comprising a plurality of nodes overlaid into a first data path network between a plurality of processing elements in the first tile, a second data path network between a plurality of processing elements in the second tile, a first flow control path network between the plurality of processing elements of the first tile, a second flow control path network between the plurality of processing elements of the second tile, the plurality of processing elements of the first tile, and the plurality of processing elements of the second tile with each node represented as a dataflow operator in the plurality of processing elements of the first tile or the plurality of processing elements of the second tile; storing data to be sent between the first data path network of the first tile and the second data path network of the second tile in storage with a synchronizer circuit coupled between the first data path network of the first tile and the second data path network of the second tile; converting the data from the storage between a first voltage or a first frequency of the first tile and a second voltage or a second frequency of the second tile to generate converted data with the synchronizer circuit; and sending the converted data with the synchronizer circuit between the first data path network of the first tile and the second data path network of the second tile. - View Dependent Claims (20, 21, 22, 23, 24)
-
Specification