PROCESSORS, METHODS, AND SYSTEMS FOR A CONFIGURABLE SPATIAL ACCELERATOR WITH TRANSACTIONAL AND REPLAY FEATURES
1 Assignment
0 Petitions
Accused Products
Abstract
Systems, methods, and apparatuses relating to a configurable spatial accelerator are described. In an embodiment, a processor includes a plurality of processing elements; and an interconnect network between the plurality of processing elements to receive an input of a dataflow graph comprising a plurality of nodes, wherein the dataflow graph is to be overlaid into the interconnect network and the plurality of processing elements with each node represented as a dataflow operator in the plurality of processing elements, and the plurality of processing elements are to perform an atomic operation when an incoming operand set arrives at the plurality of processing elements.
-
Citations
20 Claims
-
1. (canceled)
-
2. A processor comprising:
-
a plurality of processing elements; an interconnect network between the plurality of processing elements to receive an input of a dataflow graph comprising a plurality of nodes, wherein the dataflow graph is to be overlaid into the interconnect network and the plurality of processing elements with each node represented as a dataflow operator in the plurality of processing elements, the plurality of processing elements is to perform an operation when an incoming operand set arrives at the plurality of processing elements; and a transaction controller to group a plurality of memory accesses associated with the operation.
-
-
3. A processor comprising:
-
a plurality of processing elements; an interconnect network between the plurality of processing elements to receive an input of a dataflow graph comprising a plurality of nodes, wherein the dataflow graph is to be overlaid into the interconnect network and the plurality of processing elements with each node represented as a dataflow operator in the plurality of processing elements, the plurality of processing elements is to perform an operation when an incoming operand set arrives at the plurality of processing elements; and a transaction controller to group a plurality of memory accesses associated with the operation, wherein the transaction controller is to group the plurality of memory accesses into a transaction, by marking, with a transaction identifier, a cache line to be modified by the operation. - View Dependent Claims (4, 5, 6, 7, 8, 9, 10)
-
-
11. A processor comprising:
-
a plurality of processing elements; an interconnect network between the plurality of processing elements to receive an input of a dataflow graph comprising a plurality of nodes, wherein the dataflow graph is to be overlaid into the interconnect network and the plurality of processing elements with each node represented as a dataflow operator in the plurality of processing elements, the plurality of processing elements is to perform an operation when an incoming operand set arrives at the plurality of processing elements; and a cache, the cache to be included in a memory subsystem, the memory subsystem also to include a memory in which to store a plurality of old data values to replay execution from start of an epoch, the epoch to include the operation. - View Dependent Claims (12, 13, 14)
-
-
15. A method comprising:
-
receiving an input of a dataflow graph comprising a plurality of nodes; overlaying the dataflow graph into a plurality of processing elements of the processor and an interconnect network between the plurality of processing elements of the processor with each node represented as a dataflow operator in the plurality of processing elements; performing an operation of the dataflow graph with the interconnect network and the plurality of processing elements when an incoming operand set arrives the plurality of processing elements; and preserving a plurality of old data values in a memory during an epoch, the epoch including writing a new data value from one of the plurality of processing elements, the new value corresponding one of the plurality of old data values. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification