PROCESSORS AND METHODS FOR CONFIGURABLE CLOCK GATING IN A SPATIAL ARRAY
First Claim
1. A processor comprising:
- a plurality of processing elements;
an interconnect network between the plurality of processing elements to receive an input of a dataflow graph comprising a plurality of nodes, wherein the dataflow graph is to be overlaid into the interconnect network and the plurality of processing elements with each node represented as a dataflow operator in the interconnect network and the plurality of processing elements, and the plurality of processing elements is to perform an operation when an incoming operand set arrives at the plurality of processing elements; and
a configuration controller coupled to the plurality of processing elements to configure the plurality of processing elements according to configuration information for the dataflow graph, and clock gate at least one clocked component of a processing element based on the configuration information.
1 Assignment
0 Petitions
Accused Products
Abstract
Methods and apparatuses relating to configurable clock gating in spatial arrays are described. In one embodiment, a processor includes processing elements; an interconnect network between the processing elements; and a configuration controller, coupled to a first processing element and a second processing element of the plurality of processing elements and the first processing element having an output coupled to an input of the second processing element, to configure the second processing element to clock gate at least one clocked component of the second processing element, and configure the first processing element to send a reenable signal on the interconnect network to the second processing element to reenable the at least one clocked component of the second processing element when data is to be sent from the first processing element to the second processing element.
-
Citations
24 Claims
-
1. A processor comprising:
-
a plurality of processing elements; an interconnect network between the plurality of processing elements to receive an input of a dataflow graph comprising a plurality of nodes, wherein the dataflow graph is to be overlaid into the interconnect network and the plurality of processing elements with each node represented as a dataflow operator in the interconnect network and the plurality of processing elements, and the plurality of processing elements is to perform an operation when an incoming operand set arrives at the plurality of processing elements; and a configuration controller coupled to the plurality of processing elements to configure the plurality of processing elements according to configuration information for the dataflow graph, and clock gate at least one clocked component of a processing element based on the configuration information. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A method comprising:
-
configuring, with a configuration controller of a processor, a plurality of processing elements of the processor according to configuration information for a dataflow graph, wherein the processor comprises the plurality of processing elements and an interconnect network between the plurality of processing elements, and has the dataflow graph comprising a plurality of nodes overlaid into the plurality of processing elements of the processor and the interconnect network between the plurality of processing elements of the processor with each node represented as a dataflow operator in the interconnect network and the plurality of processing elements; clock gating, with the configuration controller of the processor, at least one clocked component of a processing element based on the configuration information for the dataflow graph; and performing an operation of the dataflow graph with the interconnect network and the plurality of processing elements when an incoming operand set arrives at the plurality of processing elements. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A processor comprising:
-
a plurality of processing elements; an interconnect network between the plurality of processing elements to receive an input of a dataflow graph comprising a plurality of nodes, wherein the dataflow graph is to be overlaid into the interconnect network and the plurality of processing elements with each node represented as a dataflow operator in the interconnect network and the plurality of processing elements, and the plurality of processing elements is to perform an operation when an incoming operand set arrives at the plurality of processing elements; and a configuration controller, coupled to a first processing element and a second processing element of the plurality of processing elements and the first processing element having an output coupled to an input of the second processing element, to configure the second processing element to clock gate at least one clocked component of the second processing element, and configure the first processing element to send a reenable signal on the interconnect network to the second processing element to reenable the at least one clocked component of the second processing element when data is to be sent from the first processing element to the second processing element. - View Dependent Claims (14, 15, 16, 17, 18)
-
-
19. A method comprising:
-
configuring, with a configuration controller of a processor coupled to a first processing element and a second processing element of a plurality of processing elements and the first processing element having an output coupled to an input of the second processing element, the second processing element to clock gate at least one clocked component of the second processing element, wherein the processor comprises the plurality of processing elements and an interconnect network between the plurality of processing elements, and has a dataflow graph comprising a plurality of nodes overlaid into the plurality of processing elements of the processor and the interconnect network between the plurality of processing elements of the processor with each node represented as a dataflow operator in the interconnect network and the plurality of processing elements; configuring, with the configuration controller, the first processing element to send a reenable signal on the interconnect network to the second processing element to reenable the at least one clocked component of the second processing element when data is to be sent from the first processing element to the second processing element; clock gating, with the configuration controller of the processor, the at least one clocked component of the second processing element; sending, with the first processing element, a reenable signal on the interconnect network to the second processing element to reenable the at least one clocked component of the second processing element when the data is sent from the first processing element to the second processing element; and performing an operation of the dataflow graph with the second processing element when an incoming operand set including the data arrives at the second processing element. - View Dependent Claims (20, 21, 22, 23, 24)
-
Specification