Optimization of loops and data flow sections in multi-core processor environment
First Claim
Patent Images
1. A method for executing a program on a processor the method comprising:
- scheduling by a scheduler available processing hardware resources for executing at least one thread on at least one of a plurality of cores included in the processor,each core having a multi-dimensional array of execution units and a register file, wherein the at least one thread is included in a plurality of threads partitioned from the program;
mapping the at least one thread for execution onto the hardware resources allocated by the scheduler;
configuring the interconnection between the execution units and the register file within one or more of the plurality of cores; and
creating one or more threads by the processor itself;
wherein creating one or more threads by the processor itself includes analyzing code by the processor.
1 Assignment
0 Petitions
Accused Products
Abstract
The present invention relates to a method for compiling code for a multi-core processor, comprising: detecting and optimizing a loop, partitioning the loop into partitions executable and mappable on physical hardware with optimal instruction level parallelism, optimizing the loop iterations and/or loop counter for ideal mapping on hardware, chaining the loop partitions generating a list representing the execution sequence of the partitions.
65 Citations
7 Claims
-
1. A method for executing a program on a processor the method comprising:
-
scheduling by a scheduler available processing hardware resources for executing at least one thread on at least one of a plurality of cores included in the processor, each core having a multi-dimensional array of execution units and a register file, wherein the at least one thread is included in a plurality of threads partitioned from the program; mapping the at least one thread for execution onto the hardware resources allocated by the scheduler; configuring the interconnection between the execution units and the register file within one or more of the plurality of cores; and creating one or more threads by the processor itself; wherein creating one or more threads by the processor itself includes analyzing code by the processor. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
Specification