Assembly code performance evaluation apparatus and method
First Claim
Patent Images
1. An assembly code performance evaluation apparatus, comprising:
- a host computer having processing circuitry, memory and a host compiler, the host compiler operative to execute the program using test sequences and generate dynamic information;
a target digital signal processor (DSP) compiler communicating with the processing circuitry; and
a performance estimation program implemented on the host processing circuitry and operative to annotate application source code, and generate an estimation of an optimized assembly code;
wherein the performance estimation program includes a plurality of rewriting rules that are applied to an RTL intermediate representation and wherein one of the plurality of rewriting rules is operative to remove a shift operation by 0, 1, or −
1 following a multiplication operation.
3 Assignments
0 Petitions
Accused Products
Abstract
An assembly code performance evaluation apparatus is provided which includes a host computer, a target digital signal processor (DSP) compiler, and a performance estimation program. The host computer includes processing circuitry, memory and a host compiler to use test sequences and generate dynamic information. The target digital signal processor compiler communicates with the processing circuitry. The performance estimation program is implemented on the host processing circuitry and is operative to annotate application source code and to generate an estimation of an optimized assembly code. A method is also provided.
84 Citations
17 Claims
-
1. An assembly code performance evaluation apparatus, comprising:
-
a host computer having processing circuitry, memory and a host compiler, the host compiler operative to execute the program using test sequences and generate dynamic information;
a target digital signal processor (DSP) compiler communicating with the processing circuitry; and
a performance estimation program implemented on the host processing circuitry and operative to annotate application source code, and generate an estimation of an optimized assembly code;
wherein the performance estimation program includes a plurality of rewriting rules that are applied to an RTL intermediate representation and wherein one of the plurality of rewriting rules is operative to remove a shift operation by 0, 1, or −
1 following a multiplication operation.- View Dependent Claims (2, 3, 4, 5)
-
-
6. A method of evaluating performance of assembly code, comprising:
-
providing a target compiler;
providing a C code annotated with dynamic information based upon execution of the C code with test sequences on a host computer;
generating an RTL intermediate representation of the application;
defining a set of rewriting rules;
applying the set of rewriting rules to the RTL intermediate representation; and
generating an estimation of an optimized assembly code;
wherein one of the rewriting rules is operative to remove a shift operation by 0, 1, or −
1 following a multiplication operation.- View Dependent Claims (7, 8, 9, 10, 11, 12, 13, 14, 15)
-
-
16. A method of evaluating performance of assembly code, comprising:
-
providing a target compiler;
providing a C code annotated with dynamic information based upon execution of the C code with test sequences on a host computer;
generating an RTL intermediate representation of the application;
defining a set of rewriting rules;
applying the set of rewriting rules to the RTL intermediate representation; and
generating an estimation of an optimized assembly code;
wherein one of the rewriting rules is operative to delete a shift operation by +/−
16 after a memory read or write operation using an indirect addressing mode.
-
-
17. A method of evaluating performance of assembly code, comprising:
-
providing a target compiler;
providing a C code annotated with dynamic information based upon execution of the C code with test sequences on a host computer;
generating an RTL intermediate representation of the application;
defining a set of rewriting rules;
applying the set of rewriting rules to the RTL intermediate representation; and
generating an estimation of an optimized assembly code;
wherein one of the rewriting rules is operative to remove a comparison operation with 0 before a branch operation.
-
Specification