COMPUTER ACCELERATOR SYSTEM WITH IMPROVED EFFICIENCY
First Claim
Patent Images
1. A computer comprising:
- a first processor communicating with an external memory and including circuitry to provide execution of a first set of standard computer instructions and circuitry for an exchange of data with the external memory;
a second processor communicating with the first processor including circuitry to provide execution of a second set of accelerator computer instructions providing the execution of functions at an accelerated rate compared to the execution of those functions on the first processor; and
a third processor communicating with the first processor and the second processor and including circuitry to provide for the execution of a third set of memory access instructions, the third processor operating to receive the memory access instructions from the first processor to exchange data between the second processor and external memory via the third processor according to those memory access instructions during operation of the second processor.
2 Assignments
0 Petitions
Accused Products
Abstract
A specialized memory access processor is placed between a main processor and accelerator hardware to handle memory access for the accelerator hardware. The architecture of the memory access processor is designed to allow lower energy memory accesses than can be obtained by the main processor in providing data to the hardware accelerator while providing the hardware accelerator with a sufficiently high bandwidth memory channel. In some embodiments, the main processor may enter a sleep state during accelerator calculations to substantially lower energy consumption.
51 Citations
20 Claims
-
1. A computer comprising:
-
a first processor communicating with an external memory and including circuitry to provide execution of a first set of standard computer instructions and circuitry for an exchange of data with the external memory; a second processor communicating with the first processor including circuitry to provide execution of a second set of accelerator computer instructions providing the execution of functions at an accelerated rate compared to the execution of those functions on the first processor; and a third processor communicating with the first processor and the second processor and including circuitry to provide for the execution of a third set of memory access instructions, the third processor operating to receive the memory access instructions from the first processor to exchange data between the second processor and external memory via the third processor according to those memory access instructions during operation of the second processor. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A method of executing a program using a computer having:
-
a first processor communicating with an external memory and including circuitry to provide execution of a first set of standard computer instructions and circuitry for an exchange of data with the external memory; a second processor communicating with the first processor including circuitry to provide execution of a second set of accelerator computer instructions providing the execution of functions at an accelerated rate compared to the execution of those functions on the first processor; and a third processor communicating with the first processor and the second processor and including circuitry to provide for the execution of a third set of memory access instructions, the third processor receiving the memory access instructions from the first processor to exchange data between the second processor and external memory via the third processor according to those memory access instructions during operation of the second processor; the method comprising the steps of; (a) executing a program by the first processor to a beginning of an acceleration region of the program where faster execution could be provided by the second processor; (b) providing memory access instructions to the third processor for accessing memory for the second processor for execution of the acceleration region; and (c) executing the acceleration region by the second and third processor and not by the first processor. - View Dependent Claims (15, 16, 17, 18, 19, 20)
-
Specification