×

Method and apparatus for hardware-accelerated machine learning

  • US 10,846,624 B2
  • Filed: 02/19/2020
  • Issued: 11/24/2020
  • Est. Priority Date: 12/22/2016
  • Status: Active Grant
First Claim
Patent Images

1. A machine-learning apparatus comprising:

  • a feature extractor for a convolutional neural network, wherein the feature extractor is deployed on a member of the group consisting of (1) a reconfigurable logic device, (2) a graphics processing unit (GPU), and (3) a chip multi-processor (CMP), wherein the member comprises a plurality of data processing engines arranged as a multi-functional pipeline through which data is streamed, the pipelined data processing engines configured for operation in parallel with each other;

    each pipelined data processing engine being configured to (1) receive streaming data and perform a processing operation on the received streaming data, and (2) be responsive to a control instruction that defines whether that pipelined data processing engine is an activated data processing engine or a deactivated data processing engine, wherein an activated data processing engine is configured to perform its processing operation on streaming data received thereby, and wherein a deactivated data processing engine remains in the pipeline but does not perform its processing operation on streaming data received thereby, the multi-functional pipeline thereby being configured to provide a plurality of different pipeline functions in response to control instructions that are configured to selectively activate and deactivate the pipelined data processing engines, each pipeline function being the combined functionality of each activated pipelined data processing engine in the pipeline at a given time;

    wherein each of a plurality of the data processing engines is configured as a convolution engine that convolves first data with second data via correlation logic;

    wherein each of another plurality of the data processing engines is configured as a data reduction engine that performs a data reduction operation on data received thereby; and

    wherein the multi-functional pipeline is configured to activate a plurality of the convolution engines and a plurality of the data reduction engines at the same time in response to control instructions in order to configure the multi-functional pipeline as the feature extractor for the convolutional neural network.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×