×

Efficient data layouts for convolutional neural networks

  • US 10,489,680 B2
  • Filed: 10/03/2017
  • Issued: 11/26/2019
  • Est. Priority Date: 10/04/2016
  • Status: Active Grant
First Claim
Patent Images

1. A system for executing a convolutional neural network (CNN), the system comprising:

  • non-transitory memory configured to store;

    a convolutional layer of a convolutional neural network,wherein the convolutional layer comprises kernels in a kernel stack,wherein the kernels of the kernel stack are in a basic kernel layout,wherein weight values of the kernels of the kernel stack are reordered from the basic kernel layout into a tile kernel layout comprising a plurality of kernel tiles,wherein a kernel tile comprises a plurality of kernel runnels, andwherein a kernel runnel comprises a number of the weight values of the kernels of the kernel stack; and

    a hardware processor in communication with the non-transitory memory, the hardware processor programmed by executable instructions to;

    receive input activation maps of the convolutional layer, wherein the input activation maps are in a basic input activation map layout;

    reorder pixel values of the input activation maps from the basic input activation map layout into an interleaved input activation map layout comprising a plurality of clusters of input activation map pixels; and

    determine output activation maps of the convolutional layer from the plurality of kernel tiles and the plurality of clusters of input activation map pixels,wherein the output activation maps are in an interleaved output activation map layout comprising a plurality of clusters output activation map pixels.

View all claims
  • 3 Assignments
Timeline View
Assignment View
    ×
    ×