NEURAL NETWORK PROCESSOR BASED ON APPLICATION SPECIFIC SYNTHESIS SPECIALIZATION PARAMETERS

  • US 20190325296A1
  • Filed: 04/21/2018
  • Published: 10/24/2019
  • Est. Priority Date: 04/21/2018
  • Status: Active Grant
First Claim

1. A method, implemented by a processor, for synthesizing a neural network processor comprising a plurality of tile engines, wherein each of the plurality of tile engines is configured to process matrix elements and vector elements, the method comprising:

  • using the processor, analyzing a neural network model corresponding to an application to determine:

    (1) a first minimum number of units required to express a shared exponent value required to satisfy a first precision requirement corresponding to each of the matrix elements and corresponding to each of the vector elements, (2) a second minimum number of units required to express a first mantissa value required to satisfy a second precision requirement corresponding to the each of the matrix elements, and (3) a third minimum number of units required to express a second mantissa value required to satisfy a third precision requirement corresponding to the each of the vector elements;

    obtaining code representative of at least a portion of at least one hardware node for implementing the neural network processor;

    obtaining a synthesis model comprising a plurality of synthesis specialization parameters including:

    (1) a first synthesis specialization parameter corresponding to a first native dimension of the each of the matrix elements, (2) a second synthesis specialization parameter corresponding to a second native dimension of the each of the vector elements, and (3) a third synthesis specialization parameter corresponding to a number of the plurality of tile engines, wherein each of a first value corresponding to the first synthesis specialization parameter, a second value corresponding to the second synthesis specialization parameter, and a third value corresponding to the third synthesis specialization parameter is selected to meet or exceed a performance metric associated with the at least one hardware node; and

    using the processor, modifying the code, based on at least the first minimum number of units, the second minimum number of units, the third minimum number of units and at least the first value and the second value, to generate a modified version of the code and storing a modified version of the code.
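
The analysis step of the claim (finding the minimum number of units for a shared exponent and for the matrix and vector mantissas) can be illustrated with a small sketch. The claim does not say how the analysis is performed, so everything below is an assumption made for illustration: the "units" are treated as bits, the precision requirement is modeled as a maximum error relative to the largest magnitude in a block, and the function names `quantize_block`, `min_mantissa_bits`, and `min_exponent_bits` are hypothetical.

```python
import math


def quantize_block(values, mantissa_bits):
    """Round a block of floats to a shared-exponent format and back.

    Assumed format: one exponent shared by the block, and for each
    element a sign plus `mantissa_bits` magnitude bits.
    Returns (dequantized_values, shared_exponent).
    """
    max_abs = max(abs(v) for v in values)
    if max_abs == 0.0:
        return [0.0] * len(values), 0
    # frexp gives max_abs = m * 2**e with 0.5 <= m < 1, so every
    # element of the block has magnitude below 2**e.
    shared_exp = math.frexp(max_abs)[1]
    scale = 2.0 ** (shared_exp - mantissa_bits)
    limit = (1 << mantissa_bits) - 1  # largest representable magnitude
    dequantized = []
    for v in values:
        q = round(v / scale)
        q = max(-limit, min(limit, q))  # clamp to the mantissa range
        dequantized.append(q * scale)
    return dequantized, shared_exp


def min_mantissa_bits(values, max_rel_error, max_bits=23):
    """Smallest mantissa width whose worst-case error meets the tolerance.

    The error is measured relative to the largest magnitude in the block
    (an assumed stand-in for the claim's precision requirement).
    Returns None if no width up to `max_bits` is sufficient.
    """
    max_abs = max(abs(v) for v in values)
    if max_abs == 0.0:
        return 1
    for bits in range(1, max_bits + 1):
        approx, _ = quantize_block(values, bits)
        worst = max(abs(v - a) for v, a in zip(values, approx))
        if worst / max_abs <= max_rel_error:
            return bits
    return None


def min_exponent_bits(blocks):
    """Bits needed to index every shared exponent seen across the blocks."""
    exps = [quantize_block(block, 1)[1] for block in blocks]
    span = max(exps) - min(exps)  # exponents stored as offsets 0..span
    return max(1, span.bit_length())


if __name__ == "__main__":
    # Hypothetical sample data standing in for matrix and vector elements.
    matrix_sample = [0.5, -0.25, 0.125, 0.75]
    vector_sample = [4.0, 1.0, -2.0, 3.0]
    print("matrix mantissa bits:", min_mantissa_bits(matrix_sample, 0.01))
    print("vector mantissa bits:", min_mantissa_bits(vector_sample, 0.2))
    print("shared exponent bits:",
          min_exponent_bits([matrix_sample, vector_sample]))
```

In this sketch the matrix and vector samples are given separate tolerances, mirroring the claim's separate second and third precision requirements. The resulting widths are the kind of values that a later step could substitute into the hardware description code; that substitution is not shown here.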
