×

Machine learning classification on hardware accelerators with stacked memory

  • US 10,452,995 B2
  • Filed: 06/29/2015
  • Issued: 10/22/2019
  • Est. Priority Date: 06/29/2015
  • Status: Active Grant
First Claim
Patent Images

1. A method for processing on an acceleration component a machine learning classification model comprising a plurality of decision trees, the decision trees comprising a first amount of decision tree data, the acceleration component comprising an acceleration component die and a memory stack disposed in an integrated circuit package, the memory stack comprising an acceleration component memory having a second amount of memory less than the first amount of decision tree data, the memory stack comprising a memory bandwidth greater than 50 GB/sec and a power efficiency of greater than 20 MB/sec/mW, the method comprising:

  • slicing the model into a plurality of model slices, each of the model slices having a third amount of decision tree data less than or equal to the second amount of memory;

    storing the plurality of model slices on the memory stack;

    copying a first model slice to the acceleration component memory;

    processing the first model slice using a set of input data on the acceleration component to produce a first slice result;

    selecting, based at least in part on the first slice result, a second model slice; and

    repeating the copying and the processing for the second model slice;

    wherein the selecting of the second model slice results in a third model slice not being processed.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×