×

Low Latency Long Short-Term Memory Inference with Sequence Interleaving

  • US 20200134432A1
  • Filed: 10/31/2018
  • Published: 04/30/2020
  • Est. Priority Date: 10/31/2018
  • Status: Active Grant
First Claim
Patent Images

1. A system comprising:

  • a processing unit;

    a machine learning engine comprising a plurality of matrix multiplication units and one or more long short-term memory (LSTM) layers; and

    a memory coupled to the processing unit and the machine learning engine;

    wherein the processing unit is configured to;

    detect, in the memory, a plurality of sequences that will be processed by the machine learning engine; and

    interleave the plurality of sequences together into data blocks, wherein each data block comprises samples from the plurality of sequences;

    wherein the machine learning engine is configured to;

    receive a given data block;

    perform, in parallel, a plurality of matrix multiplication operations on a plurality of sequences from the given data block and a plurality of coefficients; and

    convey outputs from the plurality of matrix multiplication units to the one or more LSTM layers.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×