×

Low rank matrix compression

  • US 11,037,330 B2
  • Filed: 04/08/2017
  • Issued: 06/15/2021
  • Est. Priority Date: 04/08/2017
  • Status: Active Grant
First Claim
Patent Images

1. A general purpose graphics processor comprising:

  • an instruction cache to receive a stream of instructions;

    an instruction unit to execute the stream of instructions;

    a general-purpose graphics processing compute block comprising a plurality of graphics processing cores;

    a shared memory communicatively coupled to the plurality of graphics processing cores; and

    a processor to;

    apply a matrix interpolation operation to one or more linearly dependent rows of a matrix comprising weights of a neural network;

    apply a singular value decomposition algorithm to convert one or more weights of one or more linearly dependent rows of the matrix to a low rank;

    characterize one or more rows of the matrix comprising weights of a neural network for which a rank of the one or more rows of the matrix is less than a threshold value as independent rows of the matrix;

    determine a scalar associated with each of the one or more independent rows of the matrix;

    encode a plurality of the one or more independent rows with the scalar associated with the row to generate encoded weight data;

    apply delta compression to compress the encoded weight data;

    store the encoded weight data in the shared memory; and

    load the matrix into the neural network using hardware when the rank is beneath a threshold.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×