PROCESSING METHOD AND ACCELERATING DEVICE

  • US 20200134460A1
  • Filed: 11/28/2019
  • Published: 04/30/2020
  • Est. Priority Date: 05/23/2017
  • Status: Active Application
First Claim

1. A data compression method, comprising:

  • performing coarse-grained pruning on weights of a neural network, which includes:

    selecting M weights from the neural network through a sliding window, and setting all or part of the M weights to 0 when the M weights meet a preset condition, where M is a positive integer greater than 0;

    performing a first retraining on the neural network, where any weight that has been set to 0 remains 0 during the retraining process; and

    quantizing the weights of the neural network, which includes:

    grouping the weights of the neural network;

    performing a clustering operation on each group of weights by using a clustering algorithm, computing a center weight of each class, and replacing all the weights in each class with the center weight of that class.
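
The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the applicant's implementation, and several details are assumptions made only for illustration: the "preset condition" is taken to be a mean absolute value below a threshold, the sliding window moves in non-overlapping steps over a flat weight list, groups are contiguous chunks, the clustering algorithm is a simple one-dimensional k-means, and the center weight is the class mean. The function names (coarse_prune, masked_update, kmeans_1d, quantize) are hypothetical.

```python
from statistics import mean


def coarse_prune(weights, window, threshold):
    """Zero a whole window of M = `window` weights when the assumed preset
    condition (mean absolute value below `threshold`) holds.
    Returns the pruned weights and a 0/1 mask marking surviving positions."""
    pruned = list(weights)
    mask = [1] * len(weights)
    # Assumption: the window slides with a stride equal to its size.
    for start in range(0, len(weights) - window + 1, window):
        block = pruned[start:start + window]
        if mean(abs(w) for w in block) < threshold:
            for i in range(start, start + window):
                pruned[i] = 0.0
                mask[i] = 0
    return pruned, mask


def masked_update(weights, grads, mask, lr):
    """One retraining step (plain gradient descent) in which every weight
    that was pruned to 0 is forced to stay 0 via the mask."""
    return [(w - lr * g) * m for w, g, m in zip(weights, grads, mask)]


def kmeans_1d(values, k, iters=20):
    """Simple k-means on scalars with evenly spaced initial centers."""
    lo, hi = min(values), max(values)
    centers = [lo + (hi - lo) * (i + 0.5) / k for i in range(k)]
    for _ in range(iters):
        buckets = [[] for _ in range(k)]
        for v in values:
            nearest = min(range(k), key=lambda c: abs(v - centers[c]))
            buckets[nearest].append(v)
        # An empty class keeps its previous center.
        centers = [mean(b) if b else c for b, c in zip(buckets, centers)]
    return centers


def quantize(weights, group_size, k):
    """Group the weights, cluster each group, and replace every weight
    with the center weight of the class it falls into."""
    out = []
    for start in range(0, len(weights), group_size):
        group = weights[start:start + group_size]
        centers = kmeans_1d(group, min(k, len(group)))
        out.extend(min(centers, key=lambda c: abs(w - c)) for w in group)
    return out


if __name__ == "__main__":
    w = [0.01, -0.02, 0.9, 1.1, 0.03, 0.0, -1.0, -0.8]
    pruned, mask = coarse_prune(w, window=2, threshold=0.1)
    retrained = masked_update(pruned, [1.0] * len(w), mask, lr=0.1)
    print(pruned, mask, retrained, quantize(pruned, group_size=4, k=2))
```

In this sketch the mask returned by the pruning step is what keeps pruned weights at 0 during retraining. The quantization step clusters pruned zeros together with the surviving weights; whether zeros should be excluded from clustering is not stated in the claim and is left open here.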
