×

METHOD AND DEVICE FOR PARALLEL PROCESSING IN MODEL TRAINING

  • US 20150019214A1
  • Filed: 12/16/2013
  • Published: 01/15/2015
  • Est. Priority Date: 07/10/2013
  • Status: Active Grant
First Claim
Patent Images

1. A method of training a Deep Neural Network (DNN) model, comprising:

  • at a device comprising one or more processors and memory;

    establishing an initial DNN model;

    dividing a training data corpus into a plurality of disjoint data subsets;

    for each of the plurality of disjoint data subsets, providing the data subset to a respective training processing unit of a plurality of training processing units operating in parallel, wherein the respective training processing unit applies a Stochastic Gradient Descent (SGD) process to update the initial DNN model to generate a respective DNN sub-model based on the data subset; and

    merging the respective DNN sub-models generated by the plurality of training processing units to obtain an intermediate DNN model, wherein the intermediate DNN model is established as either the initial DNN model for a next training iteration or a final DNN model in accordance with a preset convergence condition.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×