SYSTEMS AND METHODS FOR LAYERED TRAINING IN MACHINE-LEARNING ARCHITECTURES
First Claim
1. A computer-implemented method for layered training of machine-learning architectures, the method implemented by a training computing device including a processor coupled to a memory, the method comprising:
receiving a plurality of data elements wherein each data element is associated with a timestamp;
determining a training window for each model layer of a layered stack of model layers;
determining a plurality of training data elements for each training window by identifying the data elements with timestamps corresponding to each of the training windows;
identifying a previous checkpoint for each model layer if the previous checkpoint for each model layer exists, wherein the previous checkpoint for each model layer is generated by a parent model layer;
training each model layer with the determined training data elements for each model layer and the identified previous checkpoint, if any, for each model layer;
generating a plurality of current checkpoints, wherein each current checkpoint of the plurality of current checkpoints is associated with a model layer; and
storing the plurality of current checkpoints at the memory.
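The claimed method can be illustrated with a minimal Python sketch. All names here (DataElement, ModelLayer, layered_training, the dict-based "checkpoint") are hypothetical stand-ins, not from the patent text; real training is replaced by a placeholder that only records what each layer saw.

```python
# Hedged sketch of the claimed layered-training loop, under the
# assumption that layers are visited parent-first. Illustrative only.
from dataclasses import dataclass

@dataclass
class DataElement:
    timestamp: int          # each data element is associated with a timestamp
    features: tuple

@dataclass
class ModelLayer:
    name: str
    window: tuple           # (start, end) training window for this layer
    parent: "ModelLayer | None" = None

def layered_training(elements, layers):
    """Train each layer on the data in its window, seeding from the
    previous checkpoint generated by the parent layer when one exists."""
    checkpoints = {}        # the "memory" storing current checkpoints
    for layer in layers:
        start, end = layer.window
        # determine the training data elements for this training window
        window_data = [e for e in elements if start <= e.timestamp < end]
        # identify the previous checkpoint, if any (generated by the parent)
        prev = checkpoints.get(layer.parent.name) if layer.parent else None
        # placeholder for actual training: record counts instead of weights
        state = {"seen": len(window_data),
                 "inherited": 0 if prev is None else prev["seen"]}
        checkpoints[layer.name] = state   # generate and store current checkpoint
    return checkpoints

# usage: a two-layer stack where "head" inherits the "base" checkpoint
base = ModelLayer("base", (0, 10))
head = ModelLayer("head", (10, 20), parent=base)
data = [DataElement(t, ()) for t in range(20)]
cps = layered_training(data, [base, head])
```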
2 Assignments
0 Petitions
Abstract
A computer-implemented method for layered training of machine-learning architectures includes receiving a plurality of data elements wherein each data element is associated with a timestamp, determining a training window for each model layer of a layered stack of model layers, determining a plurality of training data elements for each training window by identifying the data elements with timestamps corresponding to each of the training windows, identifying a previous checkpoint for each model layer wherein the previous checkpoint for each model layer is generated by a parent model layer, training each model layer with the determined training data elements for each model layer and the identified previous checkpoint for each model layer, generating a plurality of current checkpoints wherein each current checkpoint of the plurality of current checkpoints is associated with a model layer, and storing the plurality of current checkpoints at the memory.
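The step of "determining a plurality of training data elements for each training window" amounts to filtering timestamped elements into per-window buckets, which can be sketched as follows. The half-open window layout is an assumption for illustration; the abstract does not specify window boundaries.

```python
# Minimal illustration of selecting training data elements per window
# by timestamp. Function name and window semantics are assumptions.
def select_training_data(timestamped, windows):
    """Map each (start, end) window to the elements whose timestamps fall in it."""
    return {
        (start, end): [ts for ts in timestamped if start <= ts < end]
        for start, end in windows
    }

elements = [1, 3, 7, 12, 18]            # timestamps only, for brevity
windows = [(0, 10), (10, 20)]
per_window = select_training_data(elements, windows)
```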
36 Citations
20 Claims
1. A computer-implemented method for layered training of machine-learning architectures, the method implemented by a training computing device including a processor coupled to a memory, the method comprising:
receiving a plurality of data elements wherein each data element is associated with a timestamp;
determining a training window for each model layer of a layered stack of model layers;
determining a plurality of training data elements for each training window by identifying the data elements with timestamps corresponding to each of the training windows;
identifying a previous checkpoint for each model layer if the previous checkpoint for each model layer exists, wherein the previous checkpoint for each model layer is generated by a parent model layer;
training each model layer with the determined training data elements for each model layer and the identified previous checkpoint, if any, for each model layer;
generating a plurality of current checkpoints, wherein each current checkpoint of the plurality of current checkpoints is associated with a model layer; and
storing the plurality of current checkpoints at the memory.
(Dependent claims: 2-8)
9. A training computing device for layered training of machine-learning architectures, the training computing device comprising a memory for storing data, and a processor in communication with the memory, said processor programmed to:
receive a plurality of data elements wherein each data element is associated with a timestamp;
determine a training window for each model layer of a layered stack of model layers;
determine a plurality of training data elements for each training window by identifying the data elements with timestamps corresponding to each of the training windows;
identify a previous checkpoint for each model layer if the previous checkpoint for each model layer exists, wherein the previous checkpoint for each model layer is generated by a parent model layer;
train each model layer with the determined training data elements for each model layer and the identified previous checkpoint, if any, for each model layer;
generate a plurality of current checkpoints, wherein each current checkpoint of the plurality of current checkpoints is associated with a model layer; and
store the plurality of current checkpoints at the memory.
(Dependent claims: 10-16)
17. A computer-readable storage device, having processor-executable instructions embodied thereon, for layered training of machine-learning architectures, wherein the computer includes at least one processor and a memory coupled to the processor, wherein, when executed by the computer, the processor-executable instructions cause the computer to:
receive a plurality of data elements wherein each data element is associated with a timestamp;
determine a training window for each model layer of a layered stack of model layers;
determine a plurality of training data elements for each training window by identifying the data elements with timestamps corresponding to each of the training windows;
identify a previous checkpoint for each model layer if the previous checkpoint for each model layer exists, wherein the previous checkpoint for each model layer is generated by a parent model layer;
train each model layer with the determined training data elements for each model layer and the identified previous checkpoint, if any, for each model layer;
generate a plurality of current checkpoints, wherein each current checkpoint of the plurality of current checkpoints is associated with a model layer; and
store the plurality of current checkpoints at the memory.
(Dependent claims: 18-20)
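The "previous checkpoint ... generated by a parent model layer" limitation that runs through all three independent claims implies a lineage of checkpoints across the stack. A small sketch of that lineage, assuming the stack is a simple parent-first chain (all identifiers here are invented for illustration):

```python
# Hedged sketch of checkpoint lineage: each current checkpoint records
# which previous (parent-generated) checkpoint seeded it, if any.
def build_checkpoint_chain(layer_names):
    """Treat the stack as a chain: layer i's previous checkpoint is the
    one generated by layer i-1, its parent; the first layer has none."""
    checkpoints = []
    for i, name in enumerate(layer_names):
        previous = checkpoints[i - 1]["id"] if i > 0 else None
        checkpoints.append({"id": f"ckpt-{name}", "from": previous})
    return checkpoints

chain = build_checkpoint_chain(["l0", "l1", "l2"])
```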
Specification