STAGED TRAINING OF NEURAL NETWORKS FOR IMPROVED TIME SERIES PREDICTION PERFORMANCE
First Claim
1. An apparatus comprising a processor and a storage to store instructions that, when executed by the processor, cause the processor to perform operations comprising:
- train a first neural network of a chain of neural networks to generate a first portion of multiple portions of time series data that corresponds to a temporally earliest subrange of time of multiple subranges of time within a full range of time that is covered by the time series data, wherein;
the chain comprises a set of neural networks ordered to start with the first neural network at a head of the chain and to end with a last neural network at a tail of the chain;
each neural network of the chain comprises external inputs, additional inputs and outputs;
each neural network of the chain generates a portion of the multiple portions of the time series data at the outputs of the neural network from input data values provided at the external inputs of the neural network;
each portion of the multiple portions of the time series data corresponds to a subrange of the multiple subranges; and
the set of neural networks is interconnected within the chain such that each neural network, except the first neural network at the head of the chain, receives, at the additional inputs of the neural network, a portion of the multiple portions of the time series data that is generated at the outputs of a preceding neural network in the ordering of neural networks within the chain;
retrieve, from the first neural network, a first neural network configuration data comprising hyperparameters and first trained parameters learned by the first neural network from the training of the first neural network;
train, using at least the first neural network configuration data, a next neural network in the ordering of neural networks within the chain to generate a next portion of the multiple portions that corresponds to a next subrange of time of the multiple subranges of time that temporally follows the earliest subrange;
retrieve, from the next neural network, a next neural network configuration data comprising the hyperparameters and next trained parameters learned by the next neural network from the training of the next neural network; and
use at least the first neural network configuration data and the next neural network configuration data to instantiate the chain.
1 Assignment
0 Petitions
Accused Products
Abstract
An apparatus includes a processor to: train a first neural network of a chain to generate first configuration data including first trained parameters, wherein the chain performs an analytical function generating a set of output values from a set of input values, each neural network has inputs to receive the set of input values and outputs to output a portion of the set of output values, and the neural networks are ordered from the first at the head to a last neural network at the tail, and are interconnected so that each neural network additionally receives the outputs of a preceding neural network; train, using the first configuration data, a next neural network in the chain ordering to generate next configuration data including next trained parameters; and use at least the first and next configuration data and data indicating the interconnections to instantiate the chain to perform the analytical function.
-
Citations
30 Claims
-
1. An apparatus comprising a processor and a storage to store instructions that, when executed by the processor, cause the processor to perform operations comprising:
-
train a first neural network of a chain of neural networks to generate a first portion of multiple portions of time series data that corresponds to a temporally earliest subrange of time of multiple subranges of time within a full range of time that is covered by the time series data, wherein; the chain comprises a set of neural networks ordered to start with the first neural network at a head of the chain and to end with a last neural network at a tail of the chain; each neural network of the chain comprises external inputs, additional inputs and outputs; each neural network of the chain generates a portion of the multiple portions of the time series data at the outputs of the neural network from input data values provided at the external inputs of the neural network; each portion of the multiple portions of the time series data corresponds to a subrange of the multiple subranges; and the set of neural networks is interconnected within the chain such that each neural network, except the first neural network at the head of the chain, receives, at the additional inputs of the neural network, a portion of the multiple portions of the time series data that is generated at the outputs of a preceding neural network in the ordering of neural networks within the chain; retrieve, from the first neural network, a first neural network configuration data comprising hyperparameters and first trained parameters learned by the first neural network from the training of the first neural network; train, using at least the first neural network configuration data, a next neural network in the ordering of neural networks within the chain to generate a next portion of the multiple portions that corresponds to a next subrange of time of the multiple subranges of time that temporally follows the earliest subrange; retrieve, from the next neural network, a next neural network configuration data comprising the hyperparameters and next trained parameters learned by the next neural network from the training of the next neural network; and use at least the first neural network configuration data and the next neural network configuration data to instantiate the chain. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A computer-program product tangibly embodied in a non-transitory machine-readable storage medium, the computer-program product including instructions operable to cause a processor to perform operations comprising:
-
train a first neural network of a chain of neural networks to generate a first portion of multiple portions of time series data that corresponds to a temporally earliest subrange of time of multiple subranges of time within a full range of time that is covered by the time series data, wherein; the chain comprises a set of neural networks ordered to start with the first neural network at a head of the chain and to end with a last neural network at a tail of the chain; each neural network of the chain comprises external inputs, additional inputs and outputs; each neural network of the chain generates a portion of the multiple portions of the time series data at the outputs of the neural network from input data values provided at the external inputs of the neural network; each portion of the multiple portions of the time series data corresponds to a subrange of the multiple subranges; and the set of neural networks is interconnected within the chain such that each neural network, except the first neural network at the head of the chain, receives, at the additional inputs of the neural network, a portion of the multiple portions of the time series data that is generated at the outputs of a preceding neural network in the ordering of neural networks within the chain; retrieve, from the first neural network, a first neural network configuration data comprising hyperparameters and first trained parameters learned by the first neural network from the training of the first neural network; train, using at least the first neural network configuration data, a next neural network in the ordering of neural networks within the chain to generate a next portion of the multiple portions that corresponds to a next subrange of time of the multiple subranges of time that temporally follows the earliest subrange; retrieve, from the next neural network, a next neural network configuration data comprising the hyperparameters and next trained parameters learned by the next neural network from the training of the next neural network; and use at least the first neural network configuration data and the next neural network configuration data to instantiate the chain. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. A computer-implemented method comprising:
-
training, by a processor, a first neural network of a chain of neural networks to generate a first portion of multiple portions of time series data that corresponds to a temporally earliest subrange of time of multiple subranges of time within a full range of time that is covered by the time series data, wherein; the chain comprises a set of neural networks ordered to start with the first neural network at a head of the chain and to end with a last neural network at a tail of the chain; each neural network of the chain comprises external inputs, additional inputs and outputs; each neural network of the chain generates a portion of the multiple portions of the time series data at the outputs of the neural network from input data values provided at the external inputs of the neural network; each portion of the multiple portions of the time series data corresponds to a subrange of the multiple subranges; and the set of neural networks is interconnected within the chain such that each neural network, except the first neural network at the head of the chain, receives, at the additional inputs of the neural network, a portion of the multiple portions of the time series data that is generated at the outputs of a preceding neural network in the ordering of neural networks within the chain; retrieving, from the first neural network, a first neural network configuration data comprising hyperparameters and first trained parameters learned by the first neural network from the training of the first neural network; training, by the processor and using at least the first neural network configuration data, a next neural network in the ordering of neural networks within the chain to generate a next portion of the multiple portions that corresponds to a next subrange of time of the multiple subranges of time that temporally follows the earliest subrange; retrieving, from the next neural network, a next neural network configuration data comprising the hyperparameters and next trained parameters learned by the next neural network from the training of the next neural network; and using, by the processor, at least the first neural network configuration data and the next neural network configuration data to instantiate the chain. - View Dependent Claims (22, 23, 24, 25, 26, 27, 28, 29, 30)
-
Specification