Learning method for a neural network
First Claim
1. A learning method for training a recurrent neural network having a plurality of inputs and a plurality of outputs and at least one return line connecting an output to an input, comprising the steps of:
- a) separating said at least one return line during training of the neural network and using the input connected to said return line as a training input together with the other inputs;
b) in a computer, interpreting input quantities supplied to the inputs of said neural network for training as a time series of a set of values of a variable input quantity representing respective values of the input quantity at discrete points in time;
c) in said computer identifying a statistical noise distribution of an uncorrelated noise of finite variance that has a chronological average of zero and is superimposed on the measured values;
d) in said computer generating a respective inputs values for any additional training inputs by, for each input value for each additional training input, treating the input value as a missing value in said time series, calculating a statistical missing value noise distribution according to said known noise distribution from at least one of said input quantity values neighboring the missing value in the time series and calculating said value of the missing value by replacing the missing value with at least two Monte Carlo samples of the missing value obtained according to the missing value noise distribution; and
e) training said neural network using said time series and a behavior of a technical system represented by the neural network.
1 Assignment
0 Petitions
Accused Products
Abstract
In a learning method for training a recurrent neural network having a number of inputs and a number of outputs with at least one output being connected via a return line to an input, the return line is separated during training of the neural network, thereby freeing the input connected to the return line for use as an additional input during training, together with the other inputs. The additional input values, which must be estimated or predicted for supply to the thus-produced additional training inputs, are generated by treating each additional input value to be generated as a missing value in the time series of input quantities. Error distribution densities for the additional input values are calculated on the basis of the known values from the time series and their known or predetermined error distribution density, and samples are taken from this error distribution density according to the Monte Carlo method. These each lead to an estimated or predicted value whose average is introduced for the additional input value to be predicted. The method can be employed for the operation as well as for the training of the neural network, and is suitable for use in all known fields of utilization of neural networks.
-
Citations
13 Claims
-
1. A learning method for training a recurrent neural network having a plurality of inputs and a plurality of outputs and at least one return line connecting an output to an input, comprising the steps of:
-
a) separating said at least one return line during training of the neural network and using the input connected to said return line as a training input together with the other inputs; b) in a computer, interpreting input quantities supplied to the inputs of said neural network for training as a time series of a set of values of a variable input quantity representing respective values of the input quantity at discrete points in time; c) in said computer identifying a statistical noise distribution of an uncorrelated noise of finite variance that has a chronological average of zero and is superimposed on the measured values; d) in said computer generating a respective inputs values for any additional training inputs by, for each input value for each additional training input, treating the input value as a missing value in said time series, calculating a statistical missing value noise distribution according to said known noise distribution from at least one of said input quantity values neighboring the missing value in the time series and calculating said value of the missing value by replacing the missing value with at least two Monte Carlo samples of the missing value obtained according to the missing value noise distribution; and e) training said neural network using said time series and a behavior of a technical system represented by the neural network. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
Specification