ARTIFICIAL NEURAL NETWORKS HAVING COMPETITIVE REWARD MODULATED SPIKE TIME DEPENDENT PLASTICITY AND METHODS OF TRAINING THE SAME
First Claim
1. A method of training an artificial neural network having a plurality of layers and at least one weight matrix encoding connection weights between neurons in successive layers of the plurality of layers, the method comprising:
- receiving, at an input layer of the plurality of layers, at least one input;
generating, at an output layer of the plurality of layers, at least one output based on the at least one input;
generating a reward based on a comparison between the at least one output and a desired output; and
modifying the connection weights based on the reward, wherein the modifying the connection weights comprises maintaining a sum of synaptic input weights to each neuron to be substantially constant and maintaining a sum of synaptic output weights from each neuron to be substantially constant.
1 Assignment
0 Petitions
Accused Products
Abstract
A method of training an artificial neural network having a series of layers and at least one weight matrix encoding connection weights between neurons in successive layers. The method includes receiving, at an input layer of the series of layers, at least one input, generating, at an output layer of the series of layers, at least one output based on the at least one input, generating a reward based on a comparison of between the at least one output and a desired output, and modifying the connection weights based on the reward. Modifying the connection weights includes maintaining a sum of synaptic input weights to each neuron to be substantially constant and maintaining a sum of synaptic output weights from each neuron to be substantially constant.
-
Citations
20 Claims
-
1. A method of training an artificial neural network having a plurality of layers and at least one weight matrix encoding connection weights between neurons in successive layers of the plurality of layers, the method comprising:
-
receiving, at an input layer of the plurality of layers, at least one input; generating, at an output layer of the plurality of layers, at least one output based on the at least one input; generating a reward based on a comparison between the at least one output and a desired output; and modifying the connection weights based on the reward, wherein the modifying the connection weights comprises maintaining a sum of synaptic input weights to each neuron to be substantially constant and maintaining a sum of synaptic output weights from each neuron to be substantially constant. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A system, comprising:
-
a processor; and a non-transitory computer-readable storage medium operably coupled to the processor, the non-transitory computer-readable storage medium having software instructions stored therein, which, when executed by the processor, cause the processor to; process input parameters with an artificial neural network stored in the processor; generate, from the artificial neural network, at least one output based on the input parameters; generate a reward based on a comparison of between the output and a desired output; and modify connection weights between neurons in the artificial neural network based on the reward, wherein the modifying the connection weights comprises maintaining a sum of synaptic input weights to each neuron to be substantially constant and maintaining a sum of synaptic output weights from each neuron to be substantially constant. - View Dependent Claims (10, 11, 12, 13, 14, 15)
-
-
16. A method for controlling a vehicle component of a vehicle having a plurality of sensors and a processor in communication with the plurality of sensors, the method comprising:
-
receiving input parameters from the plurality of sensors; processing the input parameters with an artificial neural network stored in the processor; controlling the vehicle component based on output parameters calculated by the artificial neural network; determining a reward based on a comparison between a desired behavior of the vehicle and a behavior of the vehicle resulting from the controlling of the vehicle component; and modifying connection weights between neurons in the artificial neural network based on the reward, wherein the modifying the connection weights comprises maintaining a sum of synaptic input weights to each neuron to be substantially constant and maintaining a sum of synaptic output weights from each neuron to be substantially constant. - View Dependent Claims (17, 18, 19, 20)
-
Specification