Intelligent controller with neural network and reinforcement learning
First Claim
1. A plant controller using reinforcement learning for controlling a plant by generating a control signal therefor, the plant controller comprising:
- an action network for coupling to a plant and receiving therefrom a plant state signal, receiving a reinforcement signal and in accordance therewith generating a plant control signal, wherein the plant state signal represents an actual state of a quantitative plant performance parameter for the plant, the reinforcement signal represents a desired state of a performance control parameter for the action network, and the plant control signal represents a desired state of the quantitative plant performance parameter, and further wherein the plant control signal is generated in further accordance with unsupervised learning; and
a critic network, coupled to the action network, for coupling to the plant and receiving therefrom the plant state signal and a plant performance signal and in accordance therewith outputting the reinforcement signal to the action network, wherein the plant performance signal represents a qualitative plant performance parameter for the plant.
1 Assignment
0 Petitions
Accused Products
Abstract
A plant controller using reinforcement learning for controlling a plant includes action and critic networks with enhanced learning for generating a plant control signal. Learning is enhanced within the action network by using a neural network configured to operate according to unsupervised learning techniques based upon a Kohonen Feature Map. Learning is enhanced within the critic network by using a distance parameter which represents the difference between the actual and desired states of the quantitative performance, or output, of the plant when generating the reinforcement signal for the action network.
73 Citations
32 Claims
-
1. A plant controller using reinforcement learning for controlling a plant by generating a control signal therefor, the plant controller comprising:
-
an action network for coupling to a plant and receiving therefrom a plant state signal, receiving a reinforcement signal and in accordance therewith generating a plant control signal, wherein the plant state signal represents an actual state of a quantitative plant performance parameter for the plant, the reinforcement signal represents a desired state of a performance control parameter for the action network, and the plant control signal represents a desired state of the quantitative plant performance parameter, and further wherein the plant control signal is generated in further accordance with unsupervised learning; and a critic network, coupled to the action network, for coupling to the plant and receiving therefrom the plant state signal and a plant performance signal and in accordance therewith outputting the reinforcement signal to the action network, wherein the plant performance signal represents a qualitative plant performance parameter for the plant. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A plant controller using reinforcement learning for controlling a plant by generating a control signal therefor, the plant controller comprising:
-
an action network for coupling to a plant and receiving therefrom a plant state signal, receiving a reinforcement signal and in accordance therewith generating a plant control signal, wherein the plant state signal represents an actual state of a quantitative plant performance parameter for the plant, the reinforcement signal represents a desired state of a performance control parameter for the action network, and the plant control signal represents a desired state of the quantitative plant performance parameter; and a critic network, coupled to the action network, for coupling to the plant and receiving therefrom the plant state signal and a plant performance signal and in accordance therewith outputting the reinforcement signal to the action network, wherein the plant performance signal represents a qualitative plant performance parameter for the plant, and further wherein the reinforcement signal is outputted in further accordance with a distance parameter which represents a difference between the actual and desired states of the quantitative plant performance parameter. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A plant control method using reinforcement learning for controlling a plant by generating a control signal therefor, comprising the steps of:
-
receiving a plant state signal and a plant performance signal from a plant, wherein the plant state signal represents an actual state of a quantitative plant performance parameter for the plant and the plant performance signal represents a qualitative plant performance parameter for the plant; generating a reinforcement signal in accordance with the plant state signal and the plant performance signal, wherein the reinforcement signal represents a desired state of a performance control parameter; receiving the reinforcement signal; and generating a plant control signal in accordance with the plant state signal and the reinforcement signal and in further accordance with unsupervised learning, wherein the plant control signal represents a desired state of the quantitative plant performance parameter. - View Dependent Claims (14, 15, 16, 17, 18)
-
-
19. A plant control method using reinforcement learning for controlling a plant by generating a control signal therefor, comprising the steps of:
-
receiving a plant state signal and a plant performance signal from a plant, wherein the plant state signal represents an actual state of a quantitative plant performance parameter for the plant and the plant performance signal represents a qualitative plant performance parameter for the plant, generating a reinforcement signal in accordance with the plant state signal and the plant performance signal and in further accordance with a distance parameter which represents a difference between the actual state and a desired state of the quantitative plant performance parameter, wherein the reinforcement signal represents a desired state of a performance control parameter; receiving the reinforcement signal; and generating a plant control signal in accordance with the plant state signal and the reinforcement signal, wherein the plant control signal represents the desired state of the quantitative plant performance parameter. - View Dependent Claims (20, 21, 22, 23, 24)
-
-
25. A plant controller using reinforcement learning for controlling a plant by generating a control signal therefor, the plant controller comprising:
-
an action network for coupling to a plant and receiving therefrom a plant state signal, receiving a reinforcement signal and in accordance therewith generating a plant control signal, wherein the plant state signal represents an actual state of a quantitative plant performance parameter for the plant, the reinforcement signal represents a desired state of a performance control parameter for the action network, and the plant control signal represents a desired state of the quantitative plant performance parameter, and wherein the plant control signal is generated in further accordance with unsupervised learning, and further wherein the action network comprises an unsupervised learning based neural network, and still further wherein the unsupervised learning based neural network is configured to perform Kohonen feature mapping; and a critic network, coupled to the action network, for coupling to the plant and receiving therefrom the plant state signal and a plant performance signal and in accordance therewith outputting the reinforcement signal to the action network, wherein the plant performance signal represents a qualitative plant performance parameter for the plant.
-
-
26. A plant controller using reinforcement learning for controlling a plant by generating a control signal therefor, the plant controller comprising:
-
an action network for coupling to a plant and receiving therefrom a plant state signal, receiving a reinforcement signal and in accordance therewith generating a plant control signal, wherein the plant state signal represents an actual state of a quantitative plant performance parameter for the plant, the reinforcement signal represents a desired state of a performance control parameter for the action network, and the plant control signal represents a desired state of the quantitative plant performance parameter, and wherein the plant control signal is generated in further accordance with unsupervised learning, and further wherein the action network generates the plant control signal in still further accordance with a plurality of fuzzy logic rules; and a critic network, coupled to the action network, for coupling to the plant and receiving therefrom the plant state signal and a plant performance signal and in accordance therewith outputting the reinforcement signal to the action network, wherein the plant performance signal represents a qualitative plant performance parameter for the plant.
-
-
27. A plant controller using reinforcement learning for controlling a plant by generating a control signal therefor, the plant controller comprising:
-
an action network for coupling to a plant and receiving therefrom a plant state signal, receiving a reinforcement signal and in accordance therewith generating a plant control signal, wherein the plant state signal represents an actual state of a quantitative plant performance parameter for the plant, the reinforcement signal represents a desired state of a performance control parameter for the action network, and the plant control signal represents a desired state of the quantitative plant performance parameter, and wherein the action network comprises a neural network, and further wherein the neural network is configured to perform Kohonen feature mapping; and a critic network, coupled to the action network, for coupling to the plant and receiving therefrom the plant state signal and a plant performance signal and in accordance therewith outputting the reinforcement signal to the action network, wherein, the plant performance signal represents a qualitative plant performance parameter for the plant, and further wherein the reinforcement signal is outputted in further accordance with a distance parameter which represents a difference between the actual and desired states of the quantitative plant performance parameter.
-
-
28. A plant controller using reinforcement learning for controlling a plant by generating a control signal therefor, the plant controller comprising:
-
an action network for coupling to a plant and receiving therefrom a plant state signal, receiving a reinforcement signal and in accordance therewith generating a plant control signal, wherein the plant state signal represents an actual state of a quantitative plant performance parameter for the plant, the reinforcement signal represents a desired state of a performance control parameter for the action network, and the plant control signal represents a desired state of the quantitative plant performance parameter, and wherein the action network generates the plant control signal in still further accordance with a plurality of fuzzy logic rules; and a critic network, coupled to the action network, for coupling to the plant and receiving therefrom the plant state signal and a plant performance signal and in accordance therewith outputting the reinforcement signal to the action network, wherein the plant performance signal represents a qualitative plant performance parameter for the plant, and further wherein the reinforcement signal is outputted in further accordance with a distance parameter which represents a difference between the actual and desired states of the quantitative plant performance parameter.
-
-
29. A plant control method using reinforcement learning for controlling a plant by generating a control signal therefor, comprising the steps of:
-
receiving a plant state signal and a plant performance signal from a plant, wherein the plant state signal represents an actual state of a quantitative plant performance parameter for the plant and the plant performance signal represents a qualitative plant performance parameter for the plant; generating a reinforcement signal in accordance with the plant state signal and the plant performance signal, wherein the reinforcement signal represents a desired state of a performance control parameter; receiving the reinforcement signal; and generating a plant control signal in accordance with the plant state signal and the reinforcement signal and in further accordance with unsupervised learning and Kohonen feature mapping with an unsupervised learning based neural network which includes first and second pluralities of inter-neuron weights data associated therewith, wherein the plant control signal represents a desired state of the quantitative plant performance parameter.
-
-
30. A plant control method using reinforcement learning for controlling a plant by generating a control signal therefor, comprising the steps of:
-
receiving a plant state signal and a plant performance signal from a plant, wherein the plant state signal represents an actual state of a quantitative plant performance parameter for the plant and the plant performance signal represents a qualitative plant performance parameter for the plant; generating a reinforcement signal in accordance with the plant state signal and the plant performance signal, wherein the reinforcement signal represents a desired state of a performance control parameter; receiving the reinforcement signal; and generating a plant control signal in accordance with the plant state signal and the reinforcement signal and in further accordance with unsupervised learning and in still further accordance with a plurality of fuzzy logic rules, wherein the plant control signal represents a desired state of the quantitative plant performance parameter.
-
-
31. A plant control method using reinforcement learning for controlling a plant by generating a control signal therefor, comprising the steps of:
-
receiving a plant state signal and a plant performance signal from a plant, wherein the plant state signal represents an actual state of a quantitative plant performance parameter for the plant and the plant performance signal represents a qualitative plant performance parameter for the plant, generating a reinforcement signal in accordance with the plant state signal and the plant performance signal and in further accordance with a distance parameter which represents a difference between the actual state and a desired state of the quantitative plant performance parameter, wherein the reinforcement signal represents a desired state of a performance control parameter; receiving the reinforcement signal; and generating a plant control signal in accordance with the plant state signal, the reinforcement signal and Kohonen feature mapping with a neural network which includes first and second pluralities of inter-neuron weights data associated therewith, wherein the plant control signal represents the desired state of the quantitative plant performance parameter.
-
-
32. A plant control method using reinforcement learning for controlling a plant by generating a control signal therefor, comprising the steps of:
-
receiving a plant state signal and a plant performance signal from a plant, wherein the plant state signal represents an actual state of a quantitative plant performance parameter for the plant and the plant performance signal represents a qualitative plant performance parameter for the plant, generating a reinforcement signal in accordance with the plant state signal and the plant performance signal and in further accordance with a distance parameter which represents a difference between the actual state and a desired state of the quantitative plant performance parameter, wherein the reinforcement signal represents a desired state of a performance control parameter; receiving the reinforcement signal; and generating a plant control signal in accordance with the plant state signal, the reinforcement signal and a plurality of fuzzy logic rules, wherein the plant control signal represents the desired state of the quantitative plant performance parameter.
-
Specification