Resource cost optimization system, method, and program
Abstract
An apparatus and method use a Markov decision process (MDP) to reduce the cost incurred by variations in electric power usage. The user notifies a power company of a predicted usage value for a period, and the period is divided into subsections. For each subsection, the amount by which a storage battery is charged or discharged is decided optimally as the action at any given time, on the basis of an MDP whose state depends on the electric power usage amount error, the charge amount, the time, and the set target at that time. A predetermined time in a subsection is a target setting time, at which a future target is also set as part of the action. The action therefore includes deciding the charging and discharging amount in that subsection and deciding a future target in a subsection whose target should be set.
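The per-subsection decision described in the abstract can be illustrated with a small backward-induction sketch. This is not the patented implementation: the error distribution, battery capacity, number of subsections, unit charge/discharge actions, and the end-of-section penalty proportional to the absolute deviation are all assumptions chosen for illustration, and the target-setting part of the action is omitted for brevity.

```python
from functools import lru_cache

# All values below are hypothetical illustration inputs, not from the patent.
ERROR_DIST = {-1: 0.25, 0: 0.5, 1: 0.25}  # usage error per subsection -> probability
CAPACITY = 2          # maximum battery charge, in units
SUBSECTIONS = 3       # number of subsections in one section
ACTIONS = (-1, 0, 1)  # -1: discharge one unit, 0: idle, +1: charge one unit
PENALTY = 10.0        # assumed cost per unit of deviation at the end of the section


def terminal_cost(deviation):
    """Assumed cost structure: pay for the absolute deviation from the target."""
    return PENALTY * abs(deviation)


@lru_cache(maxsize=None)
def solve(t, deviation, charge):
    """Return (expected cost, best action) from subsection t onward.

    State: subsection index t, accumulated deviation from the target,
    and the current battery charge.
    """
    if t == SUBSECTIONS:
        return terminal_cost(deviation), None
    best = None
    for action in ACTIONS:
        new_charge = charge + action
        if not 0 <= new_charge <= CAPACITY:
            continue  # action would overfill or overdraw the battery
        # Charging adds to grid usage, discharging subtracts from it.
        expected = sum(
            p * solve(t + 1, deviation + err + action, new_charge)[0]
            for err, p in ERROR_DIST.items()
        )
        if best is None or expected < best[0]:
            best = (expected, action)
    return best


if __name__ == "__main__":
    cost, action = solve(0, 0, 1)
    print(f"expected cost {cost:.2f}, first action {action}")
```

Calling `solve(t, deviation, charge)` for any reachable state returns both the expected cost and the action to take, so the memoized table acts as the policy. Recursion with `lru_cache` is used here only to keep the sketch short; a tabular loop over subsections in reverse order would be equivalent.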
9 Citations
10 Claims
1. A computer implemented method for generating a policy for optimizing a cost of a resource under a predetermined cost structure, the method comprising:
- storing, in a computer-readable medium, an error distribution that indicates a deviation of an amount of usage from a predicted value referred to as usage amount error, a characteristic of a storage battery configured to store or release the resource, wherein the characteristic includes an amount of the resource in the storage battery, and the cost structure;
- calculating, using a computer processor processing a Markov decision process, an expected cost and a parameter that includes a transition probability on the basis of the error distribution, the characteristic of the storage battery, and the cost structure, the Markov decision process including a state defined by at least the usage amount error, the amount of the resource in the storage battery, a specification of a subsection within a section of usage interval, and a set target for a next section; and
- deciding, using the computer processor, and implementing an optimal policy for the next section that includes an action of storing or releasing the resource in the storage battery for the state of the Markov decision process using the expected cost in the Markov decision process and the parameter including the transition probability, wherein the resource comprises electric power.
Dependent claims: 2, 3, 4, 5
6. A computer program product for generating a policy for optimizing a cost of a resource under a predetermined cost structure, the program product comprising a computer-readable storage medium having program code embodied therewith, the program code being executable by a processor to perform a method comprising:
- a step of storing an error distribution that indicates a deviation of an amount of usage from a predicted value referred to as usage amount error, a characteristic of a storage battery configured to store or release the resource, wherein the characteristic includes an amount of the resource in the storage battery, and the cost structure in a computer-readable form;
- a step of calculating, using a Markov decision process, an expected cost and a parameter that includes a transition probability on the basis of the error distribution, the characteristic of the storage battery, and the cost structure, the Markov decision process including a state defined by at least the usage amount error, the amount of resource in the storage battery, a specification of a subsection within a section of usage interval, and a set target for a next section; and
- a step of deciding an optimal policy for implementation, the optimal policy includes an action of storing or releasing the resource in the storage battery for the state of the Markov decision process using the expected cost in the Markov decision process and the parameter including the transition probability, wherein the resource comprises electric power.
Dependent claims: 7, 8, 9, 10
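The state and transition probability recited in the claims can be sketched as follows. This is a minimal illustration under stated assumptions, not the claimed method itself: the `State` type, the `transitions` helper, the discrete error distribution, the capacity, and the rule that a charge or discharge shifts the usage error by the same amount are all hypothetical. The set target is carried through unchanged, because target-setting actions are not modeled here.

```python
from typing import NamedTuple


class State(NamedTuple):
    """Hypothetical MDP state with the four components named in the claims."""
    usage_error: int   # accumulated deviation of usage from the predicted value
    charge: int        # amount of the resource currently in the storage battery
    subsection: int    # index of the subsection within the section
    next_target: int   # target already set for the next section


# Assumed discrete error distribution: per-subsection error -> probability.
ERROR_DIST = {-1: 0.2, 0: 0.6, 1: 0.2}
CAPACITY = 3  # assumed battery capacity, in units


def transitions(state, action):
    """Return {next_state: probability} for a charge (+) or discharge (-) action."""
    new_charge = state.charge + action
    if not 0 <= new_charge <= CAPACITY:
        raise ValueError("action violates battery capacity")
    result = {}
    for err, p in ERROR_DIST.items():
        nxt = State(
            usage_error=state.usage_error + err + action,
            charge=new_charge,
            subsection=state.subsection + 1,
            next_target=state.next_target,
        )
        # Accumulate in case two error values lead to the same next state.
        result[nxt] = result.get(nxt, 0.0) + p
    return result


if __name__ == "__main__":
    for nxt, p in transitions(State(0, 1, 0, 5), -1).items():
        print(nxt, p)
```

Because the battery update is deterministic in this sketch, the only randomness in a transition comes from the error distribution, so each row of transition probabilities is a shifted copy of that distribution.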
Specification