Method and apparatus for reward-based learning of improved systems management policies
First Claim
1. A method for learning a policy for management of at least one component of a data processing system, the method comprising:
- obtaining a decision-making entity for managing said at least one component;
obtaining a reward mechanism for generating numerical measures of value responsive to at least one action performed in at least one state of said at least one component;
applying said decision-making entity and said reward mechanism to said at least one component;
processing a result achieved through application of said decision-making entity and said reward mechanism in accordance with reward-based learning; and
deriving said policy in accordance with said reward-based learning processing.
1 Assignment
0 Petitions
Accused Products
Abstract
In one embodiment, the present invention is a method for reward-based learning of improved systems management policies. One embodiment of the inventive method involves supplying a first policy and a reward mechanism. The first policy maps states of at least one component of a data processing system to selected management actions, while the reward mechanism generates numerical measures of value responsive to particular actions (e.g., management actions) performed in particular states of the component(s). The first policy and the reward mechanism are applied to the component(s), and results achieved through this application (e.g., observations of corresponding states, actions and rewards) are processed in accordance with reward-based learning to derive a second policy having improved performance relative to the first policy in at least one state of the component(s).
-
Citations
20 Claims
-
1. A method for learning a policy for management of at least one component of a data processing system, the method comprising:
-
obtaining a decision-making entity for managing said at least one component;
obtaining a reward mechanism for generating numerical measures of value responsive to at least one action performed in at least one state of said at least one component;
applying said decision-making entity and said reward mechanism to said at least one component;
processing a result achieved through application of said decision-making entity and said reward mechanism in accordance with reward-based learning; and
deriving said policy in accordance with said reward-based learning processing. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16)
-
-
17. A computer readable medium containing an executable program for learning a policy for management of at least one component of a data processing system, where the program performs the steps of:
-
obtaining a decision-making entity for managing said at least one component;
obtaining a reward mechanism for generating numerical measures of value responsive to at least one action performed in at least one state of said at least one component;
applying said decision-making entity and said reward mechanism to said at least one component;
processing a result achieved through application of said decision-making entity and said reward mechanism in accordance with reward-based learning; and
deriving said policy in accordance with said reward-based learning processing. - View Dependent Claims (18, 19)
-
-
20. Apparatus for learning a policy for management of at least one component of a data processing system, the apparatus comprising:
-
means for obtaining a decision-making entity for managing said at least one component;
means for obtaining a reward mechanism for generating numerical measures of value responsive to at least one action performed in at least one state of said at least one component;
means for applying said decision-making entity and said reward mechanism to said at least one component;
means for processing a result achieved through application of said decision-making entity and said reward mechanism in accordance with reward-based learning; and
means for deriving said policy in accordance with said reward-based learning processing.
-
Specification