AUTOMATED LEARNING OF FAILURE RECOVERY POLICIES
First Claim
Patent Images
1. A system comprising:
- a model learning component configured to access collected observable interactions of an existing repair policy with a process to build a model of the process, the model mapping states of the process to repair actions of the existing repair policy;
a policy computation component configured to compute a new policy based upon the model, the new policy identifying a number of times to retry a first one of the repair actions when the process is in a first one of the states of the process; and
a controller configured to apply the new policy to the process and, in an instance when the first state is identified, retry the first repair action the number of times identified by the new policy before escalating the first repair action to a second one of the repair actions; and
one or more processing units configured to execute at least one of the model learning component, the policy computation component, or the policy application component.
4 Assignments
0 Petitions
Accused Products
Abstract
Described is automated learning of failure recovery policies based upon existing information regarding previous policies and actions. A learning mechanism automatically constructs a new policy for controlling a recovery process, based upon collected observable interactions of an existing policy with the process. In one aspect, the learning mechanism builds a partially observable Markov decision process (POMDP) model, and computes the new policy base upon the learned model. The new policy may perform automatic fault recovery, e.g., on a machine in a datacenter corresponding to the controlled process.
-
Citations
30 Claims
-
1. A system comprising:
-
a model learning component configured to access collected observable interactions of an existing repair policy with a process to build a model of the process, the model mapping states of the process to repair actions of the existing repair policy; a policy computation component configured to compute a new policy based upon the model, the new policy identifying a number of times to retry a first one of the repair actions when the process is in a first one of the states of the process; and a controller configured to apply the new policy to the process and, in an instance when the first state is identified, retry the first repair action the number of times identified by the new policy before escalating the first repair action to a second one of the repair actions; and one or more processing units configured to execute at least one of the model learning component, the policy computation component, or the policy application component. - View Dependent Claims (2, 5, 6, 7, 8, 9)
-
- 3. (canceled)
-
10-20. -20. (canceled)
-
21. A method comprising:
-
accessing collected observable interactions of an existing repair policy with a process to build a model of the process, the model mapping states of the process to repair actions of the existing repair policy; computing a new policy based upon the model, the new policy identifying a number of times to retry a first one of the repair actions when the process is in a first one of the states of the process; and applying the new policy to the process and, in an instance when the first state is identified, retrying the first repair action the number of times identified by the new policy before escalating the first repair action to a second one of the repair actions. - View Dependent Claims (22, 23, 24, 25)
-
-
26. One or more computer-readable storage devices comprising instructions which, when executed by one or more processing units, cause the one or more processing units to perform:
-
accessing collected observable interactions of an existing repair policy with a process to build a model of the process, the model mapping states of the process to repair actions of the existing repair policy; computing a new policy based upon the model, the new policy identifying a number of times to retry a first one of the repair actions when the process is in a first one of the states of the process; and applying the new policy to the process and, in an instance when the first state is identified, retrying the first repair action the number of times identified by the new policy before escalating the first repair action to a second one of the repair actions. - View Dependent Claims (27, 28, 30)
-
-
29. The one or more computer-readable storage devices according to claim 29, the model being computed based on a first action cost associated with the first repair action and a second action cost associated with the second repair action.
Specification