Workflow model for coordinating the recovery of IT outages based on integrated recovery plans
First Claim
1. A method of deploying a workflow model to coordinate recovery of an outage within an IT infrastructure through execution of an integrated recovery plan, comprising:
- initiating a workflow responsive to the occurrence of an outage event;
creating a processing context for the workflow;
adding data to the processing context throughout the workflow related to the outage event, outage type, failed resources within the IT infrastructure, status of certain resources within the IT infrastructure, recovery options supplied by an operational management product, and contact information for the operational management product;
executing the workflow, including performing the steps of;
requesting automated analysis of the outage event to enrich the processing context;
engaging one or more responsible parties to manually perform analysis of the outage event and select a recovery plan from a plurality of recovery plans;
obtaining approval of the selected recovery plan from one or more decision making parties defined within the selected recovery plan;
declaring a disaster by utilizing notification templates defined within the selected recovery plan;
implementing the selected recovery plan upon the IT infrastructure to recover from the outage event, including executing recovery options within the selected recovery plan;
verifying recovery to the outage event within the IT infrastructure by performing verifications upon results of the selected recovery plan implementation; and
declaring recovery complete responsive to obtaining notifications produced by execution of the selected recovery plan; and
analyzing the processing context to obtain post mortem analysis of recovery to the outage event with the selected recovery plan and to create continuous service improvements for the IT infrastructure.
1 Assignment
0 Petitions
Accused Products
Abstract
One aspect of the present invention provides a workflow model to effectively respond to outage events within an IT infrastructure. This workflow model enables a combination of manual and automated processing to effectively deploy a flexible, plannable, and testable recovery to outages and problems encountered within IT infrastructure settings. In one embodiment, a shared processing context is created to accompany the operations of the workflow, thereby collecting useful data in one location related to events and status information during the outage and the outage response. Within the workflow, analysis of the outage event is performed, an appropriate recovery plan is selected, the selected recovery plan is implemented, and recovery to the outage event is completed. Data collected within the processing context can be analyzed to obtain post mortem analysis and continuous service improvements. Accordingly, the improvements can be implemented within the IT infrastructure directly or within the appropriate recovery plan.
-
Citations
3 Claims
-
1. A method of deploying a workflow model to coordinate recovery of an outage within an IT infrastructure through execution of an integrated recovery plan, comprising:
-
initiating a workflow responsive to the occurrence of an outage event; creating a processing context for the workflow; adding data to the processing context throughout the workflow related to the outage event, outage type, failed resources within the IT infrastructure, status of certain resources within the IT infrastructure, recovery options supplied by an operational management product, and contact information for the operational management product; executing the workflow, including performing the steps of; requesting automated analysis of the outage event to enrich the processing context; engaging one or more responsible parties to manually perform analysis of the outage event and select a recovery plan from a plurality of recovery plans; obtaining approval of the selected recovery plan from one or more decision making parties defined within the selected recovery plan; declaring a disaster by utilizing notification templates defined within the selected recovery plan; implementing the selected recovery plan upon the IT infrastructure to recover from the outage event, including executing recovery options within the selected recovery plan; verifying recovery to the outage event within the IT infrastructure by performing verifications upon results of the selected recovery plan implementation; and declaring recovery complete responsive to obtaining notifications produced by execution of the selected recovery plan; and analyzing the processing context to obtain post mortem analysis of recovery to the outage event with the selected recovery plan and to create continuous service improvements for the IT infrastructure.
-
-
2. A system for deploying a workflow model to coordinate recovery of an outage within an IT infrastructure through execution of an integrated recovery plan, the system comprising a data processor coupled to a memory that includes instructions that are operable by the data processor to perform steps of:
-
initiating a workflow responsive to the occurrence of an outage event, wherein the data related to the outage event comprises outage type, failed resources within the IT infrastructure, status of certain resources within the IT infrastructure, recovery options supplied by an operational management product, and contact information for the operational management product; creating a processing context for the workflow; executing the workflow; responsive to executing the workflow, adding data to the processing context related to the outage event; and analyzing the processing context to obtain post mortem analysis of recovery to the outage event with a recovery plan.
-
-
3. A computer program product comprising, a non-transitory computer readable storage device having stored therein instructions that are operable to coordinate recovery of an outage within an IT infrastructure through execution of an integrated recovery plan, wherein the instructions are operable to perform steps of:
-
initiating a workflow responsive to the occurrence of an outage event, wherein the data related to the outage event comprises outage type, failed resources within the IT infrastructure, status of certain resources within the IT infrastructure, recovery options supplied by an operational management product, and contact information for the operational management product; creating a processing context for the workflow; executing the workflow; responsive to executing the workflow, adding data to the processing context related to the outage event; and analyzing the processing context to obtain post mortem analysis of recovery to the outage event with a recovery plan.
-
Specification