×

Defining a computer recovery process that matches the scope of outage including determining a root cause and performing escalated recovery operations

  • US 8,826,077 B2
  • Filed: 12/28/2007
  • Issued: 09/02/2014
  • Est. Priority Date: 12/28/2007
  • Status: Active Grant
First Claim
Patent Images

1. A computer program product for facilitating recovery in an Information Technology (IT) environment, said computer program product comprising:

  • a non-transitory computer readable storage medium readable by a processing circuit and storing instructions for execution by the processing circuit for performing a method comprising;

    programmatically analyzing, at failure time, information relating to a failure within the IT environment to determine which resource of a plurality of resources is the resource corresponding to a root cause of the failure, said information being related to at least one of;

    one or more resources impacted by the failure, one or more implications of the failure, or one or more resources degraded by the failure, wherein the programmatically analyzing comprises iteratively analyzing the information to determine which resource is the resource corresponding to the root cause;

    programmatically determining, at failure time, based on the programmatically analyzing, the root cause for the failure; and

    programmatically defining, at failure time, by a processor, one or more resources to be included in a set of resources to be recovered and one or more operations to be used in recovering the set of resources based on the analyzed information and the determined root cause, wherein said programmatically defining comprises;

    determining, at failure time, the one or more resources affected by the failure, the determining based on the analyzed information and the root cause;

    including the one or more resources determined at failure time to be affected by the failure in the set of resources to be recovered, wherein the set of resources is commensurate with a scope of the failure, as determined at failure time, in that the set of resources includes only those resources affected by the failure; and

    determining one or more operations to be performed on the set of resources, wherein the determining takes into consideration at least one of;

    an effect an operation has on a resource of the set of resources on which the operation is performed, an impact on at least one other resource of the set of resources, or a time it takes to perform the operation, wherein the determining the one or more operations is iterative, and wherein an operation selected to be used in recovery is an escalated operation having an increased severity, in response to a previous operation failing.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×