Failure simulation and availability report on same
Abstract
A simulation may be performed to assess availability of a service in the event of various types of failures. A specification may describe a behavior to be induced on a machine, and a formula under which availability is to be measured. A controller processes the specification and dispatches a request to induce the behavior to a machine on which the behavior is to be induced. An agent on that machine may look up a routine in a simulation library and may use the routine to induce the behavior. Data, such as performance counter values, are collected before, during, and/or after occurrence of the behavior, and are transmitted back to the controller. A report generator applies the formula contained in the specification to the data received to calculate the availability metric, and produces a report containing that metric.
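The flow described in the abstract can be sketched roughly as follows. This is an illustrative toy model only, not the patented implementation: the names `Machine`, `Agent`, `Controller`, `Specification`, `SIMULATION_LIBRARY`, the `halve_throughput` behavior, and the `requests_per_sec` counter are all hypothetical, and the availability formula (throughput during the behavior divided by throughput before it) is just one assumed example of a formula a specification might carry.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Tuple


@dataclass
class Machine:
    """Hypothetical stand-in for a computer with one performance counter."""
    name: str
    requests_per_sec: float = 100.0


def _halve_throughput(machine: Machine) -> None:
    # Hypothetical induced behavior: degrade the machine's throughput.
    machine.requests_per_sec /= 2


def _restore_throughput(machine: Machine) -> None:
    # End the behavior by undoing the degradation.
    machine.requests_per_sec *= 2


# Hypothetical simulation library: behavior name -> (induce, end) routines.
SIMULATION_LIBRARY: Dict[str, Tuple[Callable[[Machine], None],
                                    Callable[[Machine], None]]] = {
    "halve_throughput": (_halve_throughput, _restore_throughput),
}


@dataclass
class Specification:
    """Names the behavior to induce and the formula for availability."""
    behavior: str
    formula: Callable[[Dict[str, float]], float]


class Agent:
    """Runs on the target machine; induces behaviors and samples the counter."""

    def __init__(self, machine: Machine) -> None:
        self.machine = machine

    def handle_request(self, behavior: str) -> Dict[str, float]:
        induce, end = SIMULATION_LIBRARY[behavior]  # look up the routine
        before = self.machine.requests_per_sec      # sample before the behavior
        induce(self.machine)
        during = self.machine.requests_per_sec      # sample during the behavior
        end(self.machine)
        after = self.machine.requests_per_sec       # sample after it has ended
        return {"before": before, "during": during, "after": after}


class Controller:
    """Processes a specification and dispatches the request to an agent."""

    def __init__(self, agents: Dict[str, Agent]) -> None:
        self.agents = agents

    def run(self, spec: Specification, target: str) -> Dict[str, float]:
        return self.agents[target].handle_request(spec.behavior)


def generate_report(spec: Specification, data: Dict[str, float]) -> str:
    """Apply the specification's formula to the collected data."""
    availability = spec.formula(data)
    return f"behavior={spec.behavior} availability={availability:.2%}"


if __name__ == "__main__":
    spec = Specification(
        behavior="halve_throughput",
        formula=lambda d: d["during"] / d["before"],  # assumed example formula
    )
    controller = Controller({"web01": Agent(Machine("web01"))})
    print(generate_report(spec, controller.run(spec, "web01")))
```

In this sketch the agent and controller share one process for brevity; in the arrangement the abstract describes, they would run on separate machines and exchange the request and the collected data over some transport.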
20 Claims
1. A method of evaluating failure tolerance, the method comprising:
receiving a specification of a simulation to be performed, said simulation comprising a behavior;
dispatching, to an agent installed at a computer, a request to induce said behavior at said computer;
receiving, from said agent, data collected at said computer; and
generating a report, based on said data, that comprises a first indication of performance during occurrence of said behavior.

7. One or more computer-readable storage media comprising executable instructions to perform a method, the method comprising:
receiving, from a controller that operates at a first machine, a request to induce a behavior at a second machine on which said instructions execute;
recording a first value of a performance counter prior to said behavior;
inducing said behavior in response to said request;
recording a second value of said performance counter during said behavior and a third value of said performance counter after said behavior has ended; and
communicating, to said controller, data that comprises said first value, said second value, and said third value.

14. A system comprising:
a first computer on which a controller is installed;
a first instance of an agent that induces, on a second computer on which said first instance of said agent is installed, a first behavior in response to a first request from said controller, and that collects data about performance on said second computer;
a report generator that receives said data and that generates, based on information that comprises said data, a report that describes availability of a service provided by a group of one or more computers that include said second computer; and
an output device that is comprised by, or connected to, said first computer, said report generator communicating said report through said output device.

Specification