MONITORING AND RESOLVING DEADLOCKS, CONTENTION, RUNAWAY CPU AND OTHER VIRTUAL MACHINE PRODUCTION ISSUES
First Claim
1. A computer-implemented method for resolving a computing system issue, comprising:
- monitoring a set of computing system health status metrics of a system at a first level;
analyzing data of the monitored computing system health status metrics to determine that an instability has occurred when the data exceeds defined bounds for the computing system health status metrics;
responding to the instability by monitoring additional computing system health status metrics, whereby a level of monitoring of the system is increased from the first level to a second level, greater than the first level;
identifying the instability;
repairing the computing system by taking corrective action based on the identified instability; and
removing at least one of the set of monitoring and profiling tools to reduce the level of monitoring to a third level once the instability has been resolved, wherein the third level is less than the second level.
1 Assignment
0 Petitions
Accused Products
Abstract
Resolving virtual machine (VM) issues, by executing VM and operating system (OS) diagnostic monitors, including, monitoring a set of VM and OS health status metrics of a system at a first level, analyzing data of the monitored health status metrics to determine that an instability has occurred when the data exceeds defined bounds for the health status metrics, responding to the instability by monitoring additional VM and OS health status metrics, whereby a level of monitoring of the system is increased from the first level to a second level, greater than the first level, identifying the instability, repairing the system by taking corrective action based on the identified instability; and removing at least one of the set of monitoring and profiling tools to reduce the level of monitoring to a third level once the instability has been resolved, wherein the third level is less than the second level.
18 Citations
8 Claims
-
1. A computer-implemented method for resolving a computing system issue, comprising:
-
monitoring a set of computing system health status metrics of a system at a first level; analyzing data of the monitored computing system health status metrics to determine that an instability has occurred when the data exceeds defined bounds for the computing system health status metrics; responding to the instability by monitoring additional computing system health status metrics, whereby a level of monitoring of the system is increased from the first level to a second level, greater than the first level; identifying the instability; repairing the computing system by taking corrective action based on the identified instability; and removing at least one of the set of monitoring and profiling tools to reduce the level of monitoring to a third level once the instability has been resolved, wherein the third level is less than the second level. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
Specification