Continuous system health indicator for managing computer system alerts
First Claim
1. A computer-implemented method for measuring health of a computer system over a time interval comprising steps of:
- determining, based on an activity measurement value and a set of time measurement values, a health indicator value that indicates the health of the computer system over the time interval;
wherein the activity measurement value is based on a number of sessions for which the computer system is actively performing work during the time interval;
wherein each time measurement value of the set of time measurement values is associated with a wait class of a plurality of wait classes and represents an amount of time during which said work performed by the computer system spends waiting in one of a plurality of wait classes;
determining an activity weighting factor based on a first scaling function of the activity measurement value, wherein the first scaling function scales the activity measurement value to a value between 0 and a first pre-determined upper bound;
determining an amount of work waiting on each wait class of the plurality of wait classes;
determining, based on a second scaling function of the amount of work waiting on said each wait class of the plurality of wait classes, a set of user impact weighting factors associated with said each wait class, wherein the second scaling function scales the amount of work waiting on said each wait class to a number between 0 and a second pre-determined upper bound; and
one or more computing devices computing the health indicator value over the time interval as a function of the set of time measurement values, the amount of work waiting on said each wait class of the plurality of wait classes, the set of user impact weighting factors, and the activity weighting factor.
1 Assignment
0 Petitions
Accused Products
Abstract
A method is provided for detecting when users are being adversely impacted by poor system performance. A system health indicator is determined that is based on the amount of work that is blocked waiting for each of a set of an external events and combined with a heuristic that is based on the number of users waiting for the work to complete. The system health indicator is compared to a threshold such that an alert is generated when the system health indicator crosses the threshold. However, the system health indicator is designed so that an alert is only generated when a significant user base is or will in the near future experience a problem with the system. Furthermore, the system health indicator is designed to vary smoothly to maintain its suitability for the application of predictive technology.
200 Citations
22 Claims
-
1. A computer-implemented method for measuring health of a computer system over a time interval comprising steps of:
-
determining, based on an activity measurement value and a set of time measurement values, a health indicator value that indicates the health of the computer system over the time interval; wherein the activity measurement value is based on a number of sessions for which the computer system is actively performing work during the time interval; wherein each time measurement value of the set of time measurement values is associated with a wait class of a plurality of wait classes and represents an amount of time during which said work performed by the computer system spends waiting in one of a plurality of wait classes; determining an activity weighting factor based on a first scaling function of the activity measurement value, wherein the first scaling function scales the activity measurement value to a value between 0 and a first pre-determined upper bound; determining an amount of work waiting on each wait class of the plurality of wait classes; determining, based on a second scaling function of the amount of work waiting on said each wait class of the plurality of wait classes, a set of user impact weighting factors associated with said each wait class, wherein the second scaling function scales the amount of work waiting on said each wait class to a number between 0 and a second pre-determined upper bound; and one or more computing devices computing the health indicator value over the time interval as a function of the set of time measurement values, the amount of work waiting on said each wait class of the plurality of wait classes, the set of user impact weighting factors, and the activity weighting factor. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A non-transitory computer-readable medium storing one or more instructions which when executed by one or more processors cause the one or more processors to perform steps comprising:
-
determining, based on an activity measurement value and a set of time measurement values, a health indicator value that indicates the health of the computer system over the time interval; wherein the activity measurement value is based on a number of sessions for which the computer system is actively performing work during the time interval; wherein each time measurement value of the set of time measurement values is associated with a wait class of a plurality of wait classes and represents an amount of time during which said work performed by the computer system spends waiting in one of a plurality of wait classes; determining an activity weighting factor based on a first scaling function of the activity measurement value, wherein the first scaling function scales the activity measurement value to a value between 0 and a first pre-determined upper bound; determining an amount of work waiting on each wait class of the plurality of wait classes; determining, based on a second scaling function of the amount of work waiting on said each wait class of the plurality of wait classes, a set of user impact weighting factors associated with said each wait class, wherein the second scaling function scales the amount of work waiting on said each wait class to a number between 0 and a second pre-determined upper bound; and one or more computing devices computing the health indicator value over the time interval as a function of the set of time measurement values, the amount of work waiting on said each wait class of the plurality of wait classes, the set of user impact weighting factors, and the activity weighting factor. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20, 21, 22)
-
Specification