METHOD FOR PERFORMING A CORRECTIVE ACTION UPON A SUB-SYSTEM
First Claim
1. A method for monitoring the health of a server comprising:
- maintaining a server having a sub-system and a server self health monitor;
detecting the occurrence of a health check event by the server self health monitor;
transmitting a request by the server self health monitor to the sub-system for sub-system health information;
determining the health of the server by the server self health monitor using the sub-system health information; and
performing a corrective action upon the sub-system, by the server health monitor or the sub-system, wherein the corrective action is based on the health of the sub-system; and
wherein a first parameter specifies the maximum number of times a server can be restarted within a period of time specified by a second parameter.
1 Assignment
0 Petitions
Accused Products
Abstract
A server self health monitor (SHM) system monitors the health of the server it resides on. The health of a server is determined by the health of all of a server'"'"'s sub-systems and deployed applications. The SHM may make health check inquiries to server sub-systems periodically or based on external trigger events. The sub-systems perform self health checks on themselves and provide sub-system health information to requesting entities such as the SHM. Sub-systems self health updates may be based on internal events such as counters or changes in status or based on external entity requests. Corrective action may be performed upon sub-systems by the SHM depending on their health status or the health status of the server. Corrective action may also be performed by a sub-system upon itself.
-
Citations
20 Claims
-
1. A method for monitoring the health of a server comprising:
-
maintaining a server having a sub-system and a server self health monitor; detecting the occurrence of a health check event by the server self health monitor; transmitting a request by the server self health monitor to the sub-system for sub-system health information; determining the health of the server by the server self health monitor using the sub-system health information; and performing a corrective action upon the sub-system, by the server health monitor or the sub-system, wherein the corrective action is based on the health of the sub-system; and wherein a first parameter specifies the maximum number of times a server can be restarted within a period of time specified by a second parameter. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 19, 20)
-
-
18. The method of claim 18, wherein the sub-system is set to a critical level if a minimum number of transactions have timed out.
Specification