Proactive and adaptive cloud monitoring
First Claim
1. A process comprising:
- an active monitoring component receiving one or more metrics associated with each of a first active functional component and a second active functional component of a plurality of active functional components of a system, wherein the first active functional component contributes to a different functionality of the system than the second active functional component;
based at least in part on the one or more metrics associated with a particular active functional component of the first active functional component or the second active functional component, the active monitoring component determining that the particular active functional component has reached a likelihood of failure but has not failed;
in response to determining that the particular active functional component has reached the likelihood of failure but has not failed, the active monitoring component causing a set of one or more actions that are predicted to reduce the likelihood of failure;
wherein the likelihood of failure is a first likelihood of failure;
wherein the set of one or more actions is a first set of one or more actions;
the active monitoring component receiving one or more updated metrics associated with the particular active functional component;
based at least in part on the one or more updated metrics associated with the particular active functional component, the active monitoring component determining that the particular active functional component has reached a second likelihood of failure but has not failed, wherein the second likelihood of failure is greater than at the first likelihood of failure;
in response to determining that the particular active functional component has reached the second likelihood of failure, the active monitoring component causing a second set of one or more actions that are predicted to reduce the second likelihood of failure;
wherein, at any given likelihood of failure of the particular active functional component, the second set of one or more actions has a greater risk of causing the system to fail than the first set of one or more actions;
wherein the process is performed by one or more computing devices.
1 Assignment
0 Petitions
Accused Products
Abstract
Processes, computer-readable media, and machines are disclosed for reducing a likelihood that active functional components fail in a computing system. An active monitoring component receives metrics associated with different active functional components of a computing system. The different active functional components contribute to different functionalities of the system. Based at least in part on the metrics associated with a particular active functional component, the active monitoring component determines that the particular active functional component has reached a likelihood of failure but has not failed. In response to determining that the particular active functional component has reached the likelihood of failure but has not failed, the active monitoring component causes a set of actions that are predicted to reduce the likelihood of failure.
-
Citations
22 Claims
-
1. A process comprising:
-
an active monitoring component receiving one or more metrics associated with each of a first active functional component and a second active functional component of a plurality of active functional components of a system, wherein the first active functional component contributes to a different functionality of the system than the second active functional component; based at least in part on the one or more metrics associated with a particular active functional component of the first active functional component or the second active functional component, the active monitoring component determining that the particular active functional component has reached a likelihood of failure but has not failed; in response to determining that the particular active functional component has reached the likelihood of failure but has not failed, the active monitoring component causing a set of one or more actions that are predicted to reduce the likelihood of failure; wherein the likelihood of failure is a first likelihood of failure; wherein the set of one or more actions is a first set of one or more actions; the active monitoring component receiving one or more updated metrics associated with the particular active functional component; based at least in part on the one or more updated metrics associated with the particular active functional component, the active monitoring component determining that the particular active functional component has reached a second likelihood of failure but has not failed, wherein the second likelihood of failure is greater than at the first likelihood of failure; in response to determining that the particular active functional component has reached the second likelihood of failure, the active monitoring component causing a second set of one or more actions that are predicted to reduce the second likelihood of failure; wherein, at any given likelihood of failure of the particular active functional component, the second set of one or more actions has a greater risk of causing the system to fail than the first set of one or more actions; wherein the process is performed by one or more computing devices. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 21)
-
-
11. One or more non-transitory storage media storing instructions which, when executed by one or more computing devices, cause:
-
an active monitoring component receiving one or more metrics associated with each of a first active functional component and a second active functional component of a plurality of active functional components of a system, wherein the first active functional component contributes to a different functionality of the system than the second active functional component; based at least in part on the one or more metrics associated with a particular active functional component of the first active functional component or the second active functional component, the active monitoring component determining that the particular active functional component has reached a likelihood of failure but has not failed; in response to determining that the particular active functional component has reached the likelihood of failure but has not failed, the active monitoring component causing a set of one or more actions that are predicted to reduce the likelihood of failure; wherein the likelihood of failure is a first likelihood of failure; wherein the set of one or more actions is a first set of one or more actions; the active monitoring component receiving one or more updated metrics associated with the particular active functional component; based at least in part on the one or more updated metrics associated with the particular active functional component, the active monitoring component determining that the particular active functional component has reached a second likelihood of failure but has not failed, wherein the second likelihood of failure is greater than at the first likelihood of failure; in response to determining that the particular active functional component has reached the second likelihood of failure, the active monitoring component causing a second set of one or more actions that are predicted to reduce the second likelihood of failure; wherein, at any given likelihood of failure of the particular active functional component, the second set of one or more actions has a greater risk of causing the system to fail than the first set of one or more actions. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20, 22)
-
Specification