Method and apparatus for efficient reactive monitoring
Abstract
A technique for managing network elements significantly reduces the amount of monitoring-related traffic by combining aperiodic polling with asynchronous event reporting. A global resource (e.g., a network of interconnected nodes or resources) is partitioned into a plurality of separate nodes, and each node is given a fixed resource budget. When any node exceeds its budget, as determined by local monitoring at that node, the node triggers a report, typically by sending a message to a central manager. In response, and only then, the central manager issues a global poll of all (or substantially all) of the nodes in the network. A rate-based technique can also be used to monitor resource usage at the nodes, sending a message to a central monitoring location only when the value of a local variable changes at too high a rate.
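The budget-triggered polling described in the abstract can be illustrated with a minimal sketch. The class names `Node` and `Manager`, the helper `step`, and the numeric values are hypothetical and chosen only for illustration; the abstract does not prescribe any particular data structure or message format.

```python
from dataclasses import dataclass


@dataclass
class Node:
    """A monitored node with a fixed local budget (hypothetical structure)."""
    name: str
    budget: float
    usage: float = 0.0

    def over_budget(self) -> bool:
        # Local check: no monitoring traffic is needed unless this is True.
        return self.usage > self.budget


class Manager:
    """Central manager that polls only after a node reports (assumed design)."""

    def __init__(self, nodes, global_threshold):
        self.nodes = list(nodes)
        self.global_threshold = global_threshold
        self.polls = 0  # number of global polls issued so far

    def on_report(self, node):
        # A report from any node triggers a global poll of all nodes.
        self.polls += 1
        total = sum(n.usage for n in self.nodes)
        return total > self.global_threshold  # True means "raise an alarm"


def step(manager, node, new_usage):
    """Update one node's usage; return the alarm status, or None if no report was sent."""
    node.usage = new_usage
    if node.over_budget():
        return manager.on_report(node)
    return None


nodes = [Node("a", 10), Node("b", 10), Node("c", 10)]
manager = Manager(nodes, global_threshold=30)
quiet = step(manager, nodes[0], 5)      # within budget, no report
reported = step(manager, nodes[0], 12)  # over budget, triggers a global poll
```

In this sketch the sum of the local budgets equals the global threshold, so the global threshold cannot be exceeded while every node stays within its budget; that choice is an assumption of the example.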
9 Claims
1. A method for monitoring usage of resources allocated to a plurality of nodes of a network, comprising the steps of:
assigning a parameter to each of a plurality of nodes of the network, wherein each parameter is indicative of a rate of change of usage of said resources of the node;
locally monitoring, at each of the nodes, the rate of change of the usage of said resources of the node;
reporting to a centralized management station of the network when the rate of change of the usage of the resources of one of the nodes exceeds a first threshold;
initiating a poll of resources of nodes of the network by the centralized management station in response to reporting from the node or a time interval being exceeded;
determining whether a sum of the currently reported rates of change of usage of node resources, received in response to the poll initiated by the management station, exceeds a second threshold; and
generating an alarm if the sum of the currently reported rates of change of usage of node resources exceeds the second threshold, else updating the time interval.
Dependent claims: 2
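The steps of claim 1 might be sketched as follows. The names `ManagementStation` and `node_should_report` are hypothetical, and the claim does not say how the time interval is updated; restarting the timer from the current time is purely an assumption made for this illustration.

```python
class ManagementStation:
    """Central station for the rate-based scheme (hypothetical sketch)."""

    def __init__(self, second_threshold, interval, now=0.0):
        self.second_threshold = second_threshold
        self.interval = interval
        self.next_poll_at = now + interval

    def should_poll(self, now, node_reported):
        # Poll on a node report or once the time interval has elapsed.
        return node_reported or now >= self.next_poll_at

    def handle_poll(self, now, polled_rates):
        """Return True for an alarm; otherwise update the interval deadline."""
        if sum(polled_rates) > self.second_threshold:
            return True
        # Assumption: "updating the time interval" is modeled as restarting the timer.
        self.next_poll_at = now + self.interval
        return False


def node_should_report(rate, first_threshold):
    # Local check performed at each node against the first threshold.
    return rate > first_threshold


station = ManagementStation(second_threshold=10.0, interval=60.0)
triggered = station.should_poll(5.0, node_should_report(4.0, first_threshold=3.0))
alarm = station.handle_poll(5.0, [4.0, 1.0, 2.0])
```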
3. A method for monitoring usage of a resource in nodes of a network, comprising the steps of:
(a) monitoring usage of the resource in a node to determine when a rate of change of the usage exceeds a first predetermined threshold;
(b) reporting to a management station of the network when the rate of change of the usage exceeds said first predetermined threshold; and
(c) initiating a poll of resources in the nodes of the network by the management station in response to reporting from the node or a time interval being exceeded.
Dependent claims: 9
4. A method for monitoring usage of resources in nodes of a network, comprising the steps of:
asynchronous reporting of an event to a management station of the network when a rate of change of a usage of at least one resource of said resources in any of said nodes deviates from a prescribed norm; and
periodic polling of said nodes in accordance with a polling interval, and aperiodic polling of said nodes in response to reporting of said event, wherein a tunable parameter is adjusted in response to the usage.
Dependent claims: 7, 8
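The combination of periodic and aperiodic polling in claim 4 could look roughly like the sketch below. The claim does not identify the tunable parameter or how it is adjusted; treating the polling interval as that parameter and shrinking it linearly as usage approaches a threshold is an assumption of this example, as are the function names `poll_reason` and `next_interval`.

```python
def poll_reason(now, last_poll, interval, event_reported):
    """Return why a poll should happen now, or None if no poll is due."""
    if event_reported:
        return "aperiodic"  # triggered by an asynchronous event report
    if now - last_poll >= interval:
        return "periodic"   # triggered by the polling interval
    return None


def next_interval(base_interval, usage, threshold, min_interval=1.0):
    """Hypothetical tuning rule: poll more often as usage nears the threshold."""
    headroom = max(0.0, 1.0 - usage / threshold)
    return max(min_interval, base_interval * headroom)


reason = poll_reason(now=10.0, last_poll=0.0, interval=30.0, event_reported=True)
tuned = next_interval(base_interval=60.0, usage=50.0, threshold=100.0)
```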
5. A method for managing a global resource of a network in order to reduce the amount of monitoring related traffic, comprising the steps of:
assigning a local threshold to each of a plurality of node resources of a respective plurality of nodes of the network;
reporting to a management station of the network when a value indicative of node resource usage exceeds the assigned local threshold as determined using local monitoring of the node resource;
initiating a poll, by the management station, of node resource usage by the nodes of the network in response to a determination that a sum of previously reported values indicative of node resource usage received from reporting nodes plus an upper bound of node resource usage for non-reporting nodes exceeds a threshold; and
generating an alarm if the sum of the currently reported values indicative of node resource usage, received in response to the poll initiated by the management station, exceeds the threshold.
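The poll decision of claim 5 can be sketched as below. Using each non-reporting node's local threshold as its upper bound is an assumption of this example (a node that has not reported is taken to be at or below its threshold); the claim itself only requires some upper bound. The function names and values are hypothetical.

```python
def should_poll(local_thresholds, reported, global_threshold):
    """Decide whether the management station should poll all nodes.

    local_thresholds: node name -> assigned local threshold
    reported: node name -> last reported usage (reporting nodes only)
    """
    # Reporting nodes contribute their reported value; non-reporting nodes
    # contribute an upper bound (assumed here to be the local threshold).
    estimate = sum(reported.get(name, bound)
                   for name, bound in local_thresholds.items())
    return estimate > global_threshold


def alarm_after_poll(polled_values, global_threshold):
    # After the poll, the alarm decision uses the actual polled values.
    return sum(polled_values.values()) > global_threshold


thresholds = {"a": 10.0, "b": 10.0, "c": 10.0}
one_report = should_poll(thresholds, {"a": 12.0}, global_threshold=35.0)
two_reports = should_poll(thresholds, {"a": 12.0, "b": 14.0}, global_threshold=35.0)
alarm = alarm_after_poll({"a": 12.0, "b": 14.0, "c": 3.0}, global_threshold=35.0)
```

In this run the second report pushes the conservative estimate past the threshold and triggers a poll, but the polled values show that the actual total is still below it, so no alarm is generated.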
6. A method for managing a global resource of a network in order to reduce the amount of monitoring related traffic, comprising the steps of:
assigning a local threshold to each of a plurality of node resources of a respective plurality of nodes of the network;
reporting to a management station of the network when a rate of change of usage of said node resource exceeds the local threshold as determined using local monitoring of the node resource, wherein said rate of change of usage of said node resource is determined using a variable time interval comprising a difference between a current time and a time at which the node was last polled by the management station;
initiating a poll, by the management station, of the node resource usage of the nodes of the network in response to receiving reporting from one of the nodes or a time interval being exceeded;
determining whether a sum of the currently reported rates of change of usage of node resources, received in response to the poll initiated by the management station, exceeds a threshold; and
generating an alarm if the sum of the currently reported rates of change of usage of node resources exceeds the threshold.
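The variable time interval of claim 6 can be illustrated with a small node-side tracker. Computing the rate as the change in usage since the last poll divided by the time elapsed since that poll is an assumption of this sketch, and the class name `NodeRateTracker` and its methods are hypothetical.

```python
class NodeRateTracker:
    """Tracks usage relative to the last poll by the management station."""

    def __init__(self, local_threshold, usage=0.0, now=0.0):
        self.local_threshold = local_threshold
        self.usage_at_poll = usage
        self.last_poll_time = now

    def on_poll(self, usage, now):
        # Each poll resets the baseline used for the rate computation.
        self.usage_at_poll = usage
        self.last_poll_time = now

    def rate(self, usage, now):
        # Variable interval: current time minus the time of the last poll.
        elapsed = now - self.last_poll_time
        if elapsed <= 0:
            return 0.0  # no time has passed; treat the rate as zero
        return (usage - self.usage_at_poll) / elapsed

    def should_report(self, usage, now):
        return self.rate(usage, now) > self.local_threshold


tracker = NodeRateTracker(local_threshold=2.0)
slow = tracker.should_report(usage=10.0, now=10.0)  # rate 1.0 per time unit
fast = tracker.should_report(usage=30.0, now=10.0)  # rate 3.0 per time unit
tracker.on_poll(usage=30.0, now=10.0)
after_poll = tracker.rate(usage=32.0, now=14.0)     # measured from the new baseline
```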
Specification