STATISTICALLY-BASED ANOMALY DETECTION IN UTILITY CLOUDS
First Claim
1. A method for detecting anomalies in a large scale and cloud datacenter, the method comprising:
- collecting sample data within a look-back window;
normalizing the sample data to generate normalized data;
binning the normalized data into a plurality of bins defined by bin indices;
calculating a Gini coefficient for the look-back window;
calculating a Gini standard deviation dependent threshold; and
comparing the Gini coefficient to the Gini standard deviation dependent threshold to detect an anomaly in the sample data.
2 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods for detecting anomalies in a large scale and cloud datacenter are disclosed. Anomaly detection is performed in an automated, statistical-based manner by using a parametric Gini coefficient technique or a non-parametric Tukey technique. In the parametric Gini coefficient technique, sample data is collected within a look-back window. The sample data is normalized to generate normalized data, which is binned into a plurality of bins defined by bin indices. A Gini coefficient and a threshold are calculated for the look-back window and the Gini coefficient is compared to the threshold to detect an anomaly in the sample data. In the non-parametric Tukey technique, collected sample data is divided into quartiles and compared to adjustable Tukey thresholds to detect anomalies in the sample data.
39 Citations
20 Claims
-
1. A method for detecting anomalies in a large scale and cloud datacenter, the method comprising:
-
collecting sample data within a look-back window; normalizing the sample data to generate normalized data; binning the normalized data into a plurality of bins defined by bin indices; calculating a Gini coefficient for the look-back window; calculating a Gini standard deviation dependent threshold; and comparing the Gini coefficient to the Gini standard deviation dependent threshold to detect an anomaly in the sample data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system for detecting anomalies in a large scale and cloud datacenter, the system comprising:
-
a metrics collection module to collect metrics and monitoring data across the datacenter within a look-back window; a statistical-based anomaly detection module for detecting anomalies in the collected data, the statistical-based anomaly detection module comprising; a normalization module to generate normalized data from the collected data; a binning module to place the normalized data into a plurality of bins defined by bin indices; a Gini coefficient module to calculate a Gini coefficient for the look-back window; a threshold module to calculate a Gini standard deviation dependent threshold; and an anomaly alarm module to compare the Gini coefficient to the Gini standard deviation dependent threshold and generate an alarm when an anomaly in the collected data is detected; and a dashboard module to display the look-back window and the detected anomalies. - View Dependent Claims (12, 13, 14, 15, 16, 17)
-
-
18. A system for detecting anomalies in a large scale and cloud datacenter, the system comprising:
-
a metrics collection module to collect metrics and monitoring data across the datacenter within a look-back window; a data quartile module to divide the collected data in quartiles; a Tukey threshold module to generate adjustable thresholds; and an anomaly alarm module to compare the collected data in the quartiles to the thresholds and generate an alarm when an anomaly in the collected data is detected. - View Dependent Claims (19, 20)
-
Specification