Generating an operational definition of baseline for monitoring network traffic data
First Claim
1. A method executed on a computer including at least one processor for establishing a baseline of network traffic data, the method comprising:
- generating a configurable static window for grouping said network traffic data;
extracting, grouping and ordering said network traffic data based on the configurable static window;
generating a dynamic window for clustering said groups of network traffic data based on statistical similarity of groups;
clustering said groups of network traffic data based on the dynamic window;
determining a transformation function from historical network traffic data using feedback control, the transformation function making a data set having a normal distribution or a nearly normal distribution;
generating transformed network traffic data associated with a distribution having the normal distribution or having the nearly normal distribution by applying the transformation function to said network traffic data within a cluster to map the network traffic data within the cluster into the data set having the normal distribution or having a nearly normal distribution;
calculating an error indicating a similarity between the distribution associated with the transformed network traffic data and the normal distribution, and responsive to the error equaling or exceeding a threshold, modifying the transformation function to modify the distribution associated with the transformed network traffic data to reduce the error; and
generating a baseline of network traffic data for each cluster from the transformed network traffic data.
3 Assignments
0 Petitions
Accused Products
Abstract
A system and a method are disclosed for establishing a baseline and the corresponding bands of data for alarming, etc. Historical raw data are aggregated and grouped. For example, the data may be and hourly grouped as 168 groups of data in a weekly frame. Clusters of the groups of data are then formed based on dynamic data window by analyzing statistical similarity among the 168 groups of data. Data in each cluster of groups, originated from the raw data at specific hour(s) of day on specific day(s) of week, are used as historical data to predict a baseline and the envelopes at these associated hour(s) and day(s). Generating a baseline includes determining a mapping function, which transforms data in a cluster to become normal or nearly normal. A mean and standard deviation of the transformed data are calculated. Envelopes are determined using the mean and the standard deviation. An inverse transformation function is uniquely derived. The mean and the envelopes are inversely transformed using the inverse function. This operationally decides a baseline and the corresponding bands for every weekly time frame hour.
-
Citations
24 Claims
-
1. A method executed on a computer including at least one processor for establishing a baseline of network traffic data, the method comprising:
-
generating a configurable static window for grouping said network traffic data; extracting, grouping and ordering said network traffic data based on the configurable static window; generating a dynamic window for clustering said groups of network traffic data based on statistical similarity of groups; clustering said groups of network traffic data based on the dynamic window; determining a transformation function from historical network traffic data using feedback control, the transformation function making a data set having a normal distribution or a nearly normal distribution; generating transformed network traffic data associated with a distribution having the normal distribution or having the nearly normal distribution by applying the transformation function to said network traffic data within a cluster to map the network traffic data within the cluster into the data set having the normal distribution or having a nearly normal distribution; calculating an error indicating a similarity between the distribution associated with the transformed network traffic data and the normal distribution, and responsive to the error equaling or exceeding a threshold, modifying the transformation function to modify the distribution associated with the transformed network traffic data to reduce the error; and generating a baseline of network traffic data for each cluster from the transformed network traffic data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A computer program product for use in conjunction with a computer system, the computer program product comprising a non-transitory computer readable storage medium including a computer program mechanism executable by a processor embedded therein, the computer program mechanism including:
-
instructions for generating a configurable static window for grouping network traffic data; instructions for extracting, grouping and ordering said network traffic data based on the configurable static window; instructions for generating a dynamic window for clustering said groups of network traffic data based on statistical similarity of groups; instructions for clustering said groups of network traffic data based on the dynamic window; instructions for determining a transformation function from historical network traffic data using feedback control, the transformation function making a data set having a normal distribution or a nearly normal distribution; instructions for generating transformed network traffic data associated with a distribution having the normal or having the nearly normal distribution by applying the transformation function to said network traffic data within a cluster to map the network traffic data within the cluster to the data set having the normal distribution or having the nearly normal distribution; instructions for calculating an error indicating a similarity between the distribution associated with the transformed network traffic data and the normal distribution, and responsive to the error equaling or exceeding a threshold, modifying the transformation function to modify the distribution associated with the transformed network traffic data to reduce the error; and instructions for generating a baseline of network traffic data for each cluster from the transformed network traffic data. - View Dependent Claims (14, 15, 16, 17, 18, 19)
-
-
20. A system comprising:
-
a processor for executing programs; a module executable by the processor, the module including; instructions for generating a configurable static window for grouping network traffic data; instructions for extracting, grouping and ordering said network traffic data based on the configurable static window; instructions for generating a dynamic window for clustering said groups of network traffic data based on statistical similarity of groups; instructions for clustering said groups of network traffic data based on the dynamic window; instructions for determining a transformation function from historical network traffic data using feedback control, the transformation function making a data set having a normal distribution or a nearly normal distribution; instructions for generating transformed network traffic data associated with a distribution having the normal distribution or having the nearly normal distribution by applying the transformation function to said network traffic data within a cluster to map the network traffic data within the cluster into the data set having the normal distribution or having the nearly normal distribution; instructions for calculating an error indicating a similarity between the distribution associated with the transformed network traffic data and the normal distribution, and responsive to the error equaling or exceeding a threshold, modifying the transformation function to modify the distribution associated with the transformed network traffic data to reduce the error; and instructions for generating a baseline of network traffic data for each cluster from the transformed network traffic data. - View Dependent Claims (21, 22, 23, 24)
-
Specification