Method and system for using web analytics data for detecting anomalies
First Claim
1. A computer-implemented method for identifying events of interest in web analytics data, comprising:
- at a computer server having one or more processors and memory for storing programs to be executed by the one or more processors;
storing web analytics data for a web page in a device, wherein the web analytics data comprises a plurality of prior time-value pairs, each time-value pair including a value of one of a plurality of attributes associated with the web page and a time associated with the value;
collecting a new time-value pair for the particular attribute, the new time-value pair including a new value associated with the web page and a new time when the value was determined;
estimating a predicted value for the attribute and associated error-variance at the new time for the predicted value by applying a forecasting model to the plurality of prior time-value pairs in respective subsets of the web analytics data; and
tagging the collected new time-value pair if the value of the new time-value pair is outside the error variance of the predicted value of the forecasting model for the particular attribute;
adding to the collected web analytics data for the web page the new time-value pair including a tag indicating whether the new value is an anomaly and a significance factor if the new value is an anomaly, wherein the significance factor is associated with the error variance of the predicted value of the forecasting model;
for the new time-value pair and the particular attribute, determining the significance factor such that, when the error-variance is multiplied by the significance factor, the value of the time-value pair is inside the factored error-variance of the predicted value; and
in response to a request from a client application for analytics information, the request including a significance threshold for one or more of the attributes, reporting to the client application the new time-value pair if the significance factor for the time-value pair exceeds the significance threshold.
2 Assignments
0 Petitions
Accused Products
Abstract
A server system stores web analytics data for a web page in a device. The web analytics data comprises a plurality of prior time-value pairs, each pair including a value of an attribute associated with the web page and a time associated with the value. For a particular attribute, the server system collects a new time-value pair including a new value associated with the web page and a new time indicating when the value was determined. The server system estimates a predicted value for the attribute and an associated error-variance at the new time by applying a forecasting model to the prior time-value pairs in respective subsets of the web analytics data. The collected new time-value pair is tagged if its value is outside the error variance of the predicted value for the particular attribute.
52 Citations
21 Claims
-
1. A computer-implemented method for identifying events of interest in web analytics data, comprising:
at a computer server having one or more processors and memory for storing programs to be executed by the one or more processors; storing web analytics data for a web page in a device, wherein the web analytics data comprises a plurality of prior time-value pairs, each time-value pair including a value of one of a plurality of attributes associated with the web page and a time associated with the value; collecting a new time-value pair for the particular attribute, the new time-value pair including a new value associated with the web page and a new time when the value was determined; estimating a predicted value for the attribute and associated error-variance at the new time for the predicted value by applying a forecasting model to the plurality of prior time-value pairs in respective subsets of the web analytics data; and tagging the collected new time-value pair if the value of the new time-value pair is outside the error variance of the predicted value of the forecasting model for the particular attribute; adding to the collected web analytics data for the web page the new time-value pair including a tag indicating whether the new value is an anomaly and a significance factor if the new value is an anomaly, wherein the significance factor is associated with the error variance of the predicted value of the forecasting model; for the new time-value pair and the particular attribute, determining the significance factor such that, when the error-variance is multiplied by the significance factor, the value of the time-value pair is inside the factored error-variance of the predicted value; and in response to a request from a client application for analytics information, the request including a significance threshold for one or more of the attributes, reporting to the client application the new time-value pair if the significance factor for the time-value pair exceeds the significance threshold. - View Dependent Claims (2, 3, 4, 5, 20)
-
6. A server system for identifying anomalies in web analytics data, wherein the server system is connected to one or more client devices through a network, comprising:
-
one or more processors for executing programs; and memory to store data and to store one or more programs to be executed by the one or more processors, the one or more programs including instructions for; storing web analytics data for a web page in a device, wherein the web analytics data comprises a plurality of prior time-value pairs, each time-value pair including a value of one of a plurality of attributes associated with the web page and a time associated with the value; collecting a new time-value pair for the particular attribute, the new time-value pair including a new value associated with the web page and a new time when the value was determined; estimating a predicted value for the attribute and associated error-variance at the new time for the predicted value by applying a forecasting model to the plurality of prior time-value pairs in respective subsets of the web analytics data; tagging the collected new time-value pair if the value of the new time-value pair is outside the error variance of the predicted value of the forecasting model for the particular attribute; adding to the collected web analytics data for the web page the new time-value pair including a tag indicating whether the new value is an anomaly and a significance factor if the new value is an anomaly, wherein the significance factor is associated with the error variance of the predicted value of the forecasting model; for the new time-value pair and the particular attribute, determining the significance factor such that, when the error-variance is multiplied by the significance factor, the value of the time-value pair is inside the factored error-variance of the predicted value; and in response to a request from a client application for analytics information, the request including a significance threshold for one or more of the attributes, reporting to the client application the new time-value pair if the significance factor for the time-value pair exceeds the significance threshold. - View Dependent Claims (7, 8, 9)
-
-
10. A non-transitory computer readable-storage medium storing one or more programs for execution by one or more processors, the one or more programs comprising instructions for:
-
storing web analytics data for a web page in a device, wherein the web analytics data comprises a plurality of prior time-value pairs, each time-value pair including a value of one of a plurality of attributes associated with the web page and a time associated with the value; collecting a new time-value pair for the particular attribute, the new time-value pair including a new value associated with the web page and a new time when the value was determined; estimating a predicted value for the attribute and associated error-variance at the new time for the predicted value by applying a forecasting model to the plurality of prior time value pairs in respective subsets of the web analytics data; and tagging the collected new time-value pair if the value of the new time-value pair is outside the error variance of the predicted value of the forecasting model for the particular attribute; adding to the collected web analytics data for the web page the new time-value pair including a tag indicating whether the new value is an anomaly and a significance factor if the new value is an anomaly, wherein the significance factor is associated with the error variance of the predicted value of the forecasting model for the new time-value pair and the particular attribute, determining the significance factor such that, when the error-variance is multiplied by the significance factor, the value of the time-value pair is inside the factored error-variance of the predicted value; and in response to a request from a client application for analytics information, the request including a significance threshold for one or more of the attributes, reporting to the client application the new time-value pair if the significance factor for the time-value pair exceeds the significance threshold. - View Dependent Claims (11, 12, 13)
-
-
14. A computer-implemented method for identifying events in web analytics data, comprising:
-
at a computer server having one or more processors and memory for storing programs to be executed by the one or more processors; storing web analytics data for a web page in a device, wherein the web analytics data comprises a plurality of prior time-value pairs, each time-value pair including a value of one of a plurality of attributes associated with the web page and a time associated with the value; collecting a new time-value pair for a particular attribute, the new time-value pair including a new value associated with the web page and a new time when the value was determined; estimating a predicted value for the attribute at the new time by applying a forecasting model to the plurality of prior time-value pairs in a respective subset of the web analytics data; tagging the collected new time-value pair as an event of potential interest when a difference between the value of the new time-value pair and the predicted value at the new time meets a predefined condition, wherein the predefined condition comprises the value of the new time-value pair being outside a range centered at the predicted value and the range is determined, at least in part, by a significance factor associated with the forecasting model; and generating an alert for the event of potential interest when an alert condition based on the value of the new time-value pair is satisfied. - View Dependent Claims (15, 16, 17, 18, 19, 21)
-
Specification