Auto-tune anomaly detection
First Claim
1. A method comprising:
- storing training data that comprises a plurality of training instances, each of which comprises a severity-duration pair and a label that indicates whether the severity-duration pair represents an anomaly;
using one or more machine learning techniques to train a model based on a first subset of the training data;
identifying a second subset of the training data, wherein each training instance in the second subset includes a positive label that indicates that said each training instance represents an anomaly;
based on the second subset of the training data, generating, using the model, a plurality of scores, wherein each score corresponds to a different training instance in the second subset;
identifying a minimum score of the plurality of scores that ensures a particular recall rate relative to training instances in the second subset;
in response to receiving a particular severity-duration pair, using the model to generate a particular score for the particular severity-duration pair;
generating a notification of an anomaly if the particular score is greater than the minimum score;
wherein the method is performed by one or more computing devices.
1 Assignment
0 Petitions
Accused Products
Abstract
Techniques for auto-tuning anomaly detection are provided. In one technique, training data is stored that comprises training instances, each of which comprises a severity-duration pair and a label that indicates whether the severity-duration pair represents an anomaly. A model is trained based on a first subset of the training data. A second subset of the training data is identified where each training instance includes a positive label that indicates that that training instance represents an anomaly. Based on the second subset of the training data, the model generates multiple scores, each of which corresponds to a different training instance. A minimum score is identified that ensures a particular recall rate of the model. In response to receiving a particular severity-duration pair, the model generates a particular score for the particular severity-duration pair. A notification of an anomaly is generated if the particular score is greater than the minimum score.
-
Citations
20 Claims
-
1. A method comprising:
-
storing training data that comprises a plurality of training instances, each of which comprises a severity-duration pair and a label that indicates whether the severity-duration pair represents an anomaly; using one or more machine learning techniques to train a model based on a first subset of the training data; identifying a second subset of the training data, wherein each training instance in the second subset includes a positive label that indicates that said each training instance represents an anomaly; based on the second subset of the training data, generating, using the model, a plurality of scores, wherein each score corresponds to a different training instance in the second subset; identifying a minimum score of the plurality of scores that ensures a particular recall rate relative to training instances in the second subset; in response to receiving a particular severity-duration pair, using the model to generate a particular score for the particular severity-duration pair; generating a notification of an anomaly if the particular score is greater than the minimum score; wherein the method is performed by one or more computing devices. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. One or more storage media storing instructions which, when executed by one or more processors, cause:
-
storing training data that comprises a plurality of training instances, each of which comprises a severity-duration pair and a label that indicates whether the severity-duration pair represents an anomaly; using one or more machine learning techniques to train a model based on at least a portion of the training data; in response to receiving a particular severity-duration pair, using the model to generate a particular score for the particular severity-duration pair; generating a notification of an anomaly if the particular score is greater than a particular threshold; receiving, from a remote computing device, user-provided anomaly data; in response to receiving the user-provided anomaly data and based on the user-provided anomaly data, generating a new training instance that comprises a particular severity, a particular duration, and a particular label that indicates whether the particular severity and the particular duration represents an anomaly; using the one or more machine learning techniques to train an updated model based on the new training instance. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
Specification