Assessing detectability of malware related traffic
First Claim
1. A method, comprising:
- training, by a computing device, a multi-class classifier on a training dataset, the multi-class classifier having a plurality of classes;
evaluating, by the computing device, the multi-class classifier on a testing dataset to determine a performance of each class of the plurality of classes of the multi-class classifier;
partitioning, by the computing device, the plurality of classes into either learnable or unlearnable based on whether the performance each particular class surpasses a particular threshold;
training, by the computing device, a predicting classifier on the training dataset, wherein data of the training dataset is labelled as either learnable or unlearnable based on the particular class to which the data corresponds;
using, by the computing device, the predicting classifier on a new class to predict whether samples associated with the new class are learnable or unlearnable; and
retraining, by the computing device, the multi-class classifier with the samples associated with the new class in response to predicting that the samples are learnable.
1 Assignment
0 Petitions
Accused Products
Abstract
In one embodiment, a computing device trains a multi-class classifier (having a plurality of classes) on a training dataset, and evaluates the multi-class classifier on a testing dataset to determine a performance of each of the plurality of classes. The plurality of classes may then be partitioned into either learnable or unlearnable based on whether the performance each particular class surpasses a particular threshold, and then a predicting classifier can be trained on the training dataset, where data of the training dataset is labelled as either learnable or unlearnable based on the particular class to which the data corresponds. Accordingly, the computing device may then use the predicting classifier on a new class to predict whether samples associated with the new class are learnable or unlearnable, and may retrain the multi-class classifier with the samples associated with the new class in response to predicting that the samples are learnable.
9 Citations
20 Claims
-
1. A method, comprising:
-
training, by a computing device, a multi-class classifier on a training dataset, the multi-class classifier having a plurality of classes; evaluating, by the computing device, the multi-class classifier on a testing dataset to determine a performance of each class of the plurality of classes of the multi-class classifier; partitioning, by the computing device, the plurality of classes into either learnable or unlearnable based on whether the performance each particular class surpasses a particular threshold; training, by the computing device, a predicting classifier on the training dataset, wherein data of the training dataset is labelled as either learnable or unlearnable based on the particular class to which the data corresponds; using, by the computing device, the predicting classifier on a new class to predict whether samples associated with the new class are learnable or unlearnable; and retraining, by the computing device, the multi-class classifier with the samples associated with the new class in response to predicting that the samples are learnable. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A tangible, non-transitory, computer-readable medium storing program instructions that cause a computer to execute a process comprising:
-
training a multi-class classifier on a training dataset, the multi-class classifier having a plurality of classes; evaluating the multi-class classifier on a testing dataset to determine a performance of each class of the plurality of classes of the multi-class classifier; partitioning the plurality of classes into either learnable or unlearnable based on whether the performance each particular class surpasses a particular threshold; training a predicting classifier on the training dataset, wherein data of the training dataset is labelled as either learnable or unlearnable based on the particular class to which the data corresponds; using the predicting classifier on a new class to predict whether samples associated with the new class are learnable or unlearnable; and retraining the multi-class classifier with the samples associated with the new class in response to predicting that the samples are learnable. - View Dependent Claims (14, 15, 16)
-
-
17. An apparatus, comprising:
-
one or more network interfaces to communicate with a computer network; a processor coupled to the network interfaces and configured to execute one or more process; and a memory configured to store a process executable by the processor, the process when executed configured to; train a multi-class classifier on a training dataset, the multi-class classifier having a plurality of classes; evaluate the multi-class classifier on a testing dataset to determine a performance of each class of the plurality of classes of the multi-class classifier; partition the plurality of classes into either learnable or unlearnable based on whether the performance each particular class surpasses a particular threshold; train a predicting classifier on the training dataset, wherein data of the training dataset is labelled as either learnable or unlearnable based on the particular class to which the data corresponds; use the predicting classifier on a new class to predict whether samples associated with the new class are learnable or unlearnable; and retrain the multi-class classifier with the samples associated with the new class in response to predicting that the samples are learnable. - View Dependent Claims (18, 19, 20)
-
Specification