Automatically grouping malware based on artifacts

  • US 10,581,892 B2
  • Filed: 01/18/2019
  • Issued: 03/03/2020
  • Est. Priority Date: 02/29/2016
  • Status: Active Grant
First Claim

1. A computer-implemented method, comprising:

  • clustering a plurality of samples based on a plurality of features associated with malware, wherein each of the features corresponds to a line or a sub-line in one or more log files determined to be an artifact associated with malware based on an automated malware analysis, wherein clustering the plurality of samples based on the plurality of features further comprises;

    selecting one or more of the plurality of features and assigning values to each indicator, wherein selecting one or more of the plurality of features includes performing a pre-filtering operation to select the plurality of features for clustering based on a threshold association between the line or the sub-line in the one or more of the log files and known malware;

    collecting the assigned values in an array for each of the plurality of samples;

    comparing the assigned values of the array between two of the plurality of samples; and

    calculating a distance between the two samples, wherein the samples within a defined threshold of distance are clustered; and

  • performing an action based on an output of clustering the plurality of samples based on the plurality of features, wherein the action based on the output of clustering the plurality of samples based on the plurality of features further comprises validate the output of clustering the plurality of samples based on the plurality of features based on tags to identify previously identified malware groups.
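
The steps recited in the claim can be sketched roughly as follows. This is an illustrative reading only, not the patented implementation: the claim does not specify the value assignment, the distance metric, or the clustering rule, so the sketch assumes binary values (1 if a sample exhibits a feature), Jaccard distance, and single-linkage grouping. All function names, feature strings, thresholds, and tags are hypothetical.

```python
from itertools import combinations

def prefilter(feature_stats, min_malware_ratio=0.8):
    """Keep features (log lines or sub-lines) seen mostly in known malware.

    feature_stats maps feature -> (hits in known malware, total hits).
    The 0.8 ratio is an assumed threshold, not taken from the claim.
    """
    return sorted(
        name for name, (malware_hits, total_hits) in feature_stats.items()
        if total_hits and malware_hits / total_hits >= min_malware_ratio
    )

def to_array(sample_features, selected):
    """Assign 1 if the sample exhibits the feature, else 0 (assumed encoding)."""
    return [1 if f in sample_features else 0 for f in selected]

def distance(a, b):
    """Jaccard distance between two binary arrays (assumed metric)."""
    union = sum(1 for x, y in zip(a, b) if x or y)
    inter = sum(1 for x, y in zip(a, b) if x and y)
    return 0.0 if union == 0 else 1.0 - inter / union

def cluster(samples, selected, max_distance=0.5):
    """Group samples whose pairwise distance is within max_distance."""
    arrays = {sid: to_array(feats, selected) for sid, feats in samples.items()}
    parent = {sid: sid for sid in samples}

    def find(x):
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in combinations(samples, 2):
        if distance(arrays[a], arrays[b]) <= max_distance:
            parent[find(a)] = find(b)

    groups = {}
    for sid in samples:
        groups.setdefault(find(sid), set()).add(sid)
    return sorted(sorted(g) for g in groups.values())

def validate(clusters, tags):
    """Collect the known-family tags present in each cluster."""
    return [{tags[s] for s in c if s in tags} for c in clusters]

# Hypothetical data: features are artifact lines from analysis logs.
feature_stats = {
    "mutex:Global/xyz": (95, 100),
    "dns:evil.example": (40, 45),
    "file:temp.log": (10, 200),   # common in benign samples, filtered out
    "reg:Run/updater": (30, 32),
}
samples = {
    "s1": {"mutex:Global/xyz", "dns:evil.example"},
    "s2": {"mutex:Global/xyz", "dns:evil.example", "file:temp.log"},
    "s3": {"reg:Run/updater"},
    "s4": {"reg:Run/updater", "file:temp.log"},
}
tags = {"s1": "FamilyA", "s3": "FamilyB"}

selected = prefilter(feature_stats)
clusters = cluster(samples, selected)
families = validate(clusters, tags)
```

In this toy run, a cluster whose members carry exactly one known tag is consistent with a previously identified group, while a cluster with several conflicting tags would suggest the distance threshold is too loose.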
