×

Automatically grouping malware based on artifacts

  • US 10,230,749 B1
  • Filed: 02/29/2016
  • Issued: 03/12/2019
  • Est. Priority Date: 02/29/2016
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method, comprising:

  • receiving a plurality of samples for performing automated malware analysis to generate log files based on the automated malware analysis;

    processing the log files to extract features associated with malware, wherein each of the extracted features corresponds to a line or a sub-line in one or more of the log files determined to be an artifact associated with malware;

    clustering the plurality of samples based on the extracted features, wherein clustering the plurality of samples based on the extracted features further comprises;

    selecting one or more of the extracted features and assigning values to each indicator, wherein selecting one or more of the extracted features includes performing a pre-filtering operation to select the extracted features for clustering based on a threshold association between the line or the sub-line in the one or more of the log files and known malware;

    collecting the assigned values in an array for each of the plurality of samples;

    comparing the assigned values of the array between two of the plurality of samples; and

    calculating a distance between the two samples, wherein the samples within a defined threshold of distance are clustered; and

    performing an action based on an output of clustering the plurality of samples based on the extracted features, wherein the action based on the output of clustering the plurality of samples based on the extracted features further comprises validate the output of clustering the plurality of samples based on the extracted features based on tags to identify previously identified malware groups.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×