Handling Noise in Training Data for Malware Detection

  • US 20130097704A1
  • Filed: 10/13/2012
  • Published: 04/18/2013
  • Est. Priority Date: 10/13/2011
  • Status: Abandoned Application
First Claim

1. A computer system comprising at least one processor configured to form a set of noise detectors, each noise detector of the set of noise detectors configured to de-noise a corpus of records, wherein the corpus is pre-classified into a subset of clean records and a subset of malware records prior to de-noising, and wherein de-noising the corpus comprises:

  • selecting a first record and a second record from the corpus, the first record being labeled as clean and the second record being labeled as malware;

  • in response to selecting the first and second records, determining whether the first and second records are similar according to a set of features; and

  • in response, when the first and second records are similar, determining that the first and second records are noise.
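
The de-noising steps recited above can be illustrated with a minimal sketch of a single noise detector. This is not the applicant's implementation. The `Record` type, the use of Jaccard similarity over feature sets, the 0.9 threshold, and the file names are all assumptions made for illustration; the claim itself only requires that similarity be determined "according to a set of features".

```python
from dataclasses import dataclass
from itertools import product


@dataclass(frozen=True)
class Record:
    """Hypothetical record: a pre-assigned label plus extracted features."""
    name: str
    label: str            # "clean" or "malware"
    features: frozenset


def jaccard(a, b):
    """Jaccard similarity of two feature sets (an assumed similarity measure)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def find_noise(corpus, threshold=0.9):
    """Return the names of records flagged as noise.

    Each clean-labeled record is paired with each malware-labeled record.
    When a pair is similar (similarity >= threshold, an assumed rule),
    both records of the pair are flagged.
    """
    clean = [r for r in corpus if r.label == "clean"]
    malware = [r for r in corpus if r.label == "malware"]
    noise = set()
    for c, m in product(clean, malware):
        if jaccard(c.features, m.features) >= threshold:
            noise.update((c.name, m.name))
    return noise


# Illustrative corpus: a.exe and b.exe share all features but carry
# opposite labels, so both are flagged; c.exe is left alone.
corpus = [
    Record("a.exe", "clean", frozenset({"f1", "f2", "f3"})),
    Record("b.exe", "malware", frozenset({"f1", "f2", "f3"})),
    Record("c.exe", "malware", frozenset({"f7", "f8"})),
]
noisy = find_noise(corpus)
print(sorted(noisy))
```

Comparing every clean record against every malware record is quadratic in corpus size; it is used here only because it mirrors the pairwise selection in the claim most directly.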

  • 2 Assignments