×

Decision tree induction that is sensitive to attribute computational complexity

  • US 8,190,647 B1
  • Filed: 09/15/2009
  • Issued: 05/29/2012
  • Est. Priority Date: 09/15/2009
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method for constructing a decision tree for classifying computer files based on the computational complexities of attributes of the files, comprising:

  • creating a plurality of attribute vectors for a plurality of training files of known classification, each attribute vector comprising values of a predetermined set of attributes for an associated training file;

    determining a complexity score for each attribute in the predetermined set of attributes, the complexity score measuring a cost associated with determining a value of an associated attribute for a file; and

    growing a decision tree based on the plurality of attribute vectors, comprising;

    (1) setting the plurality of attribute vectors as a current set,(2) determining a weighted impurity reduction score for at least one attribute of the predetermined set of attributes based on the complexity score of the attribute, the weighted impurity reduction score quantifying a cost-benefit tradeoff for an associated attribute in classifying the current set,(3) selecting a splitting attribute from the at least one attribute of the predetermined set of attributes,(4) splitting the current set into subsets using the splitting attribute, and(5) repeating steps (2) through (4) for each of the subsets as the current set.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×