×

Data mining method and system for generating a decision tree classifier for data records based on a minimum description length (MDL) and presorting of records

  • US 5,787,274 A
  • Filed: 11/29/1995
  • Issued: 07/28/1998
  • Est. Priority Date: 11/29/1995
  • Status: Expired due to Term
First Claim
Patent Images

1. A method for generating a decision tree classifier from a training set of records, each record having at least one numeric attribute and a class label of a class to which the record belongs, the method comprising the steps of:

  • pre-sorting the records based on the numeric attribute;

    generating an attribute list for each attribute and a class list, each entry of the class list corresponding to a class label, the attribute list including values of the attribute and indices to the class labels;

    creating a decision tree from the pre-sorted records, attribute lists, and class list, using a breadth-first process, the decision tree having a root node, a plurality of interior nodes, and a plurality of leaf nodes, the breadth-first process being such that the nodes of a same depth from the root node are formed in parallel; and

    pruning the decision tree based on a Minimum Description Length (MDL) scheme to obtain the decision tree classifier, the MDL scheme encoding the decision tree as a model such that an encoding cost for describing the decision tree and the training set is minimized, and the method taking into account the encoding cost in the pruning.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×