Discriminant Forest Classification Method and System
First Claim
1. A hybrid random forest (RF) and discriminant analysis (DA) method of training a computerized system to predict the class membership of a sample of unknown class, comprising:
- providing a forest training set to the computerized system comprising N feature vector ({circumflex over (x)}i) and class label (ŷ
i) pairs, ({circumflex over (x)}iε
,ŷ
iε
{0,1}) for i=1 to N, and from D available features; and
controlling the computerized system to repeat the following set of steps until a desired forest size having n decision trees has been reached;
adding a decision tree to the forest,creating a tree training set associated with the added decision tree, said tree training set comprising N bootstrapped training samples randomly selected with replacement from the forest training set, andusing the tree training set to train the added decision tree by using hierarchical DA-based decisions to perform splitting of decision nodes and thereby grow the added decision tree as a DA-based decision tree,whereby, upon reaching the desired forest size, the computerized system may predict the classification of a sample of unknown class using the n DA-based decision trees.
2 Assignments
0 Petitions
Accused Products
Abstract
A hybrid machine learning methodology and system for classification that combines classical random forest (RF) methodology with discriminant analysis (DA) techniques to provide enhanced classification capability. A DA technique which uses feature measurements of an object to predict its class membership, such as linear discriminant analysis (LDA) or Andersen-Bahadur linear discriminant technique (AB), is used to split the data at each node in each of its classification trees to train and grow the trees and the forest. When training is finished, a set of n DA-based decision trees of a discriminant forest is produced for use in predicting the classification of new samples of unknown class.
69 Citations
15 Claims
-
1. A hybrid random forest (RF) and discriminant analysis (DA) method of training a computerized system to predict the class membership of a sample of unknown class, comprising:
-
providing a forest training set to the computerized system comprising N feature vector ({circumflex over (x)}i) and class label (ŷ
i) pairs, ({circumflex over (x)}iε
,ŷ
iε
{0,1}) for i=1 to N, and from D available features; andcontrolling the computerized system to repeat the following set of steps until a desired forest size having n decision trees has been reached; adding a decision tree to the forest, creating a tree training set associated with the added decision tree, said tree training set comprising N bootstrapped training samples randomly selected with replacement from the forest training set, and using the tree training set to train the added decision tree by using hierarchical DA-based decisions to perform splitting of decision nodes and thereby grow the added decision tree as a DA-based decision tree, whereby, upon reaching the desired forest size, the computerized system may predict the classification of a sample of unknown class using the n DA-based decision trees. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
-
Specification