Feature selection method using support vector machine classifier
Abstract
Identification of a determinative subset of features from within a large set of features is performed by training a support vector machine to rank the features according to classifier weights, where features are removed to determine how their removal affects the value of the classifier weights. The features having the smallest weight values are removed and a new support vector machine is trained with the remaining features. The process is repeated until a relatively small subset of features remains that is capable of accurately separating the data into different patterns or classes. The method is applied for selecting the smallest number of genes that are capable of accurately distinguishing between medical conditions such as cancer and non-cancer.
33 Claims
1-32. (canceled)
33. A computer-implemented method for predicting patterns in data, wherein the data comprises a large set of features that describe the data, the method comprising:
identifying a determinative subset of features that are most correlated to the patterns, comprising:
(a) inputting the data into a computer processor programmed for executing support vector machine classifiers;
(b) training a support vector machine classifier with a training data set having known outcomes with respect to the patterns, wherein the classifier comprises weights having weight values that correspond to the features in the data set and removal of a subset of features affects the weight values;
(c) ranking the features according to their corresponding weight values;
(d) removing one or more features corresponding to the smallest weight values;
(e) training a new classifier with the remaining features;
(f) repeating steps (c) through (e) for a plurality of iterations until a final subset having a pre-determined number of features remains; and
generating an output comprising a listing of the features in the final subset, wherein the final subset comprises the determinative subset of features.
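Steps (b) through (f) and the final output step can be illustrated with a minimal Python sketch. It is not the claimed implementation and makes several assumptions that the claim does not state: the classifier is a linear support vector machine fitted by plain subgradient descent on the regularized hinge loss, features are ranked by the absolute value of their weights, and one feature is removed per iteration. The names train_linear_svm, select_features and make_toy_data, along with the toy data itself, are hypothetical and exist only for this illustration.

```python
import random


def train_linear_svm(X, y, epochs=50, lr=0.01, lam=0.01):
    """Fit a linear SVM by subgradient descent on the L2-regularized hinge loss.

    X is a list of feature rows, y a list of labels in {-1, +1}.
    Returns (weights, bias), with one weight per feature column.
    """
    n_features = len(X[0])
    w = [0.0] * n_features
    b = 0.0
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            margin = yi * (sum(wj * xj for wj, xj in zip(w, xi)) + b)
            violated = margin < 1.0  # the hinge term is active only inside the margin
            for j in range(n_features):
                grad = lam * w[j] - (yi * xi[j] if violated else 0.0)
                w[j] -= lr * grad
            if violated:
                b += lr * yi
    return w, b


def select_features(X, y, n_keep, n_drop_per_round=1):
    """Repeat train / rank / remove until n_keep feature indices remain."""
    active = list(range(len(X[0])))  # original column indices still in play
    while len(active) > n_keep:
        # Steps (b) and (e): train a classifier on the remaining features only.
        X_sub = [[row[j] for j in active] for row in X]
        w, _ = train_linear_svm(X_sub, y)
        # Step (c): rank by weight magnitude, smallest first (an assumption).
        ranked = sorted(range(len(active)), key=lambda k: abs(w[k]))
        # Step (d): remove the lowest-ranked features without overshooting n_keep.
        n_drop = min(n_drop_per_round, len(active) - n_keep)
        dropped = set(ranked[:n_drop])
        active = [f for k, f in enumerate(active) if k not in dropped]
    return active


def make_toy_data(n_samples=40, n_features=6, informative=(0, 3), seed=0):
    """Hypothetical data: only the `informative` columns depend on the label."""
    rng = random.Random(seed)
    X, y = [], []
    for i in range(n_samples):
        label = 1 if i % 2 == 0 else -1
        row = [rng.uniform(-0.3, 0.3) for _ in range(n_features)]
        for j in informative:
            row[j] += 1.5 * label  # shift the informative columns by the class label
        X.append(row)
        y.append(label)
    return X, y


X, y = make_toy_data()
selected = select_features(X, y, n_keep=2)
# Output step: list the features in the final subset.
print("final subset:", selected)
```

On this toy data the two label-driven columns, 0 and 3, survive the elimination, because the noise columns always receive smaller weight updates than the informative ones. Retraining after every removal, rather than ranking once, reflects the claim's observation that removing a subset of features changes the remaining weight values.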
Specification