METHOD FOR FEATURE SELECTION IN A SUPPORT VECTOR MACHINE USING FEATURE RANKING
First Claim
1. A computer-implemented method for selecting a subset of features for processing in a learning machine, wherein the features correspond to a dataset to be analyzed for patterns, the method comprising:
- ranking the features according to a distance between extremal points of two classes of interest; and
selecting the subset of features having the highest rank.
3 Assignments
0 Petitions
Accused Products
Abstract
In a pre-processing step prior to training a learning machine, pre-processing includes reducing the quantity of features to be processed using feature selection methods selected from the group consisting of recursive feature elimination (RFE), minimizing the number of non-zero parameters of the system (l0-norm minimization), evaluation of cost function to identify a subset of features that are compatible with constraints imposed by the learning set, unbalanced correlation score, transductive feature selection and single feature using margin-based ranking. The features remaining after feature selection are then used to train a learning machine for purposes of pattern classification, regression, clustering and/or novelty detection.
-
Citations
13 Claims
-
1. A computer-implemented method for selecting a subset of features for processing in a learning machine, wherein the features correspond to a dataset to be analyzed for patterns, the method comprising:
-
ranking the features according to a distance between extremal points of two classes of interest; and selecting the subset of features having the highest rank. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A computer-implemented method for selecting a subset of features for processing in a learning machine, wherein the features correspond to a dataset to be analyzed for patterns, the method comprising:
-
determining a margin between extremal points of two classes of interest; and ranking the subset of features according to the size of the margin, wherein the largest margin corresponds to the highest rank. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A method for diagnosing renal cancer in a patient comprising:
entering gene expression data into a computer adapted for implementing a learning machine, the gene expressions data comprising gene expression levels in tissue obtained from the patient for a gene selected from the group consisting of small inducible cytokine A2 (monocyte chemotactic protein
1) and ATP synthase, H+ transporting, mitochondrial F1 complex, alpha subunit, isoform 1, cardiac muscle.
Specification