Methods for feature selection in a learning machine
First Claim
1. A computer-implemented method for identifying a pattern within a large dataset, wherein data points in the dataset correspond to a physical measurement and have a plurality of features that describe attributes of the data point, the method comprising:
- inputting the large dataset into a computer system having a processor and a memory;
selecting a feature subset of the large number of features for processing in a learning machine by executing a feature selection algorithm on the dataset to identify the subset of features, wherein the algorithm is selected from the group consisting of l0-norm minimization and unbalanced correlation, wherein l0-norm minimization comprises finding a smallest number of non-zero elements of a weight vector w in the relationship D(x)=w·
x+b, and unbalanced correlation comprises dividing the dataset into unbalanced groups of positive and negative examples and ranking features according to a success criterion for correctly classifying the dataset;
processing the feature subset of the data points of the large dataset to identify the pattern; and
generating an output to a printer or display device, the output comprising the identified pattern within the large dataset for the feature subset.
3 Assignments
0 Petitions
Accused Products
Abstract
In a pre-processing step prior to training a learning machine, pre-processing includes reducing the quantity of features to be processed using feature selection methods selected from the group consisting of recursive feature elimination (RFE), minimizing the number of non-zero parameters of the system (lo-norm minimization), evaluation of cost function to identify a subset of features that are compatible with constraints imposed by the learning set, unbalanced correlation score and transductive feature selection. The features remaining after feature selection are then used to train a learning machine for purposes of pattern classification, regression, clustering and/or novelty detection. (
154 Citations
20 Claims
-
1. A computer-implemented method for identifying a pattern within a large dataset, wherein data points in the dataset correspond to a physical measurement and have a plurality of features that describe attributes of the data point, the method comprising:
-
inputting the large dataset into a computer system having a processor and a memory; selecting a feature subset of the large number of features for processing in a learning machine by executing a feature selection algorithm on the dataset to identify the subset of features, wherein the algorithm is selected from the group consisting of l0-norm minimization and unbalanced correlation, wherein l0-norm minimization comprises finding a smallest number of non-zero elements of a weight vector w in the relationship D(x)=w·
x+b, and unbalanced correlation comprises dividing the dataset into unbalanced groups of positive and negative examples and ranking features according to a success criterion for correctly classifying the dataset;processing the feature subset of the data points of the large dataset to identify the pattern; and generating an output to a printer or display device, the output comprising the identified pattern within the large dataset for the feature subset. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 19)
-
-
11. A computer-implemented method for identifying a pattern within a large dataset, wherein the data points in the dataset correspond to physical measurements and each data point has a plurality of features that describe attributes of the data point, the method comprising:
-
inputting the large dataset into a computer system having a processor and a memory; selecting a feature subset of the large number of features for processing in a learning machine by executing a feature selection algorithm on the dataset to identify the subset of features, wherein the algorithm comprises l0-norm minimization, wherein l0-norm minimization comprises finding a smallest number of non-zero elements of a weight vector w in the relationship D(x)=w·
x+b;processing the feature subset of the data points of the large dataset to identify the pattern; and generating an output to a printer or display device, the output comprising the identified pattern within the large dataset for the feature subset. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 20)
-
Specification