Recursive feature elimination method using support vector machines
DC CAFCFirst Claim
1. A method, comprising:
- retrieving training data from one or more storage devices in communication with a processor, the processor operable for;
determining a value for each feature in a group of features provided by the training data;
eliminating at least one feature with a minimum ranking criterion from the group, wherein the minimum ranking criterion is obtained based on the value for each feature in the group;
subtracting a matrix from the kernel data to provide an updated kernel data, each component of the matrix comprising a dot product of two of training samples provided by at least a part of the training data that corresponds to the eliminated feature;
updating the value for each feature of the group based on the updated kernel data;
repeating of eliminating the at least one feature from the group and updating the value for each feature of the group until a number of features in the group reaches a predetermined value to generate a feature ranking list; and
recognizing a new data corresponding to the group of features with the feature ranking list.
3 Assignments
Litigations
1 Petition
Accused Products
Abstract
Identification of a determinative subset of features from within a group of features is performed by training a support vector machine using training samples with class labels to determine a value of each feature, where features are removed based on their the value. One or more features having the smallest values are removed and an updated kernel matrix is generated using the remaining features. The process is repeated until a predetermined number of features remain which are capable of accurately separating the data into different classes. In some embodiments, features are eliminated by a ranking criterion based on a Lagrange multiplier corresponding to each training sample.
-
Citations
23 Claims
-
1. A method, comprising:
-
retrieving training data from one or more storage devices in communication with a processor, the processor operable for; determining a value for each feature in a group of features provided by the training data; eliminating at least one feature with a minimum ranking criterion from the group, wherein the minimum ranking criterion is obtained based on the value for each feature in the group; subtracting a matrix from the kernel data to provide an updated kernel data, each component of the matrix comprising a dot product of two of training samples provided by at least a part of the training data that corresponds to the eliminated feature; updating the value for each feature of the group based on the updated kernel data; repeating of eliminating the at least one feature from the group and updating the value for each feature of the group until a number of features in the group reaches a predetermined value to generate a feature ranking list; and recognizing a new data corresponding to the group of features with the feature ranking list. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A non-transitory machine-readable medium comprising a plurality of instructions, that in response to being executed, result in a computing system executing a support vector machine, comprising:
-
a training function to determine a value for each feature in a group of features provided by a training data; and an eliminate function to eliminate at least one feature with a minimum ranking criterion from the group, wherein the minimum ranking criterion is obtained based on the value for each feature in the group, wherein the training function further comprises a kernel data function to subtract a matrix from the kernel data to provide an updated kernel data, each component of the matrix comprising a dot product of two of training samples provided by at least a part of the training data that corresponds to the eliminated feature, and a value update function to update the value for each feature based on the updated kernel data, and wherein the apparatus support vector machine further repeats eliminating the at least one feature from the group and updating the value for each feature of the group until a number of features in the group reaches a predetermined value, to generate a feature ranking list for a use of recognizing a new data corresponding to the group of features. - View Dependent Claims (8, 9, 10, 11)
-
-
12. A non-transitory machine-readable medium comprising a plurality of instructions that in response to being executed result in a computing system:
-
determining a value for each feature in a group of features provided by a training data; eliminating at least one feature with a minimum ranking criterion from the group, wherein the minimum ranking criterion is obtained based on the value for each feature in the group; subtracting a matrix from the kernel data to provide an updated kernel data, each component of the matrix comprising a dot product of two of training samples provided by at least a part of the training data that corresponds to the eliminated feature; updating the value for each feature of the group based on the updated kernel data; repeating of eliminating the at least one feature from the group and updating the value for each feature of the group until a number of features in the group reaches a predetermined value to generate a feature ranking list; and recognizing a new data corresponding to the group of features with the feature ranking list. - View Dependent Claims (13, 14, 15, 16, 17)
-
-
18. A computer-implemented method for predicting patterns in sample data, wherein the sample data comprises a group of features that describe the data, the method comprising:
-
identifying a determinative subset of features that are most correlated to the patterns comprising; retrieving a training data having class labels with respect to the patterns from a memory in communication with a computer processor programmed for executing a support vector machine comprising a kernel; calculating a kernel matrix using the training data to determine a value for each feature in the group of features; eliminating at least one feature with a minimum value from the group; calculating an updated kernel matrix, each component of the updated kernel matrix comprising a dot product of two training samples provided by at least a part of the training data that corresponds to the eliminated feature; determining an updated value for each remaining feature of the group of features based on the updated kernel matrix; repeating steps eliminating, calculating an updated kernel matrix and determining an updated value for a plurality of iterations until a pre-determined number of features in the group remain; and generating an output comprising a feature ranking list, wherein the features in the feature ranking list comprise the determinative subset of features for predicting patterns in new data. - View Dependent Claims (19, 20, 21, 22, 23)
-
Specification