Forward feature selection for support vector machines
First Claim
Patent Images
1. A method comprising:
- defining an iteration counter to a predetermined value;
training, using a processor of a computer system, a Support Vector Machine (SVM) on a subset of features (d′
) of a feature set having (d) features of a plurality of training instances to obtain a weight per instance ({right arrow over (α
)}′
);
approximating a quality for the d features of the feature set using the weight per instance;
ranking the d features of the feature set based on the approximated quality;
selecting a subset (q) of the features of the feature set based on the ranked approximated quality; and
iterating training the SVM, approximating the quality, ranking the d features, and selecting the q subset until the q subset is less than a selected threshold.
1 Assignment
0 Petitions
Accused Products
Abstract
In one embodiment, the present invention includes a method for training a Support Vector Machine (SVM) on a subset of features (d′) of a feature set having (d) features of a plurality of training instances to obtain a weight per instance, approximating a quality for the d features of the feature set using the weight per instance, ranking the d features of the feature set based on the approximated quality, and selecting a subset (q) of the features of the feature set based on the ranked approximated quality. Other embodiments are described and claimed.
4 Citations
20 Claims
-
1. A method comprising:
-
defining an iteration counter to a predetermined value; training, using a processor of a computer system, a Support Vector Machine (SVM) on a subset of features (d′
) of a feature set having (d) features of a plurality of training instances to obtain a weight per instance ({right arrow over (α
)}′
);approximating a quality for the d features of the feature set using the weight per instance; ranking the d features of the feature set based on the approximated quality; selecting a subset (q) of the features of the feature set based on the ranked approximated quality; and iterating training the SVM, approximating the quality, ranking the d features, and selecting the q subset until the q subset is less than a selected threshold. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. An article comprising a non-transitory machine-accessible medium including instructions that when executed cause a system to:
-
define an iteration counter to a predetermined value; train a Support Vector Machine (SVM) according to a forward feature selection algorithm in which only a random subset of features (d′
) of a feature set (d) of a plurality of training instances are used to obtain a weight per instance (α
′
);approximate a quality for the d features of the feature set using the weight per instance; rank the d features of the feature set based on the approximated quality, and select a subset (q) of the features of the feature set based on the ranked approximated quality; classify unlabeled data using the trained SVM; and determine whether the q subset is less than a first threshold, and if so conclude the SVM training, and otherwise iterate training the SVM, approximating the quality, ranking the remaining q features from the previous iteration, and selecting the q subset until the q subset is less than the first threshold. - View Dependent Claims (13, 14)
-
-
15. A system comprising:
-
a processor to perform instructions; and a memory coupled to the processor to store instructions that enable the processor to define an iteration counter to a predetermined value, train a Support Vector Machine (SVM) on a subset of features (d′
) of a feature set having (d) features of a plurality of training instances to obtain a weight per instance, approximate a quality for each of the d features of the feature set using the weight per instance, rank the d features of the feature set based on the approximated quality, select a subset (q) of the features of the feature set based on the ranked approximated quality, determine whether the q subset is less than a first threshold, and if so conclude the SVM training, and iterate training the SVM, approximating the quality, ranking the remaining q features from the previous iteration, and selecting the q subset until the q subset is less than the first threshold if the q subset is greater than the first threshold. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification