Method for statistical regression using ensembles of classification solutions
First Claim
Patent Images
1. A method for statistical regression using ensembles of classification solutions comprising the steps of:
- running k-means clustering for k clusters on the set of values {yl,i=1 . . . n};
recording a mean value mj of a cluster cj for j=1 . . . k;
transforming regression data into classification data with a class label for an i-th case being a cluster number of yi;
applying ensemble classifier and obtain a set of rules R; and
making a prediction for new case u, using a margin of M, where 0≦
M≦
1.
1 Assignment
0 Petitions
Accused Products
Abstract
A pattern recognition method induces ensembles of decision rules from data regression problems. Instead of direct prediction of a continuous output variable, the method discretizes the variable by k-means clustering and solves the resultant classification problem. Predictions on new examples are made by averaging the mean values of classes with votes that are close in number to the most likely class.
16 Citations
7 Claims
-
1. A method for statistical regression using ensembles of classification solutions comprising the steps of:
-
running k-means clustering for k clusters on the set of values {yl,i=1 . . . n};
recording a mean value mj of a cluster cj for j=1 . . . k;
transforming regression data into classification data with a class label for an i-th case being a cluster number of yi;
applying ensemble classifier and obtain a set of rules R; and
making a prediction for new case u, using a margin of M, where 0≦
M≦
1. - View Dependent Claims (2)
-
-
3. A method of pattern recognition comprising the steps of:
-
applying clustering processes to determine a number of classes;
applying ensemble learning classification processes to predict most likely classes for a new example; and
then averaging regression values of most likely classes to predict a value of a new example.
-
-
4. A method of pattern recognition for a set of values, said method comprising the steps of:
-
determining a number of classes to be generated based on a trend of error of a class mean/median for the set of values;
classifying the values using ensemble learning classification and the determined number of classes;
generating a set of classification rules; and
averaging regression values of most likely classes to predict a value of a new example based on the set of rules. - View Dependent Claims (5, 6, 7)
-
Specification