Method and system for constructing a classifier
First Claim
1. A computer-implemented method for constructing a classifier classifying data records into specific categories, comprising:
- a) clustering a set of records that are to be classified into a plurality of clusters of records by applying a clustering process to the set of records that are to be classified;
b) creating a first classifier based upon the clustered set of records that are to be classified, the first classifier enabled to classify records into a plurality of training clusters;
c) applying the first classifier to a set of training records to create a plurality of training clusters, each record in each training cluster of the plurality of training clusters having a corresponding predicted class;
d) creating separate classifiers for each training cluster of the plurality of training clusters; and
,e) applying one of the created separate classifiers to a corresponding one of the clusters of records amongst the clustered set of records in order to classify the corresponding one of the clusters of records,wherein the first classifier and also each separate classifier learns one or more sets of relationships that define each of a known set of predicted classes and wherein the computer-implemented method first processes the set of records that are to be classified to determine a context before using the set of training records to classify the set of records based upon the determined context.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and system for constructing a classifier for a set of records to be classified (108) into predicted classes (351-353) are provided. The set of records that are to be classified (108) are clustered into a plurality of clusters. A first classifier (106) is created that classifies records into the plurality of clusters (321-323) and the first classifier (106) is applied to a set of training records (110), each of the training records (331-334) having a predicted class (306). A classifier (107A-C) may then be created for each sub-set of training records (341-343) classed into each of the plurality of clusters and the classifier (107A-C) applied to a sub-set of records to be classified (311-313) formed in the corresponding cluster.
-
Citations
9 Claims
-
1. A computer-implemented method for constructing a classifier classifying data records into specific categories, comprising:
-
a) clustering a set of records that are to be classified into a plurality of clusters of records by applying a clustering process to the set of records that are to be classified; b) creating a first classifier based upon the clustered set of records that are to be classified, the first classifier enabled to classify records into a plurality of training clusters; c) applying the first classifier to a set of training records to create a plurality of training clusters, each record in each training cluster of the plurality of training clusters having a corresponding predicted class; d) creating separate classifiers for each training cluster of the plurality of training clusters; and
,e) applying one of the created separate classifiers to a corresponding one of the clusters of records amongst the clustered set of records in order to classify the corresponding one of the clusters of records, wherein the first classifier and also each separate classifier learns one or more sets of relationships that define each of a known set of predicted classes and wherein the computer-implemented method first processes the set of records that are to be classified to determine a context before using the set of training records to classify the set of records based upon the determined context. - View Dependent Claims (2, 3, 4)
-
-
5. A computer hardware system for constructing a classifier classifying data records into specific categories, comprising:
at least one processor, wherein the at least one processor configured to perform; clustering a set of records that are to be classified into a plurality of clusters of records by applying a clustering process to the set of records that are to be classified; creating a first classifier based upon the clustered set of records that are to be classified, the first classifier enabled to classify records into a plurality of training clusters; applying the first classifier to a set of training records to create a plurality of training clusters, each record in each training cluster of the plurality of training clusters having a corresponding predicted class; creating separate classifiers for each training cluster the plurality of training clusters; and
,applying one of the created separate classifiers to a corresponding one of the clusters of records amongst the clustered set of records in order to classify the corresponding one of the clusters of records, and wherein the at least one processor first processes the set of records that are to be classified to determine a context before using the set of training records to classify the set of records based upon the determined context. - View Dependent Claims (6, 7, 8)
-
9. A non-transitory computer readable storage medium for constructing a classifier, comprising computer readable instructions, which, when executed on a computer system, cause the computer system to perform the steps of:
-
a) clustering a set of records that are to be classified into a plurality of clusters of records by applying a clustering process to the set of records that are to be classified; b) creating a first classifier based upon the clustered set of records that are to be classified, the first classifier enabled to classify records into a plurality of training clusters; c) applying the first classifier to a set of training records to create a plurality of training clusters, each record in each training cluster of the plurality of training clusters having a corresponding predicted class; d) creating separate classifiers for each training cluster of the plurality of training clusters; and
,e) applying one of the created separate classifiers to a corresponding one of the clusters of records amongst the clustered set of records in order to classify the corresponding one of the clusters of records, wherein the first classifier and also each separate classifier learns one or more sets of relationships that define each of a known set of predicted classes and wherein the computer system first processes the set of records that are to be classified to determine a context before using the set of training records to classify the set of records based upon the determined context.
-
Specification