Reliability measure for a classifier
First Claim
Patent Images
1. A method for determining a class for at least one data item, the method comprising:
- inputting a data item into a scoring classifier such that the scoring classifier indicates that the data item belongs to a first class;
determining an amount of retraining of the scoring classifier that is required to cause the scoring classifier to indicate that the data item belongs to a second class that is different from the first class;
determining a reliability measure based on the required amount of retraining; and
determining a class of the data item based, at least in part, on the reliability measure.
9 Assignments
0 Petitions
Accused Products
Abstract
In one aspect, a data item is input into a scoring classifier such that the scoring classifier indicates that the data item belongs to a first class. A determination is made as to the amount of retraining of the scoring classifier, based on the data item, that is required to cause the scoring classifier to indicate that the data item belongs to a second class. A reliability measure is determined based on the required amount of retraining and a class of the data item is determined based, at least in part, on the reliability measure.
64 Citations
23 Claims
-
1. A method for determining a class for at least one data item, the method comprising:
-
inputting a data item into a scoring classifier such that the scoring classifier indicates that the data item belongs to a first class; determining an amount of retraining of the scoring classifier that is required to cause the scoring classifier to indicate that the data item belongs to a second class that is different from the first class; determining a reliability measure based on the required amount of retraining; and determining a class of the data item based, at least in part, on the reliability measure. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A computer-readable storage medium storing a program for determining a class for at least one data item, the program comprising code for causing a processing device to perform the following operations:
-
input a data item into a scoring classifier such that the scoring classifier indicates that the data item belongs to a first class; determine an amount of retraining of the scoring classifier that is required to cause the scoring classifier to indicate that the data item belongs to a second class that is different from the first class; determining a reliability measure based on the required amount of retraining; and determining a class of the data item based, at least in part, on the reliability measure. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A method for determining a class for a data item, the method comprising:
-
developing a classification model based on a set of training data items, the set of training data items including a first group of training data items having a first class and a second group of training data items having a second class different from the first class; applying the classification model to a data item to produce a first classification score for the data item, the first classification score indicating that the data item belongs to the first class; determining a number of times the data item would need to be added to the second group of training data items to create a modified set of training data items such that, if the classification model was developed with the modified set of training data items, applying the classification model to the data item would produce a second classification score that indicates the data item belongs to the second class; determining a reliability measure based on the number of times the data item would need to be added to the second group of data items to create the modified training set of data items; modifying the first classification score based on the reliability measure to produce a classification output; and comparing the classification output to a classification threshold to determine whether the data item belongs to the first class or the second class. - View Dependent Claims (20, 21)
-
-
22. A method for determining a class for at least one data item, the method comprising:
-
inputting a data item into a scoring classifier such that the scoring classifier indicates that the data item belongs to a first class; determining an amount of retraining of the scoring classifier based on the data item that is required to cause the scoring classifier to indicate that the data item belongs to a second class; determining a reliability measure based on the required amount of retraining; and determining a class of the data item based, at least in part, on the reliability measure, wherein determining the reliability measure based on the required amount of retraining comprises; determining a new probability distribution of features for the second class based on the required amount of retraining; and measuring a difference between an original probability distribution of features for the second class and the new probability distribution of features for the second class. - View Dependent Claims (23)
-
Specification