RAPID ITERATIVE DEVELOPMENT OF CLASSIFIERS
First Claim
Patent Images
1. A computer-implemented method of training a classifier, such that the trained classifier is configured to map an instance to one of a plurality of classes, comprising:
- providing a data framework corresponding to a plurality of instances and a plurality of features, the data framework tangibly embodied in a computer-readable medium;
providing a query space, wherein each of a plurality of queries of the query space comprises a subset of the plurality of instances, a subset of the plurality of features and a relevant characteristic function that describes, for that query, a function of instance-feature values associated with that query and of the instance-class probabilities associated with that query, wherein the instance-class probabilities are an indication of a probabilistic model of mapping of the instances associated with the query to at least one of the plurality of classes, the query space tangibly embodied in a computer-readable medium;
operating a computing device to receive an indication of commentary from at least one editor for each of the plurality of queries of the query space, wherein the commentary for each query being the editor'"'"'s estimate of the true value of the relevant characteristic for that query and storing the estimates in the query space in correspondence with the queries;
maintaining a classifier framework tangibly embodied in a computer-readable medium, the classifier framework configured to provide class probabilities for the instances according to a tunable parameter vector;
operating a computing device to determine, from the classifier model, relevant characteristic values corresponding to the queries in the query space;
operating a computing device to determine a distortion value for each query by applying a distortion function to a deviation of the classifier framework response from the indication of editorial commentary for that query; and
operating a computing device to adjust the tunable parameter vector based on a cost function that considers a regularization component and the distortion values over the plurality of queries for which the editors gave commentary.
9 Assignments
0 Petitions
Accused Products
Abstract
A classifier development process seamlessly and intelligently integrates different forms of human feedback on instances and features into the data preparation, learning and evaluation stages. A query utility based active learning approach is applicable to different types of editorial feedback. A bi-clustering based technique may be used to further speed up the active learning process.
40 Citations
22 Claims
-
1. A computer-implemented method of training a classifier, such that the trained classifier is configured to map an instance to one of a plurality of classes, comprising:
-
providing a data framework corresponding to a plurality of instances and a plurality of features, the data framework tangibly embodied in a computer-readable medium; providing a query space, wherein each of a plurality of queries of the query space comprises a subset of the plurality of instances, a subset of the plurality of features and a relevant characteristic function that describes, for that query, a function of instance-feature values associated with that query and of the instance-class probabilities associated with that query, wherein the instance-class probabilities are an indication of a probabilistic model of mapping of the instances associated with the query to at least one of the plurality of classes, the query space tangibly embodied in a computer-readable medium; operating a computing device to receive an indication of commentary from at least one editor for each of the plurality of queries of the query space, wherein the commentary for each query being the editor'"'"'s estimate of the true value of the relevant characteristic for that query and storing the estimates in the query space in correspondence with the queries; maintaining a classifier framework tangibly embodied in a computer-readable medium, the classifier framework configured to provide class probabilities for the instances according to a tunable parameter vector; operating a computing device to determine, from the classifier model, relevant characteristic values corresponding to the queries in the query space; operating a computing device to determine a distortion value for each query by applying a distortion function to a deviation of the classifier framework response from the indication of editorial commentary for that query; and operating a computing device to adjust the tunable parameter vector based on a cost function that considers a regularization component and the distortion values over the plurality of queries for which the editors gave commentary. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A computer-implemented method of training a classifier, such that the trained classifier is configured to map an instance to one of a plurality of classes, comprising:
-
a) providing a data framework corresponding to a plurality of instances and a plurality of features, the data framework tangibly embodied in a computer-readable medium; b) providing a query space, wherein each of a plurality of queries of the query space comprises a subset of the plurality of instances, a subset of the plurality of features and a relevant characteristic function that describes, for that query, a function of instance-feature values associated with that query and of the instance-class probabilities associated with that query, wherein the instance-class probabilities are an indication of a probabilistic model of mapping of the instances associated with the query to at least one of the plurality of classes, the query space tangibly embodied in a computer-readable medium; c) maintaining a classifier framework tangibly embodied in a computer-readable medium, the classifier framework configured to provide class probabilities for the instances according to a tunable parameter vector; d) operating a computing device to provide a query utility function and a stopping criterion function that depend on the plurality of instances and features and the class probabilities associated with the classifier; e) operating a computing device to provide a stopping criterion threshold value; f) operating a computing device to initialize the tunable parameter vector and using it to compute class probabilities of the plurality of instances; g) for each of the plurality of queries, operating a computing device to apply a query utility function to determine a query utility value for that query; h) operating a computing device to present a subset of the plurality of the queries for editorial commentary, wherein a determination of which of the plurality of queries are in the subset is based at least in part on the query utility values; i) operating a computing device to receive an indication of editorial commentary from at least one editor for each of at least some of the subset of queries, wherein the commentary for each query of the subset of queries being the editor'"'"'s estimate of the true value of the relevant characteristic for that query and storing the estimates in the query space in correspondence with the queries; j) operating a computing device to determine a distortion value for each query by applying a distortion function to a deviation of the classifier framework response from the indication of editorial commentary for that query; k) operating a computing device to adjust a tunable parameter vector based on a cost function that considers a regularization component and the distortion values over the plurality of queries for which the editors gave commentary; l) operating a computing device to determine class probabilities of the plurality of instances; m) operating a computing device to determine a stopping criterion function; n) repeating g) to m) until the stopping criterion function goes below the stopping criterion threshold value. - View Dependent Claims (9, 10, 11, 12)
-
-
13. A computer program product comprising at least one tangible computer readable medium having computer program instructions tangibly embodied thereon, the computer program instructions to configure at least one computing device to train a classifier, such that the trained classifier is configured to map an instance to one of a plurality of classes, including to:
-
provide a data framework corresponding to a plurality of instances and a plurality of features, the data framework tangibly embodied in a computer-readable medium; provide a query space, wherein each of a plurality of queries of the query space comprises a subset of the plurality of instances, a subset of the plurality of features and a relevant characteristic function that describes, for that query, a function of instance-feature values associated with that query and of the instance-class probabilities associated with that query, wherein the instance-class probabilities are an indication of a probabilistic model of mapping of the instances associated with the query to at least one of the plurality of classes, the query space tangibly embodied in a computer-readable medium; operate a computing device to receive an indication of commentary from at least one editor for each of the plurality of queries of the query space, wherein the commentary for each query being the editor'"'"'s estimate of the true value of the relevant characteristic for that query and storing the estimates in the query space in correspondence with the queries; maintain a classifier framework tangibly embodied in a computer-readable medium, the classifier framework configured to provide class probabilities for the instances according to a tunable parameter vector; determine, from the classifier model, relevant characteristic values corresponding to the queries in the query space; determine a distortion value for each query by applying a distortion function to a deviation of the classifier framework response from the indication of editorial commentary for that query; and adjust the tunable parameter vector based on a cost function that considers a regularization component and the distortion values over the plurality of queries for which the editors gave commentary. - View Dependent Claims (14, 15, 16, 17)
-
-
18. A computer program product comprising at least one tangible computer readable medium having computer program instructions tangibly embodied thereon, the computer program instructions to configure at least one computing device to train a classifier, such that the trained classifier is configured to map an instance to one of a plurality of classes, including to:
-
a) provide a data framework corresponding to a plurality of instances and a plurality of features; b) provide a query space, wherein each of a plurality of queries of the query space comprises a subset of the plurality of instances, a subset of the plurality of features and a relevant characteristic function that describes, for that query, a function of instance-feature values associated with that query and of the instance-class probabilities associated with that query, wherein the instance-class probabilities are an indication of a probabilistic model of mapping of the instances associated with the query to at least one of the plurality of classes; c) maintaining a classifier framework configured to provide class probabilities for the instances according to a tunable parameter vector; d) providing a query utility function and a stopping criterion function that depend on the plurality of instances and features and the class probabilities associated with the classifier; e) providing a stopping criterion threshold value; f) initializing the tunable parameter vector and using it to compute class probabilities of the plurality of instances; g) for each of the plurality of queries, applying a query utility function to determine a query utility value for that query; h) presenting a subset of the plurality of the queries for editorial commentary, wherein a determination of which of the plurality of queries are in the subset is based at least in part on the query utility values; i) receiving an indication of editorial commentary from at least one editor for each of at least some of the subset of queries, wherein the commentary for each query of the subset of queries being the editor'"'"'s estimate of the true value of the relevant characteristic for that query and storing the estimates in the query space in correspondence with the queries; j) determining a distortion value for each query by applying a distortion function to a deviation of the classifier framework response from the indication of editorial commentary for that query; k) adjusting a tunable parameter vector based on a cost function that considers a regularization component and the distortion values over the plurality of queries for which the editors gave commentary; l) determining class probabilities of the plurality of instances; m) determining a stopping criterion function; n) repeating g) to m) until the stopping criterion function goes below the stopping criterion threshold value. - View Dependent Claims (19, 20, 21, 22)
-
Specification