Performing Cross-Validation Using Non-Randomly Selected Cases
First Claim
Patent Images
1. A method for performing cross-validation, comprising:
- given a first set of randomly selected labeled cases and a second set of actively selected labeled cases,training a classifier on a training set comprising a first subset of the first set and a subset of the second set; and
measuring the performance of the classifier on a test set comprising a second subset of the first set that is disjoint relative to the first subset, the test set including no cases from the second set.
8 Assignments
0 Petitions
Accused Products
Abstract
A technique to perform cross-validation using a set of randomly selected labeled cases and a set of non-randomly selected labeled cases. A training set for use during cross-validation can include cases from both sets. A test set for use during cross-validation can include cases from the randomly selected set but exclude cases from the non-randomly selected set.
-
Citations
15 Claims
-
1. A method for performing cross-validation, comprising:
-
given a first set of randomly selected labeled cases and a second set of actively selected labeled cases, training a classifier on a training set comprising a first subset of the first set and a subset of the second set; and measuring the performance of the classifier on a test set comprising a second subset of the first set that is disjoint relative to the first subset, the test set including no cases from the second set. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A system, comprising:
-
a memory to store a first set of randomly sampled labeled cases and a second set of non-randomly sampled labeled cases; and a cross-validation module to perform cross-validation using the first and second sets of cases, the cross-validation module configured to exclude the second set of non-randomly sampled labeled cases from a test set used in a test phase of the cross-validation. - View Dependent Claims (11, 12, 13)
-
-
14. A non-transitory computer readable storage medium storing instructions that, when executed by a processor, cause a computer to perform cross-validation as follows:
-
train a classifier on a training set comprising a first subset of a first set of randomly selected labeled cases and a subset of a second set of actively selected labeled cases; and measure the performance of the classifier on a test set comprising a second subset of the first set, the test set excluding actively selected cases. - View Dependent Claims (15)
-
Specification