METHOD AND SYSTEM FOR DETERMINING THE ACCURACY OF DNA BASE IDENTIFICATIONS
Abstract
Embodiments disclosed herein relate to a method and system for determining the accuracy of DNA base identifications, based at least partly on sampling characteristics of subsets within training data sets.
38 Claims
1. A method in a computer for determining the quality of predicted DNA base identifications, the method comprising:
receiving a training data set, the training data set comprising a plurality of predicted DNA base identifications;
defining a group of subsets;
comparing the predicted DNA base identifications with actual DNA base identifications for training data within each subset of the group;
determining a sampling characteristic for each subset of the group based on training data within the respective subset; and
determining a quality characterization for predicted DNA base identifications within at least one subset of the group based on the comparison and determined sampling characteristic.
Dependent claims: 2 to 19.
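For illustration only, the steps of claim 1 can be sketched in Python. The claim does not say how the subsets are defined, what the sampling characteristic is, or how quality is characterized. The sketch below assumes that subsets are bins over a per-call feature value, that the sampling characteristic is the number of training calls in each bin, and that the quality characterization is a Phred-scaled error rate with an add-one pseudocount. All names (`TrainingCall`, `subset_key`, `characterize_quality`, `bin_width`) are hypothetical and do not come from the patent.

```python
import math
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class TrainingCall:
    predicted: str   # predicted base, e.g. "A"
    actual: str      # known true base for the training data
    feature: float   # hypothetical per-call parameter used to define subsets


def subset_key(call: TrainingCall, bin_width: float = 0.25) -> int:
    """Assign a call to a subset by binning its feature value (assumed scheme)."""
    return int(call.feature // bin_width)


def characterize_quality(calls, bin_width=0.25):
    """Return {subset: (n_calls, n_errors, phred_quality)}."""
    counts = defaultdict(lambda: [0, 0])  # subset -> [n_calls, n_errors]
    for call in calls:
        entry = counts[subset_key(call, bin_width)]
        entry[0] += 1                              # sampling characteristic
        entry[1] += call.predicted != call.actual  # comparison with actual base
    result = {}
    for key, (n, errors) in counts.items():
        # Assumption: an add-one pseudocount keeps the estimate above zero
        # and gives sparsely sampled subsets a more conservative score.
        p_error = (errors + 1) / (n + 2)
        result[key] = (n, errors, -10.0 * math.log10(p_error))
    return result


calls = [
    TrainingCall("A", "A", 0.10),
    TrainingCall("C", "C", 0.20),
    TrainingCall("G", "T", 0.10),  # miscalled base
    TrainingCall("T", "T", 0.30),
]
quality = characterize_quality(calls)
```

Under these assumptions the score depends on the sample size as well as the observed error count: a subset with zero errors in a single call has an estimated error probability of 1/3, while a subset with zero errors in 998 calls has an estimate of 1/1000, which corresponds to a Phred-scaled value of 30.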
20. A system for determining the quality of DNA base identifications, the system comprising:
a predicted identity input component configured to receive a plurality of predicted DNA base identifications associated with a training data set;
a subset generator configured to define a group of subsets;
an identity comparison component configured to compare the predicted DNA base identifications with actual DNA base identifications for training data within each subset of the group;
a sampling determination component configured to determine a sampling characteristic for each subset of the group based on training data within the respective subset; and
a quality characterization determination component configured to determine a quality characterization for predicted DNA base identifications within at least one subset of the group based on the comparison and determined sampling characteristic.
Dependent claims: 21 to 38.
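One way to picture the component structure of claim 20 is as a single object whose methods stand in for the claimed components. This is a hypothetical decomposition, not the architecture described in the specification. It assumes that subsets are defined by a caller-supplied function of the base position, that the sampling characteristic is the subset size, and that the quality characterization is a plain error rate reported alongside that size. The class and method names are invented for this sketch.

```python
from collections import Counter
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class QualitySystem:
    """Hypothetical wiring of the components named in the system claim."""
    subset_of: Callable[[int], str]  # subset generator: position -> subset label
    predicted: List[str] = field(default_factory=list)

    def receive_predictions(self, bases):
        """Predicted identity input component."""
        self.predicted = list(bases)

    def compare(self, actual):
        """Identity comparison component: miscalls per subset."""
        errors = Counter()
        for i, (p, a) in enumerate(zip(self.predicted, actual)):
            errors[self.subset_of(i)] += p != a
        return errors

    def sample_sizes(self):
        """Sampling determination component: training calls per subset."""
        return Counter(self.subset_of(i) for i in range(len(self.predicted)))

    def quality(self, actual):
        """Quality characterization component (assumed form: error rate plus n)."""
        errors, sizes = self.compare(actual), self.sample_sizes()
        return {s: {"n": n, "error_rate": errors[s] / n} for s, n in sizes.items()}


# Assumed subset definition: first three positions versus the rest.
system = QualitySystem(subset_of=lambda i: "early" if i < 3 else "late")
system.receive_predictions("ACGTAC")
report = system.quality("ACGTTC")  # one miscall, at position 4
```

Keeping the subset generator as an injected function means a different subset definition can be swapped in without changing the comparison or sampling logic.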
Specification