Methods and systems for high confidence utilization of datasets
First Claim
1. A computer-implemented method for summarizing parameter value, the method being implemented in a computer and comprising the steps of:
- grouping measurement results from a data set into a plurality of pairs of measurement results;
determining, for each one pair of measurement results, whether predetermined measures for said each one pair of measurement results satisfy threshold criteria;
labeling a pair of measurement results from said plurality of pairs of measurement results as not changing if the predetermined measures do not satisfy the threshold criteria;
comparing, if the predetermined measures satisfied the threshold criteria, one measurement result in said each one pair of measurement results to another measurement result in said each one pair of measurement results;
classifying, after the comparison, said each one pair of measurement results according to result of the comparison, resulting in a classified plurality of pairs of measurement results;
selecting a common set of measurement results from the classified plurality of pairs of measurement results for use with the data set; and
providing summary measures for a parameter utilizing the common set;
wherein the steps of grouping measurement results, determining whether predetermined measures satisfy threshold criteria, labeling a pair of measurement results, comparing one measurement to another measurement result, classifying according to result of comparison, selecting a common set of measurement results and providing summary measures are performed by a processor executing computer readable code embedded in a non-transitory computer usable medium to perform said steps;
whereby the data set includes data obtained from large-scale measurements of organismal and cellular state involving multiple independent measurements of each parameter, said parameters including genes, transcripts and proteins; and
wherein the summarized parameter values are utilized for decision making to increase confidence on the use of the data in activities including manufacturing of analysis equipment, hybridization of protein, and gene expression.
1 Assignment
0 Petitions
Accused Products
Abstract
Methods and systems for high-confidence utilization of datasets are disclosed. In one embodiment, the method includes selecting a metric for determining substantially optimal combination of true positives and false positives in a data set, applying an optimization technique, and obtaining, from the results of the optimization technique, a value for at least one optimization parameter, the value for at least one optimization parameter resulting in substantially optimal combination of true positives and false positives. A number of true positives and a number of false positives are a function of the one or more optimization parameters.
-
Citations
6 Claims
-
1. A computer-implemented method for summarizing parameter value, the method being implemented in a computer and comprising the steps of:
-
grouping measurement results from a data set into a plurality of pairs of measurement results; determining, for each one pair of measurement results, whether predetermined measures for said each one pair of measurement results satisfy threshold criteria; labeling a pair of measurement results from said plurality of pairs of measurement results as not changing if the predetermined measures do not satisfy the threshold criteria; comparing, if the predetermined measures satisfied the threshold criteria, one measurement result in said each one pair of measurement results to another measurement result in said each one pair of measurement results; classifying, after the comparison, said each one pair of measurement results according to result of the comparison, resulting in a classified plurality of pairs of measurement results; selecting a common set of measurement results from the classified plurality of pairs of measurement results for use with the data set; and providing summary measures for a parameter utilizing the common set; wherein the steps of grouping measurement results, determining whether predetermined measures satisfy threshold criteria, labeling a pair of measurement results, comparing one measurement to another measurement result, classifying according to result of comparison, selecting a common set of measurement results and providing summary measures are performed by a processor executing computer readable code embedded in a non-transitory computer usable medium to perform said steps; whereby the data set includes data obtained from large-scale measurements of organismal and cellular state involving multiple independent measurements of each parameter, said parameters including genes, transcripts and proteins; and
wherein the summarized parameter values are utilized for decision making to increase confidence on the use of the data in activities including manufacturing of analysis equipment, hybridization of protein, and gene expression. - View Dependent Claims (2, 3, 4, 5, 6)
-
Specification