METHODS AND SYSTEMS FOR HIGH CONFIDENCE UTILIZATION OF DATASETS
First Claim
1. A method for summarizing parameter value, the method being implemented in a computer and comprising the steps of:
grouping measurement result from a data set into a plurality of pairs of measurement results;
determining, for said each one pair of measurement results, whether predetermined measures for said each one pair of measurement results satisfy threshold criteria;
classifying a pair of measurement results from said plurality of pairs of measurement results as not changing if the predetermined measures do not satisfy the threshold criteria;
comparing, if the predetermined measures satisfied the threshold criteria, one measurement result in said each one pair of measurement results to another measurement result in said each one pair of measurement results;
classifying, after the comparison, said each one pair of measurement results according to result of the comparison;
selecting a common set of measurement results from the classified plurality of pairs of measurement result for use with the data set; and
providing summary measures for a parameter utilizing the common set;
said determining, classifying, comparing, selecting and providing being performed by means of a computer usable medium having computer readable code that causes the computer to perform said steps;
whereby the data set includes data obtained from large-scale measurements of organismal and cellular state involving multiple independent measurements of each parameter, said parameters including genes, transcripts and proteins; and
wherein the summarized parameter values are utilized for decision making to increase confidence on the use of the data in activities including manufacturing, handling, hybridization and gene expression.
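The steps recited in claim 1 can be illustrated with a minimal sketch. The claim does not say what the "predetermined measures", the "threshold criteria", or the rule for choosing the "common set" are, so everything concrete here is an assumption made for illustration: the measures are taken to be the absolute difference and fold change within a pair, `min_diff` and `min_fold` are hypothetical thresholds, the common set is taken to be the pairs that share the most frequent classification, and the summary measure is the median log2 ratio over that set.

```python
from collections import Counter
from itertools import product
from math import log2
from statistics import median


def classify_pair(baseline, experiment, min_diff=20.0, min_fold=1.5):
    """Classify one pair of measurement results.

    min_diff and min_fold are hypothetical threshold criteria,
    not values taken from the patent.
    """
    hi, lo = max(baseline, experiment), min(baseline, experiment)
    # Assumed "predetermined measures": absolute difference and fold change.
    if (hi - lo) < min_diff or (hi / lo) < min_fold:
        return "not changing"
    # Thresholds satisfied: compare the two results to get a direction.
    return "increase" if experiment > baseline else "decrease"


def summarize(baselines, experiments):
    """Group replicates into pairs, classify each pair, pick a common set,
    and summarize the parameter over that set."""
    if any(v <= 0 for v in list(baselines) + list(experiments)):
        raise ValueError("this sketch assumes strictly positive signals")
    # Grouping: every baseline replicate paired with every experiment replicate.
    pairs = list(product(baselines, experiments))
    labels = [classify_pair(a, b) for a, b in pairs]
    # Assumed common set: pairs carrying the most frequent classification.
    consensus, _ = Counter(labels).most_common(1)[0]
    common = [p for p, lab in zip(pairs, labels) if lab == consensus]
    return {
        "call": consensus,
        "n_pairs": len(pairs),
        "n_common": len(common),
        "median_log2_ratio": median(log2(b / a) for a, b in common),
    }


if __name__ == "__main__":
    print(summarize([100.0, 110.0, 90.0], [220.0, 200.0, 210.0]))
```

Pairing every replicate of one condition with every replicate of the other is only one possible reading of "grouping ... into a plurality of pairs"; a one-to-one pairing would fit the claim language equally well.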
Abstract
Methods and systems for high-confidence utilization of datasets are disclosed. In one embodiment, the method includes selecting a metric for determining a substantially optimal combination of true positives and false positives in a data set, applying an optimization technique, and obtaining, from the results of the optimization technique, a value for at least one optimization parameter, that value resulting in a substantially optimal combination of true positives and false positives. The number of true positives and the number of false positives are functions of the one or more optimization parameters.
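The idea in the abstract can be sketched as follows. The abstract names neither the metric nor the optimization technique, so this example makes several assumptions: the single optimization parameter is a score threshold, the metric is the hypothetical weighted difference `tp - fp_weight * fp`, the optimization technique is an exhaustive search over the observed scores, and the true status of each item is known in advance.

```python
def counts_at(threshold, scores, is_true):
    """Return (true positives, false positives) among items called
    positive at the given threshold."""
    tp = sum(1 for s, t in zip(scores, is_true) if s >= threshold and t)
    fp = sum(1 for s, t in zip(scores, is_true) if s >= threshold and not t)
    return tp, fp


def optimize_threshold(scores, is_true, fp_weight=2.0):
    """Exhaustive search for the threshold that maximizes the assumed
    metric tp - fp_weight * fp (fp_weight is a hypothetical penalty)."""
    if not scores:
        raise ValueError("scores must not be empty")
    best = None
    for thr in sorted(set(scores)):
        # TP and FP counts are functions of the optimization parameter.
        tp, fp = counts_at(thr, scores, is_true)
        value = tp - fp_weight * fp
        if best is None or value > best["metric"]:
            best = {"threshold": thr, "tp": tp, "fp": fp, "metric": value}
    return best


if __name__ == "__main__":
    scores = [0.1, 0.4, 0.35, 0.8, 0.9, 0.7]
    truth = [False, False, True, True, True, False]
    print(optimize_threshold(scores, truth))
```

A larger `fp_weight` pushes the chosen threshold upward, trading true positives for fewer false positives; any other metric over the two counts could be substituted without changing the search.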
8 Claims
1. As set out under First Claim above. Dependent claims: 2, 3, 4, 5, 6.
7. A method for determining efficacy of a data set analyses techniques, the method being implemented in a computer and comprising the steps of:
scaling a data set and replicates of the data set;
comparing the scaled data set to predetermined groups of original data sets in order to test a predetermined analysis results;
wherein efficacy can be determined;
said scaling and comparing being performed by means of a computer usable medium having computer readable code that causes the computer to perform said steps;
whereby the data set being analyzed includes data obtained from large-scale measurements of organismal and cellular state involving multiple independent measurements of each parameter, said parameters including genes, transcripts and proteins; and
wherein the efficacy of a data set analyses techniques is utilized for decision making to increase confidence on the use of the data in activities including manufacturing, handling, hybridization, gene expression.
8. A computer program product comprising a computer usable medium having computer readable code embodied therein;
said computer readable code being capable of causing a computer system to:
scale a data set and replicates of the data set; and
compare the scaled data set to predetermined groups of original data sets in order to test a predetermined analysis results;
wherein efficacy can be determined;
whereby the data set being analyzed includes data obtained from large-scale measurements of organismal and cellular state involving multiple independent measurements of each parameter, said parameters including genes, transcripts and proteins; and
wherein the efficacy of a data set analyses techniques is utilized for decision making to increase confidence on the use of the data in activities including manufacturing, handling, hybridization and gene expression.
Specification