Method for determining biological expression levels by linear programming
First Claim
1. A method for determining a matrix of expression levels corresponding to a set of biological targets and a set of biological samples, comprising:
- obtaining a matrix of signal values P corresponding to the set of biological targets;
computing a vector of expression levels for a sample in the set of biological samples using the matrix of signal values P;
storing the vector of expression levels computed in the computing step in a storage matrix;
repeating the computing and storing steps for each sample in the set of biological samples; and
outputting the storage matrix as the matrix of expression levels.
1 Assignment
0 Petitions
Accused Products
Abstract
A method for determining a matrix of expression levels corresponding to a set of biological targets (e.g., genes or gene fragments) and a set of biological samples, including obtaining a matrix of signal values corresponding to the set of biological targets; computing a vector of expression levels for a sample in the set of biological samples using the matrix of signal values; storing the vector of computed expression levels in a storage matrix; repeating the computing and storing steps for each sample in the set of biological samples; and outputting the storage matrix as the matrix of expression levels. The method, based on a linear programming formulation of the problem, works for both “promiscuous” probe array data, in which there may be multiple targets indicated by a single probe, and the “polygamous” case, in which there are multiple probes for a single target. The preferred method can also process data obtained from multiple SAGE analyses using multiple markers. A second embodiment of the method determines optimal expression levels when the available probe data is noisy or uncertain.
6 Citations
27 Claims
-
1. A method for determining a matrix of expression levels corresponding to a set of biological targets and a set of biological samples, comprising:
-
obtaining a matrix of signal values P corresponding to the set of biological targets;
computing a vector of expression levels for a sample in the set of biological samples using the matrix of signal values P;
storing the vector of expression levels computed in the computing step in a storage matrix;
repeating the computing and storing steps for each sample in the set of biological samples; and
outputting the storage matrix as the matrix of expression levels. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 26)
-
-
17. A method for determining a vector of expression levels, comprising:
-
obtaining a vector of signal values A corresponding to a biological sample;
obtaining a matrix of signal values P corresponding to a set of biological targets;
determining all N nonnegative vectors satisfying an equation PE=A;
outputting, if N=1, the nonnegative vector determined in the determining step as the vector of expression levels;
computing, if N=0, a nonnegative vector E* that minimizes an L1 norm of an inconsistency in the equation PE=A, and outputting the nonnegative vector E* as the vector of expression levels; and
selecting and outputting, if N>
1, one of the N nonnegative vectors determined in the determining step as the vector of expression levels. - View Dependent Claims (18, 19, 27)
-
-
20. A method for identifying a set of biological targets S(A) consistent with a vector of nonnegative signal values A, comprising:
-
obtaining a vector of nonnegative signal values P(g) for a target g in a known universe of biological targets U;
determining if, for every positive element of P(g), whether a respective element of A is also positive, and if so, including the target g in the set of biological targets S(A);
repeating the obtaining and determining steps for each target g in the known universe of biological targets U; and
outputting the set of biological targets S(A).
-
-
21. A method for identifying a set of uniquely expressed biological targets consistent with a vector of nonnegative signal values A, comprising:
-
identifying a set of targets S(A) consistent with the vector of nonnegative signal values A;
identifying a set of ambiguous biological targets DS(A) that may be expressed in the vector of nonnegative signal values A, but cannot be identified with certainty; and
outputting those targets that are in S(A), but not in DS(A), as the set of uniquely expressed biological targets consistent with the vector of nonnegative signal values A. - View Dependent Claims (22, 23, 25)
-
-
24. A computer program product configured to store plural computer program instructions which, when executed by a computer, causes the computer to determine a matrix of expression levels corresponding to a set of biological targets and a set of biological samples, by performing plural steps comprising:
-
obtaining a matrix of signal values P corresponding to the set of biological targets;
computing a vector of expression levels for a sample in the set of biological samples using the matrix of signal values P;
storing the vector of expression levels computed in the computing step in a storage matrix;
repeating the computing and storing steps for each sample in the set of biological samples; and
outputting the storage matrix as the matrix of expression levels.
-
Specification