Methods for using co-regulated genesets to enhance detection and classification of gene expression patterns
First Claim
1. A method of determining a disease profile that best matches a patient profile, comprising:
- (a) comparing, on a suitably programmed computer, a patient profile with reference profiles in a reference database to determine a measure of similarity between said patient profile and each said reference profiles, wherein said patient profile is obtained by projecting a first profile comprising measurements of a plurality of cellular constituents in a cell sample from said patient onto co-varying basis cellular constituent sets, and wherein said reference profiles are obtained by respectively projecting disease profiles each comprising measurements of said plurality of cellular constituents in a disease cell sample onto said co-varying basis cellular constituent sets, said co-varying basis cellular constituent sets being determined based upon co-variation of measurements of cellular constituents, under a plurality of different perturbations;
(b) identifying, on a suitably programmed computer, a reference profile in said reference database that best matches said patient profile based on a maximum similarity among the measures of similarity determined in step (a); and
(c) outputting to a user interface device, a computer readable storage medium, or a local or remote computer system;
or displaying, said maximum similarity or the disease of said disease cell sample of the reference profile in said reference database that best matches said patient profile.
3 Assignments
0 Petitions
Accused Products
Abstract
The present invention provides methods for enhanced detection of biological response patterns. In one embodiment of the invention, genes are grouped into basis genesets according to the co-regulation of their expression. Expression of individual genes within a geneset is indicated with a single gene expression value for the geneset by a projection process. The expression values of genesets, rather than the expression of individual genes, are then used as the basis for comparison and detection of biological response with greatly enhanced sensitivity. In another embodiment of the invention, biological responses are grouped according to the similarity of their biological profile.
The methods of the invention have many useful applications, particularly in the fields of drug development and discovery. For example, the methods of the invention may be used to compare biological responses with greatly enhanced sensitivity. The biological responses that may be compared according to these methods include responses to single perturbations, such as a biological response to a mutation or temperature change, as well as graded perturbations such as titration with a particular drug. The methods are also useful to identify cellular constituents, particularly genes, associated with a particular type of biological response. Further, the methods may also be used to identify perturbations, such as novel drugs or mutations, which effect one or more particular genesets. The methods may still further be used to remove experimental artifacts in biological response data.
-
Citations
15 Claims
-
1. A method of determining a disease profile that best matches a patient profile, comprising:
-
(a) comparing, on a suitably programmed computer, a patient profile with reference profiles in a reference database to determine a measure of similarity between said patient profile and each said reference profiles, wherein said patient profile is obtained by projecting a first profile comprising measurements of a plurality of cellular constituents in a cell sample from said patient onto co-varying basis cellular constituent sets, and wherein said reference profiles are obtained by respectively projecting disease profiles each comprising measurements of said plurality of cellular constituents in a disease cell sample onto said co-varying basis cellular constituent sets, said co-varying basis cellular constituent sets being determined based upon co-variation of measurements of cellular constituents, under a plurality of different perturbations; (b) identifying, on a suitably programmed computer, a reference profile in said reference database that best matches said patient profile based on a maximum similarity among the measures of similarity determined in step (a); and (c) outputting to a user interface device, a computer readable storage medium, or a local or remote computer system;
or displaying, said maximum similarity or the disease of said disease cell sample of the reference profile in said reference database that best matches said patient profile. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
-
Specification