Methods and compositions for determining gene function
First Claim
1. A method for characterizing a cellular constituent as being associated or not being associated with a biological function, said method comprises identifying one or more response profiles associated with a known biological function as either correlating or not correlating with a response profile for the cellular constituent being characterized, wherein:
- (a) each of said one or more response profiles associated with said known biological function comprises changes of a plurality of cellular constituents in a biological sample in which a particular cellular constituent, other than the cellular constituent being characterized, that is associated with said known biological function is perturbed, (b) said response profile for the cellular constituent being characterized comprises changes of measured amounts of a plurality cellular constituents in a biological sample in which the cellular constituent being characterized is perturbed, and (c) either the cellular constituent being characterized is characterized as being associated with said known biological function if said response profile for said cellular constituent being characterized correlates with said response profile associated with said known biological function, or the cellular constituent being characterized is characterized as not being associated with said known biological function if said response profile for said cellular constituent being characterized does not correlate with said response profile associated with said known biological function.
3 Assignments
0 Petitions
Accused Products
Abstract
The invention relates to methods and systems (e.g., computer systems and computer program products) for characterizing cellular constituents, particularly genes and gene products. In particular, the invention provides methods for assigning or determining the biological function of uncharacterized genes and gene products by using “response profiles,” i.e., measurements of pluralities of cellular constituents in cells having a modified gene or gene product, as phenotypic markers for the gene or gene product. Methods are provided for clustering such response profiles so that similar or correlated response profiles are organized into the same cluster. The invention also provides databases or “compendiums” of response profiles to which the response profile of an uncharacterized gene or gene product can compared.
49 Citations
85 Claims
-
1. A method for characterizing a cellular constituent as being associated or not being associated with a biological function, said method comprises identifying one or more response profiles associated with a known biological function as either correlating or not correlating with a response profile for the cellular constituent being characterized, wherein:
-
(a) each of said one or more response profiles associated with said known biological function comprises changes of a plurality of cellular constituents in a biological sample in which a particular cellular constituent, other than the cellular constituent being characterized, that is associated with said known biological function is perturbed, (b) said response profile for the cellular constituent being characterized comprises changes of measured amounts of a plurality cellular constituents in a biological sample in which the cellular constituent being characterized is perturbed, and (c) either the cellular constituent being characterized is characterized as being associated with said known biological function if said response profile for said cellular constituent being characterized correlates with said response profile associated with said known biological function, or the cellular constituent being characterized is characterized as not being associated with said known biological function if said response profile for said cellular constituent being characterized does not correlate with said response profile associated with said known biological function. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 82)
-
-
23. A method for characterizing a cellular constituent as being associated or not associated with a particular biological function, in which said method comprises:
-
(a) clustering a plurality of response profiles, wherein each response profile in said plurality of response profiles comprises changes in measured amounts of a plurality of cellular constituents in a biological sample in which a particular cellular constituent is perturbed or modified, and said plurality of response profiles includes a response profile for the cellular constituent being characterized, said response profile for the cellular constituent being characterized comprising changes in measured amounts of a plurality of cellular constituents expressed in a biological sample in which the cellular constituent being characterized is perturbed or modified; and
(b) identifying one or more response profiles in said plurality of response profiles that cluster with the response profiles for the cellular constituent being characterized, said identified response profiles being associated with a known biological function, or identifying one or more response profiles in said plurality of response profiles that do not cluster with the response profiles for the cellular constituent being characterized, said identified response profiles being associated with a known biological function, wherein if said response profile associated with the cellular constituent being characterized is identified as clustering with said response profiles being associated with a known biological function, said cellular constituent is characterized as being associated with said known biological function, and if said response profile associated with the cellular constituent being characterized is identified as not clustering with said response profiles being associated with a known biological function, said cellular constituent is characterized as not being associated with said known biological function. - View Dependent Claims (24, 25, 26, 27, 28, 29, 30, 31, 32, 83)
-
-
33. A computer system for characterizing cellular constituents, said computer system comprising:
-
one or more processor units; and
one or more memory units connected to said one or more processor units, said one or more memory units containing one or more programs which cause said one or more processor units to execute steps of;
(a) receiving a data structure for a response profile of a cellular constituent to be characterized, said response profile of a cellular constituent to be characterized comprising changes of measured amounts of a plurality of cellular constituents in a biological sample in which the cellular constituent to be characterized is perturbed; and
(b) identifying one or more response profiles associated with a known biological function that correlate or do not correlate with said response profile of the cellular constituent to be characterized, wherein each of the one or more response profiles associated with said known biological function comprises changes of a plurality of cellular constituents in a biological sample in which a particular cellular constituent, other than the cellular constituent to be characterized, that is associated with said known biological function is perturbed, wherein if said response profile of the cellular constituent to be characterized correlates with said one or more response profiles associated with a known biological function, the cellular constituent to be characterized is characterized as being associated with said known biological function, and if said response profile of the cellular constituent to be characterized does not correlate with said one or more response profiles associated with a known biological function, the cellular constituent to be characterized is characterized as not being associated with said known biological function. - View Dependent Claims (34, 35)
-
-
36. A computer program product for use in conjunction with a computer having one or more memory units and one or more processor units, the computer program product comprising a computer readable storage medium having a computer program mechanism encoded thereon, wherein said computer program mechanism can be loaded into the one or more memory units of a computer and cause the one or more processor units of the computer to execute steps of:
-
(a) receiving a data structure for a response profile of a cellular constituent to be characterized, said response profile of a cellular constituent to be characterized comprising changes of measured amounts of a plurality of cellular constituents in a biological sample in which the cellular constituent to be characterized is perturbed; and
(b) identifying one or more response profiles associated with a known biological function that correlate with said response profile of the cellular constituent to be characterized, wherein each of the one or more response profiles associated with said known biological function comprises changes of a plurality of cellular constituents in a biological sample in which a particular cellular constituent, other than the cellular constituent to be characterized, that is associated with said known biological function is perturbed. - View Dependent Claims (37, 38)
-
-
39. A method for determining a biological function with which a cellular constituent of a cell type or organism is associated, comprising:
-
(a) determining measured amounts of a plurality of cellular constituents in a first cell of said cell type or of said organism in which said cellular constituent has been perturbed to create a first response profile;
(b) comparing said first response profile, or a predicted response profile derived therefrom, to a database comprising a plurality of landmark response profiles to determine the one or more landmark response profiles most similar to said first or predicted response profile, each landmark response profile comprising measured amounts of a plurality of cellular constituents in a second cell of said cell type or type of organism having a perturbation in a cellular constituent associated with a known biological function, wherein the known biological function of the cellular constituent perturbed in the one or more landmark response profiles determined in step (b) is the biological function with which said cellular constituent is associated. - View Dependent Claims (41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 84)
-
-
40. A method for determining a biological function with which a cellular constituent of a cell type or organism is associated, comprising:
-
comparing a first response profile or a predicted response profile derived therefrom to a database comprising a plurality of landmark response profiles to determine the one or more landmark response profiles most similar to said first or predicted response profile;
wherein said first response profile comprises measured amounts of a plurality of cellular constituents in a first cell of said cell type or of said organism in which said cellular constituent has been perturbed;
wherein each landmark response profile comprises measured amounts of a plurality of cellular constituents in a second cell of said cell type or type of organism having a perturbation to a cellular constituent associated with a known biological function; and
wherein the known biological function of the cellular constituent perturbed in the one or more landmark response profiles determined to be most similar is the biological function with which said cellular constituent is associated.
-
-
56. A method for characterizing a cellular constituent as being associated with a particular biological function, comprising:
-
(a) determining measured amounts of a plurality of cellular constituents in a first cell of a cell type or organism in which said cellular constituent being characterized is perturbed or modified to create a first response profile;
(b) clustering a plurality of response profiles, which comprise said first response profile and a plurality of landmark response profiles, each landmark response profile comprising measured amounts of a plurality of cellular constituents in a second cell of said cell type or type of organism having a perturbation or modification in a cellular constituent associated with a known biological function; and
(c) identifying one or more landmark response profiles in said plurality of landmark response profiles that cluster with the first response profile for the cellular constituent being characterized, said identified landmark response profiles being associated with a known biological function, wherein the cellular constituent being characterized is characterized as being associated with said known biological function. - View Dependent Claims (58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 85)
-
-
57. A method for characterizing a cellular constituent as being associated with a particular biological function, comprising:
-
(a) clustering a plurality of response profiles, which comprise;
(i) a first response profile comprising measured amounts of a plurality of cellular constituents in a first cell of a cell type or organism in which said cellular constituent being characterized is perturbed or modified; and
(ii) a plurality of landmark response profiles, each landmark response profile comprising measured amounts of a plurality of cellular constituents in a second cell of said cell type or type of organism having a perturbation or modification in a cellular constituent associated with a known biological function; and
(c) identifying one or more landmark response profiles in said plurality of landmark response profiles that cluster with the first response profile for the cellular constituent being characterized, said identified landmark response profiles being associated with a known biological function, wherein the cellular constituent being characterized is characterized as being associated with said known biological function.
-
-
70. A computer system for identifying a biological function with which a cellular constituent is associated, said computer system comprising:
-
one or more processor units; and
one or more memory units connected to said one or more processor units, said one or more memory units containing one or more programs which cause said one or more processor units to execute steps of;
(a) receiving a data structure for a first response profile comprising measured amounts of a plurality of cellular constituents in a first cell of said cell type or of said organism in which said cellular constituent has been perturbed; and
(b) comparing said first response profile, or a predicted response profile derived therefrom, to a database comprising a plurality of landmark response profiles to determine the one or more landmark response profiles most similar to said first or predicted response profile, each landmark response profile comprising measured amounts of a plurality of cellular constituents in a second cell of said cell type or type of organism having a perturbation in a cellular constituent associated with a known biological function, wherein the known biological function of the cellular constituent perturbed in the one or more landmark response profiles determined in step (b) is the biological function with which said cellular constituent is associated. - View Dependent Claims (72, 73, 74, 75)
-
-
71. A computer system for identifying a biological function with which a cellular constituent is associated, said computer system comprising:
-
one or more processor units; and
one or more memory units connected to said one or more processor units, said one or more memory units containing one or more programs which cause said one or more processor units to execute steps of;
comparing a first response profile or a predicted response profile derived therefrom to a database comprising a plurality of landmark response profiles to determine the one or more landmark response profiles most similar to said first or predicted response profile;
wherein said first response profile comprises measured amounts of a plurality of cellular constituents in a first cell of said cell type or of said organism in which said cellular constituent has been perturbed;
wherein each landmark response profile comprises measured amounts of a plurality of cellular constituents in a second cell of said cell type or type of organism having a perturbation to a cellular constituent associated with a known biological function; and
wherein the known biological function of the cellular constituent perturbed in the one or more landmark response profiles determined to be most similar is the biological function with which said cellular constituent is associated.
-
-
76. A computer program product for use in conjunction with a computer having one or more memory units and one or more processor units, the computer program product comprising a computer readable storage medium having a computer program mechanism encoded thereon, wherein said computer program mechanism can be loaded into the one or more memory units of a computer and cause the one or more processor units of the computer to execute steps of:
-
(a) receiving a data structure for a first response profile comprising measured amounts of a plurality of cellular constituents in a first cell of said cell type or of said organism in which said cellular constituent has been perturbed; and
(b) comparing said first response profile, or a predicted response profile derived therefrom, to a database comprising a plurality of landmark response profiles to determine the one or more landmark response profiles most similar to said first or predicted response profile, each landmark response profile comprising measured amounts of a plurality of cellular constituents in a second cell of said cell type or type of organism having a perturbation in a cellular constituent associated with a known biological function, wherein the known biological function of the cellular constituent perturbed in the one or more landmark response profiles determined in step (b) is the biological function with which said cellular constituent is associated. - View Dependent Claims (78, 79, 80, 81)
-
-
77. A computer program product for use in conjunction with a computer having one or more memory units and one or more processor units, the computer program product comprising a computer readable storage medium having a computer program mechanism encoded thereon, wherein said computer program mechanism can be loaded into the one or more memory units of a computer and cause the one or more processor units of the computer to execute steps of:
-
comparing a first response profile or a predicted response profile derived therefrom to a database comprising a plurality of landmark profiles to determine the one or more landmark response profiles most similar to said first or predicted response profile;
wherein said first response profile comprises measured amounts of a plurality of cellular constituents in a first cell of said cell type or of said organism in which said cellular constituent has been perturbed;
wherein each landmark response profile comprises measured amounts of a plurality of cellular constituents in a second cell of said cell type or type of organism having a perturbation to a cellular constituent associated with a known biological function; and
wherein the known biological function of the cellular constituent perturbed in the one or more landmark response profiles determined to be most similar is the biological function with which said cellular constituent is associated.
-
Specification