Methods for identifying DNA copy number changes
First Claim
1. A method of estimating the copy number of a gene in a genomic sample comprising:
- amplifying the genomic sample in an amplification reaction comprising random primers and a strand displacing DNA polymerase to obtain an amplified sample;
fragmenting and labeling the amplified sample;
hybridizing the labeled fragments to an array of probes comprising a plurality of probes, wherein said plurality of probes comprises at least 100,000 different sequence probes wherein each probe is present on the array at a feature of known or determinable location and wherein each probe in the plurality is complementary to an expressed region of a gene;
analyzing a resulting hybridization pattern to obtain a hybridization intensity measurement for each of a plurality of features of the array;
comparing the hybridization intensity measurement for each feature to an expected hybridization intensity measurement for that feature; and
estimating the copy number of one or more genes based on the hybridization intensity measurement.
0 Assignments
0 Petitions
Accused Products
Abstract
Methods and computer software products for identifying changes in genomic DNA copy number are disclosed. Methods for identifying homozygous deletions and genetic amplifications are disclosed. Genomic DNA is amplified generically and amplified sample is hybridized to an expression array. The expression array comprises probes to regions of genes that are expressed. The probes are complementary to genomic sequences found in mRNAs. Signal intensity is correlated to copy number. The methods may be used to detect copy number changes in cancerous tissue compared to normal tissue. The methods may be used to diagnose cancer and other diseases associated with chromosomal anomalies.
127 Citations
22 Claims
-
1. A method of estimating the copy number of a gene in a genomic sample comprising:
-
amplifying the genomic sample in an amplification reaction comprising random primers and a strand displacing DNA polymerase to obtain an amplified sample;
fragmenting and labeling the amplified sample;
hybridizing the labeled fragments to an array of probes comprising a plurality of probes, wherein said plurality of probes comprises at least 100,000 different sequence probes wherein each probe is present on the array at a feature of known or determinable location and wherein each probe in the plurality is complementary to an expressed region of a gene;
analyzing a resulting hybridization pattern to obtain a hybridization intensity measurement for each of a plurality of features of the array;
comparing the hybridization intensity measurement for each feature to an expected hybridization intensity measurement for that feature; and
estimating the copy number of one or more genes based on the hybridization intensity measurement. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A method for estimating genomic copy number at a plurality of genomic regions in a genomic sample comprising:
-
amplifying the genomic sample to obtain an amplified genomic sample;
fragmenting the amplified genomic sample to obtain fragments;
labeling the fragments;
hybridizing the fragments to an expression array to generate a hybridization pattern, wherein the expression array comprises at least 10,000 probe sets;
analyzing the hybridization pattern to obtain a plurality of probe set signals, wherein a probe set signal is a normalized measurement of the hybridization signal for a probe set;
calculating a Z-score for each probe set using a mean and standard deviation calculated from a training data set;
mapping the chromosomal location of each probe set to obtain a plurality of mapped probe sets that map to a single chromosomal location;
calculating a Stouffer Z-score for each mapped probe set; and
identifying chromosomal regions of amplification or deletion based on Stouffer Z-score. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20, 22)
-
-
21. A computer software product for analyzing hybridization data for a genomic sample hybridized to an expression array, comprising a computer readable medium having computer-executable instructions for performing logic steps comprising:
-
inputting probe intensities from probes designed to interrogate for the presence of mRNA transcripts;
obtaining a normalized signal for a plurality of probe sets;
partitioning the data into a training set and a test set;
generating a signal mean and a standard deviation from the training set;
generating a Z-score for a plurality of probe sets;
identifying probe sets that map to chromosomal locations and comparing Z-scores for probe sets to estimate copy number for selected genomic regions.
-
Specification