Systems and methods to detect rare mutations and copy number variation
DCFirst Claim
1. A method for quantifying single nucleotide variant tumor markers in cell-free DNA from a subject, comprising:
- (a) providing at least 10 ng of cell-free DNA obtained from a bodily sample of the subject;
(b) attaching tags comprising barcodes having from 5 to 1000 distinct barcode sequences to said cell-free DNA obtained from said bodily sample of the subject, to generate non-uniquely tagged parent polynucleotides, wherein each barcode sequence is at least 5 nucleotides in length;
(c) amplifying the non-uniquely tagged parent polynucleotides to produce amplified non-uniquely tagged progeny polynucleotides;
(d) sequencing the amplified non-uniquely tagged progeny polynucleotides to produce a plurality of sequence reads from each parent polynucleotide, wherein each sequence read comprises a barcode sequence and a sequence derived from cell-free DNA;
(e) grouping the plurality of sequence reads produced from each non-uniquely tagged parent polynucleotide into families based on i) the barcode sequence and ii) at least one of;
sequence information at a beginning of the sequence derived from cell-free DNA, sequence information at an end of the sequence derived from cell-free DNA, and length of the sequence read, whereby each family comprises sequence reads of non-uniquely tagged progeny polynucleotides amplified from a unique polynucleotide among the non-uniquely tagged parent polynucleotides;
(f) comparing the sequence reads grouped within each family to each other to determine consensus sequences for each family, wherein each of the consensus sequences corresponds to a unique polynucleotide among the non-uniquely tagged parent polynucleotides;
(g) providing one or more reference sequences from a human genome, said one or more reference sequences comprising one or more loci of reported tumor markers, wherein each of the reported tumor markers is a single nucleotide variant;
(h) identifying consensus sequences that map to a given locus of said one or more loci of reported tumor markers; and
(i) calculating a number of consensus sequences that map to the given locus that include the single nucleotide variant thereby quantifying single nucleotide variant tumor markers in said cell-free DNA from said subject.
1 Assignment
Litigations
1 Petition
Accused Products
Abstract
The present disclosure provides a system and method for the detection of rare mutations and copy number variations in cell free polynucleotides. Generally, the systems and methods comprise sample preparation, or the extraction and isolation of cell free polynucleotide sequences from a bodily fluid; subsequent sequencing of cell free polynucleotides by techniques known in the art; and application of bioinformatics tools to detect rare mutations and copy number variations as compared to a reference. The systems and methods also may contain a database or collection of different rare mutations or copy number variation profiles of different diseases, to be used as additional references in aiding detection of rare mutations, copy number variation profiling or general genetic profiling of a disease.
392 Citations
17 Claims
-
1. A method for quantifying single nucleotide variant tumor markers in cell-free DNA from a subject, comprising:
-
(a) providing at least 10 ng of cell-free DNA obtained from a bodily sample of the subject; (b) attaching tags comprising barcodes having from 5 to 1000 distinct barcode sequences to said cell-free DNA obtained from said bodily sample of the subject, to generate non-uniquely tagged parent polynucleotides, wherein each barcode sequence is at least 5 nucleotides in length; (c) amplifying the non-uniquely tagged parent polynucleotides to produce amplified non-uniquely tagged progeny polynucleotides; (d) sequencing the amplified non-uniquely tagged progeny polynucleotides to produce a plurality of sequence reads from each parent polynucleotide, wherein each sequence read comprises a barcode sequence and a sequence derived from cell-free DNA; (e) grouping the plurality of sequence reads produced from each non-uniquely tagged parent polynucleotide into families based on i) the barcode sequence and ii) at least one of;
sequence information at a beginning of the sequence derived from cell-free DNA, sequence information at an end of the sequence derived from cell-free DNA, and length of the sequence read, whereby each family comprises sequence reads of non-uniquely tagged progeny polynucleotides amplified from a unique polynucleotide among the non-uniquely tagged parent polynucleotides;(f) comparing the sequence reads grouped within each family to each other to determine consensus sequences for each family, wherein each of the consensus sequences corresponds to a unique polynucleotide among the non-uniquely tagged parent polynucleotides; (g) providing one or more reference sequences from a human genome, said one or more reference sequences comprising one or more loci of reported tumor markers, wherein each of the reported tumor markers is a single nucleotide variant; (h) identifying consensus sequences that map to a given locus of said one or more loci of reported tumor markers; and (i) calculating a number of consensus sequences that map to the given locus that include the single nucleotide variant thereby quantifying single nucleotide variant tumor markers in said cell-free DNA from said subject. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17)
-
Specification