Systems and methods to detect rare mutations and copy number variation
DCFirst Claim
1. A method for generating a genetic profile of a tumor from a blood sample of double-stranded cell-free deoxyribonucleic acids (cfDNA) molecules from a subject having cancer or suspected of having a cancer, the method comprising:
- (a) obtaining a population comprising the double-stranded cfDNA molecules from the blood sample from the subject;
(b) ligating a set of molecular barcodes to both ends of a plurality of the double-stranded cfDNA molecules using more than a 30×
molar excess of molecular barcodes relative to the double-stranded cfDNA molecules to produce tagged parent polynucleotides, wherein a given molecular barcode is a member of a set of molecular barcodes comprising 2 to 1,000,000 different molecular barcode sequences,wherein at least 20% of the double-stranded cfDNA molecules from the population of cfDNA molecules are attached to molecular barcodes;
(c) amplifying a plurality of the tagged parent polynucleotides to produce progeny polynucleotides with associated molecular barcodes;
(d) selectively enriching a subset of the progeny polynucleotides for target regions associated with cancer, whereby enriched progeny polynucleotides are generated;
(e) sequencing a portion of the enriched progeny polynucleotides to produce sequencing reads of the progeny polynucleotides with associated molecular barcodes;
(f) aligning a plurality of the sequencing reads to a reference sequence;
(g) grouping a plurality of the sequencing reads into a plurality of families based at least on sequence information of the molecular barcodes, a start base position of a given sequencing read from among the sequencing reads at which the given sequencing read is determined to start aligning to the reference sequence, and a stop base position of the given sequencing read at which the given sequencing read is determined to stop aligning to the reference sequence, wherein a family of the plurality of families is representative of a cell-free nucleic acid molecule in the sample;
(h) detecting, from among the families, the presence or absence of somatic genetic variants; and
(i) quantifying a plurality of somatic genetic variants detected as present to generate the genetic profile of the tumor.
1 Assignment
Litigations
0 Petitions
Accused Products
Abstract
The present disclosure provides a system and method for the detection of rare mutations and copy number variations in cell free polynucleotides. Generally, the systems and methods comprise sample preparation, or the extraction and isolation of cell free polynucleotide sequences from a bodily fluid; subsequent sequencing of cell free polynucleotides by techniques known in the art; and application of bioinformatics tools to detect rare mutations and copy number variations as compared to a reference. The systems and methods also may contain a database or collection of different rare mutations or copy number variation profiles of different diseases, to be used as additional references in aiding detection of rare mutations, copy number variation profiling or general genetic profiling of a disease.
321 Citations
30 Claims
-
1. A method for generating a genetic profile of a tumor from a blood sample of double-stranded cell-free deoxyribonucleic acids (cfDNA) molecules from a subject having cancer or suspected of having a cancer, the method comprising:
-
(a) obtaining a population comprising the double-stranded cfDNA molecules from the blood sample from the subject; (b) ligating a set of molecular barcodes to both ends of a plurality of the double-stranded cfDNA molecules using more than a 30×
molar excess of molecular barcodes relative to the double-stranded cfDNA molecules to produce tagged parent polynucleotides, wherein a given molecular barcode is a member of a set of molecular barcodes comprising 2 to 1,000,000 different molecular barcode sequences,wherein at least 20% of the double-stranded cfDNA molecules from the population of cfDNA molecules are attached to molecular barcodes; (c) amplifying a plurality of the tagged parent polynucleotides to produce progeny polynucleotides with associated molecular barcodes; (d) selectively enriching a subset of the progeny polynucleotides for target regions associated with cancer, whereby enriched progeny polynucleotides are generated; (e) sequencing a portion of the enriched progeny polynucleotides to produce sequencing reads of the progeny polynucleotides with associated molecular barcodes; (f) aligning a plurality of the sequencing reads to a reference sequence; (g) grouping a plurality of the sequencing reads into a plurality of families based at least on sequence information of the molecular barcodes, a start base position of a given sequencing read from among the sequencing reads at which the given sequencing read is determined to start aligning to the reference sequence, and a stop base position of the given sequencing read at which the given sequencing read is determined to stop aligning to the reference sequence, wherein a family of the plurality of families is representative of a cell-free nucleic acid molecule in the sample; (h) detecting, from among the families, the presence or absence of somatic genetic variants; and (i) quantifying a plurality of somatic genetic variants detected as present to generate the genetic profile of the tumor. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 30)
-
-
16. A method for quantifying somatic genetic variants from a bodily fluid sample of cell-free nucleic acid molecules from a subject, the method comprising:
-
(a) ligating a set of molecular barcodes to both ends of a plurality of cell-free nucleic acid molecules from a population of the cell-free nucleic acid molecules obtained from the bodily fluid sample using more than a 30×
molar excess of molecular barcodes relative to the cell-free nucleic acid molecules to produce tagged parent polynucleotides,wherein at least 20% of the cell-free nucleic acid molecules from the population of cell-free nucleic acid molecules are attached to molecular barcodes; (b) amplifying a plurality of the tagged parent polynucleotides to produce progeny polynucleotides with associated molecular barcodes; (c) sequencing a portion of the progeny polynucleotides to produce sequencing reads of the progeny polynucleotides with associated molecular barcodes; (d) detecting, from among a plurality of the sequencing reads, the presence or absence of somatic genetic variants; and (e) quantifying a plurality of the somatic genetic variants detected as present. - View Dependent Claims (17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29)
-
Specification