SYSTEMS AND METHODS TO DETECT RARE MUTATIONS AND COPY NUMBER VARIATION
First Claim
1. A system comprising:
- a communication interface that receives, over a communication network, sequencing reads comprising cfDNA molecule sequences and non-unique molecular barcode sequences generated by a nucleic acid sequencer; and
a computer in communication with the communication interface, wherein the computer comprises one or more computer processors and a computer readable medium comprising machine-executable code that, upon execution by the one or more computer processors, implements a method comprising;
(i) receiving, over the communication network, the sequencing reads generated by the nucleic acid sequencer;
(ii) mapping the sequence reads to one or more reference sequences from a human genome;
(iii) grouping the sequence reads into families, each of the families comprising sequence reads comprising identical barcode sequences and having the same start and stop positions, wherein each of the families comprises sequencing reads corresponding to a same original cell free deoxyribonucleic acid (cfDNA) molecule;
(iv) at each genetic locus of a plurality of genetic loci in the one or more reference sequences, collapsing sequence reads in each family to yield a base call for each family at the genetic locus; and
(v) measuring a frequency of one or more bases called at the locus from among the families.
1 Assignment
0 Petitions
Accused Products
Abstract
The present disclosure provides a system and method for the detection of rare mutations and copy number variations in cell free polynucleotides. Generally, the systems and methods comprise sample preparation, or the extraction and isolation of cell free polynucleotide sequences from a bodily fluid; subsequent sequencing of cell free polynucleotides by techniques known in the art; and application of bioinformatics tools to detect rare mutations and copy number variations as compared to a reference. The systems and methods also may contain a database or collection of different rare mutations or copy number variation profiles of different diseases, to be used as additional references in aiding detection of rare mutations, copy number variation profiling or general genetic profiling of a disease.
1 Citation
30 Claims
-
1. A system comprising:
-
a communication interface that receives, over a communication network, sequencing reads comprising cfDNA molecule sequences and non-unique molecular barcode sequences generated by a nucleic acid sequencer; and a computer in communication with the communication interface, wherein the computer comprises one or more computer processors and a computer readable medium comprising machine-executable code that, upon execution by the one or more computer processors, implements a method comprising; (i) receiving, over the communication network, the sequencing reads generated by the nucleic acid sequencer; (ii) mapping the sequence reads to one or more reference sequences from a human genome; (iii) grouping the sequence reads into families, each of the families comprising sequence reads comprising identical barcode sequences and having the same start and stop positions, wherein each of the families comprises sequencing reads corresponding to a same original cell free deoxyribonucleic acid (cfDNA) molecule; (iv) at each genetic locus of a plurality of genetic loci in the one or more reference sequences, collapsing sequence reads in each family to yield a base call for each family at the genetic locus; and (v) measuring a frequency of one or more bases called at the locus from among the families. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22)
-
-
23. A system comprising:
-
a communication interface that receives, over a communication network, sequencing reads comprising cfDNA molecule sequences and non-unique molecular barcode sequences generated by a nucleic acid sequencer; and a computer in communication with the nucleic acid sequencer over a communication network, wherein the computer comprises one or more computer processors and a computer readable medium comprising non-transitory machine-executable code that, upon execution by the one or more computer processors, reduces noise and distortion in the signal by; (i) receiving, over the communication network, the sequencing reads generated by the nucleic acid sequencer; (ii) mapping the sequence reads to one or more reference sequences from a human genome; (iii) grouping the sequence reads into families, each of the families comprising sequence reads comprising identical barcodes and having the same start and stop positions, wherein each of the families comprises sequencing reads corresponding to a same original cell free deoxyribonucleic acid (cfDNA) molecule from the cfDNA molecules; (iv) at each genetic locus of a plurality of genetic loci in the one or more reference sequences; (1) collapsing sequence reads in each family to yield a base call for each family at the genetic locus; and
measuring a frequency of one or more bases called at the locus from among the families; and(2) counting families mapping to each genetic locus of a plurality of genetic loci. - View Dependent Claims (24, 25, 26, 27, 28, 29, 30)
-
Specification