SAFE SEQUENCING SYSTEM
First Claim
1. A method to analyze nucleic acid sequences, comprising:
- attaching a unique identifier nucleic acid sequence (UID) to a first end of each of a plurality of analyte nucleic acid fragments to form uniquely identified analyte nucleic acid fragments;
redundantly determining nucleotide sequence of a uniquely identified analyte nucleic acid fragment, wherein determined nucleotide sequences which share a UID form a family of members;
identifying a nucleotide sequence as accurately representing an analyte nucleic acid fragment when at least 1% of members of the family contain the sequence.
3 Assignments
0 Petitions
Accused Products
Abstract
The identification of mutations that are present in a small fraction of DNA templates is essential for progress in several areas of biomedical research. Though massively parallel sequencing instruments are in principle well-suited to this task, the error rates in such instruments are generally too high to allow confident identification of rare variants. We here describe an approach that can substantially increase the sensitivity of massively parallel sequencing instruments for this purpose. One example of this approach, called “Safe-SeqS” for (Safe-Sequencing System) includes (i) assignment of a unique identifier (UID) to each template molecule; (ii) amplification of each uniquely tagged template molecule to create UID-families; and (iii) redundant sequencing of the amplification products. PCR fragments with the same UID are truly mutant (“super-mutants”) if ≧95% of them contain the identical mutation. We illustrate the utility of this approach for determining the fidelity of a polymerase, the accuracy of oligonucleotides synthesized in vitro, and the prevalence of mutations in the nuclear and mitochondrial genomes of normal cells.
233 Citations
50 Claims
-
1. A method to analyze nucleic acid sequences, comprising:
-
attaching a unique identifier nucleic acid sequence (UID) to a first end of each of a plurality of analyte nucleic acid fragments to form uniquely identified analyte nucleic acid fragments; redundantly determining nucleotide sequence of a uniquely identified analyte nucleic acid fragment, wherein determined nucleotide sequences which share a UID form a family of members; identifying a nucleotide sequence as accurately representing an analyte nucleic acid fragment when at least 1% of members of the family contain the sequence. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 49, 50)
-
-
23. A method to analyze nucleic acid sequences, comprising:
-
attaching a unique identifier sequence (UID) to a first end of each of a plurality of analyte DNA fragments using at least two cycles of amplification with first and second primers to form uniquely identified analyte DNA fragments, wherein the UIDs are in excess of the analyte DNA fragments during amplification, wherein the first primers comprise; a first segment complementary to a desired amplicon; a second segment containing the UID; a third segment containing a universal priming site for subsequent amplification; and wherein the second primers comprise a universal priming site for subsequent amplification;
wherein each cycle of amplification attaches one universal priming site to a strand;amplifying the uniquely identified analyte DNA fragments to form a family of uniquely identified analyte DNA fragments from each uniquely identified analyte DNA fragment; and determining nucleotide sequences of a plurality of members of the family. - View Dependent Claims (24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35)
-
-
36. A method to analyze DNA using endogenous unique identifier sequences (UIDs), comprising:
-
attaching adapter oligonucleotides to ends of fragments of analyte DNA of between 30 to 2000 bases, inclusive, to form adapted fragments, wherein each end of a fragment before said attaching is an endogenous UID for the fragment; amplifying the adapted fragments using primers complementary to the adapter oligonucleotides to form families of adapted fragments; determining nucleotide sequence of a plurality of members of a family; comparing nucleotide sequences of the plurality of members of the family; and identifying a nucleotide sequence as accurately representing an analyte DNA fragment when at least 1% of members of the family contain the sequence. - View Dependent Claims (37, 38, 39, 40, 41)
-
-
42. A population of primer pairs, wherein each pair comprises a first and second primer for amplifying and identifying a gene or gene portion, wherein:
-
the first primer comprises a first portion of 10-100 nucleotides complementary to the gene or gene portion and a second portion of 10 to 100 nucleotides comprising a site for hybridization to a third primer; the second primer comprises a first portion of 10-100 nucleotides complementary to the gene or gene portion and a second portion of 10 to 100 nucleotides comprising a site for hybridization to a fourth primer, wherein interposed between the first portion and the second portion of the second primer is a third portion consisting of 2 to 4000 nucleotides forming a unique identifier (UID); wherein the unique identifiers in the population have at least 4 different sequences, wherein the first and second primers are complementary to opposite strands of the gene or gene portion. - View Dependent Claims (43, 44, 45, 46, 47, 48)
-
Specification