MULTIPLE TAGGING OF LONG DNA FRAGMENTS
First Claim
1. A method for sequence analysis of a target nucleic acid comprising:
(a) combining a plurality of long DNA fragments of the target nucleic acid with a population of tag-containing sequences, wherein the population comprises at least 1000 different tag sequences;
(b) producing tagged long fragments, wherein each tagged long fragment comprises target nucleic acid sequence and multiple interspersed tag sequences, wherein the multiple interspersed tag sequences in an individual tagged long fragment may be the same or different;
(c) producing from each tagged long fragment a plurality of tagged subfragments, wherein the tagged subfragments each comprise one or more tag sequences;
(d) obtaining sequence of individual tagged subfragments, wherein the obtained sequence includes target nucleic acid sequence and at least one tag sequence;
(e) combining sequences obtained in (d) to produce assembled sequence(s) of the target nucleic acid, wherein the combining comprises (i) determining that sequences obtained in (d) originated from the same long DNA fragment if said sequences comprise the same tag sequence and/or (ii) identifying pairs of sequences as being adjacent sequences in the target nucleic acid if the pair comprise the same tag sequence.
Abstract
The present invention provides methods and compositions for tagging long fragments of a target nucleic acid for sequencing and analyzing the resulting sequence information in order to reduce errors and perform haplotype phasing, for example.
30 Citations
72 Claims
1. A method for sequence analysis of a target nucleic acid comprising:
(a) combining a plurality of long DNA fragments of the target nucleic acid with a population of tag-containing sequences, wherein the population comprises at least 1000 different tag sequences;
(b) producing tagged long fragments, wherein each tagged long fragment comprises target nucleic acid sequence and multiple interspersed tag sequences, wherein the multiple interspersed tag sequences in an individual tagged long fragment may be the same or different;
(c) producing from each tagged long fragment a plurality of tagged subfragments, wherein the tagged subfragments each comprise one or more tag sequences;
(d) obtaining sequence of individual tagged subfragments, wherein the obtained sequence includes target nucleic acid sequence and at least one tag sequence;
(e) combining sequences obtained in (d) to produce assembled sequence(s) of the target nucleic acid, wherein the combining comprises (i) determining that sequences obtained in (d) originated from the same long DNA fragment if said sequences comprise the same tag sequence and/or (ii) identifying pairs of sequences as being adjacent sequences in the target nucleic acid if the pair comprise the same tag sequence.
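Clause (e)(i) of claim 1 amounts to grouping reads by shared tag. The following is a minimal Python sketch of one way that grouping could be done; it is an illustration, not the method disclosed in the specification. The function name `group_reads_by_tag`, the read identifiers, and the tag strings are hypothetical. Because a long fragment may carry different tags and a subfragment may carry more than one, the sketch assumes that reads linked through any chain of shared tags belong to the same long fragment, and uses a union-find structure to merge them.

```python
from collections import defaultdict


def group_reads_by_tag(reads):
    """Group reads that share at least one tag, directly or transitively.

    reads: dict mapping a read id to the set of tag sequences found in it.
    Returns a sorted list of sorted read-id lists, one per inferred fragment.
    """
    parent = {read_id: read_id for read_id in reads}

    def find(x):
        # Walk to the root, shortening the path as we go.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    def union(a, b):
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a

    # Merge every read with the first read seen carrying the same tag.
    first_read_with_tag = {}
    for read_id, tags in reads.items():
        for tag in tags:
            if tag in first_read_with_tag:
                union(first_read_with_tag[tag], read_id)
            else:
                first_read_with_tag[tag] = read_id

    groups = defaultdict(list)
    for read_id in reads:
        groups[find(read_id)].append(read_id)
    return sorted(sorted(members) for members in groups.values())


# Hypothetical reads: r2 carries two tags and so bridges r1 and r3.
example_reads = {
    "r1": {"ACGT"},
    "r2": {"ACGT", "TTAG"},
    "r3": {"TTAG"},
    "r4": {"GGCA"},
}
print(group_reads_by_tag(example_reads))
# [['r1', 'r2', 'r3'], ['r4']]
```

The transitive merge is a design choice of this sketch. If tags can collide between unrelated long fragments, a real pipeline would need an additional check, for example on mapped position, before merging.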
2-67. (canceled)
68. A method for sequence analysis of one or more target nucleic acid molecules comprising:
(a) producing a population of subfragments of a single tagged long fragment of the target nucleic acid, wherein the tagged long fragment comprises target nucleic acid sequence and multiple interspersed tag sequences, wherein a majority of the subfragments comprise target nucleic acid sequence and at least one tag sequence;
(b) obtaining sequence of individual tagged subfragments, wherein the obtained sequence includes target nucleic acid sequence and at least one tag sequence;
(c) combining sequences obtained in (b) to produce assembled sequence(s) of the target nucleic acid, wherein the combining comprises (i) determining that sequences obtained in (b) originated from the same long DNA fragment if said sequences comprise the same tag sequence and/or (ii) identifying pairs of sequences as being adjacent sequences in the target nucleic acid if the pair comprise the same tag sequence.
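Clause (ii) of the combining step treats a shared tag as evidence that two sequences are adjacent in the target. A minimal Python sketch of how that could be used to order subfragments follows. It rests on assumptions the claim does not state: each subfragment is taken to carry at most one tag at its left end and one at its right end, and each tag is taken to mark exactly one junction, so it appears at the right end of one subfragment and the left end of the next. The function name `order_subfragments` and all identifiers are hypothetical.

```python
def order_subfragments(subfragments):
    """Chain subfragments into their order along the long fragment.

    subfragments: list of (name, left_tag, right_tag) tuples, where a tag is
    None at an end of the long fragment that carries no tag.
    Returns the subfragment names in inferred order.
    """
    # Index subfragments by the tag at their left end.
    by_left_tag = {left: name for name, left, _ in subfragments if left is not None}
    right_tag_of = {name: right for name, _, right in subfragments}
    right_tags = {right for _, _, right in subfragments if right is not None}

    # The chain starts at the subfragment whose left tag matches no right tag.
    starts = [
        name
        for name, left, _ in subfragments
        if left is None or left not in right_tags
    ]
    if len(starts) != 1:
        raise ValueError("expected exactly one chain start")

    order = [starts[0]]
    while True:
        # Two subfragments are adjacent when they share the junction tag.
        next_name = by_left_tag.get(right_tag_of[order[-1]])
        if next_name is None or next_name in order:
            break
        order.append(next_name)
    return order


# Hypothetical input, deliberately out of order.
example = [("s2", "T1", "T2"), ("s3", "T2", None), ("s1", None, "T1")]
print(order_subfragments(example))
# ['s1', 's2', 's3']
```

The sketch stops at the first missing or repeated link rather than guessing, so a gap in coverage yields a shorter chain instead of a wrong order.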
69-70. (canceled)
71. A method of sequencing a target nucleic acid comprising:
(a) combining in a single reaction vessel (i) a plurality of long fragments of the target nucleic acid, and (ii) a population of polynucleotides, wherein each polynucleotide comprises a tag and a majority of the polynucleotides comprise a different tag;
(b) introducing into a majority of the long fragments tag-containing sequences from said population of polynucleotides to produce tagged long fragments, wherein each of the tagged long fragments comprises a plurality of the tag-containing sequences at a selected average spacing, and each tag-containing sequence comprises a tag;
(c) producing a plurality of subfragments from each tagged long fragment, wherein each subfragment comprises one or more tags;
(d) sequencing the subfragments to produce a plurality of sequence reads;
(e) assigning a majority of the sequence reads to corresponding long fragments; and
(f) assembling the sequence reads to produce an assembled sequence of the target nucleic acid.
72-95. (canceled)
Specification