SEQUENCE TAG DIRECTED SUBASSEMBLY OF SHORT SEQUENCING READS INTO LONG SEQUENCING READS
First Claim
Patent Images
1. A method for generating sequence assemblies from short sequencing reads, comprising:
- a) fragmenting at least one member of an input library to produce a plurality of linear DNA fragments having a first fragment end and a second fragment end proximal to a fragmentation breakpoint,b) attaching a common nucleic acid adaptor to the first and second linear DNA fragment ends proximal to a fragmentation breakpoint, wherein the common adaptor comprise the same unique sequence tag,c) optionally amplifying the plurality of linear DNA fragments to produce a sequencing library comprising a plurality of amplified DNA fragments, wherein at least one of the plurality of amplified DNA fragments comprises;
i) sequence complementary to at least the unique sequence tag of an adaptor, andii) sequence complementary to at least a portion of a member of the input library,d) sequencing at least a portion of the DNA fragments, wherein the presence of a unique adaptor sequence tag in a plurality of fragment sequences thereby associates the fragment sequences having ends that were proximal to the same fragmentation breakpoint, ande) assembling the plurality of breakpoint tag-associated fragment sequences, or subassembly sequences comprising breakpoint-associated sequences, to generate longer subassembly sequences of the input library.
1 Assignment
0 Petitions
Accused Products
Abstract
The invention provides methods for preparing DNA sequencing libraries by assembling short read sequencing data into longer contiguous sequences for genome assembly, full length cDNA sequencing, metagenomics, and the analysis of repetitive sequences of assembled genomes.
-
Citations
20 Claims
-
1. A method for generating sequence assemblies from short sequencing reads, comprising:
-
a) fragmenting at least one member of an input library to produce a plurality of linear DNA fragments having a first fragment end and a second fragment end proximal to a fragmentation breakpoint, b) attaching a common nucleic acid adaptor to the first and second linear DNA fragment ends proximal to a fragmentation breakpoint, wherein the common adaptor comprise the same unique sequence tag, c) optionally amplifying the plurality of linear DNA fragments to produce a sequencing library comprising a plurality of amplified DNA fragments, wherein at least one of the plurality of amplified DNA fragments comprises; i) sequence complementary to at least the unique sequence tag of an adaptor, and ii) sequence complementary to at least a portion of a member of the input library, d) sequencing at least a portion of the DNA fragments, wherein the presence of a unique adaptor sequence tag in a plurality of fragment sequences thereby associates the fragment sequences having ends that were proximal to the same fragmentation breakpoint, and e) assembling the plurality of breakpoint tag-associated fragment sequences, or subassembly sequences comprising breakpoint-associated sequences, to generate longer subassembly sequences of the input library. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20)
-
Specification