High throughput genome sequencing on DNA arrays
First Claim
1. A method of determining the identification of a first nucleotide at a detection position of a target sequence comprising a plurality of detection positions, said method comprising:
- (a) providing a plurality of concatemers, wherein each concatemer comprises a plurality of monomers and each monomer comprises;
i) a first target domain of said target sequence comprising a first set of target detection positions;
ii) a first adaptor comprising a Type IIs endonuclease restriction site;
iii) a second target domain of said target sequence comprising a second set of target detection positions; and
iv) a second interspersed adaptor comprising a Type IIs endonuclease restriction site;
(b) identifying said first nucleotide.
1 Assignment
0 Petitions
Accused Products
Abstract
The present invention is directed to methods and compositions for acquiring nucleotide sequence information of target sequences using adaptors interspersed in target polynucleotides. The sequence information can be new, e.g. sequencing unknown nucleic acids, re-sequencing, or genotyping. The invention preferably includes methods for inserting a plurality of adaptors at spaced locations within a target polynucleotide or a fragment of a polynucleotide. Such adaptors may serve as platforms for interrogating adjacent sequences using various sequencing chemistries, such as those that identify nucleotides by primer extension, probe ligation, and the like. Encompassed in the invention are methods and compositions for the insertion of known adaptor sequences into target sequences, such that there is an interruption of contiguous target sequence with the adaptors. By sequencing both “upstream” and “downstream” of the adaptors, identification of entire target sequences may be accomplished.
-
Citations
42 Claims
-
1. A method of determining the identification of a first nucleotide at a detection position of a target sequence comprising a plurality of detection positions, said method comprising:
-
(a) providing a plurality of concatemers, wherein each concatemer comprises a plurality of monomers and each monomer comprises; i) a first target domain of said target sequence comprising a first set of target detection positions; ii) a first adaptor comprising a Type IIs endonuclease restriction site; iii) a second target domain of said target sequence comprising a second set of target detection positions; and iv) a second interspersed adaptor comprising a Type IIs endonuclease restriction site; (b) identifying said first nucleotide. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17)
-
-
18. A substrate comprising a plurality of immobilized concatemers, each monomer of said concatemer comprising:
-
a) a first target sequence; b) a first adaptor comprising a Type IIs endonuclease restriction site; c) a second target sequence; and d) a second interspersed adaptor comprising a Type IIs endonuclease restriction site. - View Dependent Claims (19, 20, 21, 22, 23, 24, 25, 26, 27)
-
-
28. A method of inserting multiple adaptors in a target sequence comprising:
-
(a) ligating a first adaptor to one terminus of said target sequence, wherein the adaptor comprises a binding site for a restriction enzyme; (b) circularizing the product from step (i) to create a first circular polynucleotide; (c) cleaving the circular polynucleotide with a restriction enzyme, wherein the restriction enzyme is able to bind to the binding site within the first adaptor; (d) ligating a second adaptor, wherein said second adaptor comprises a binding site for a restriction enzyme; (e) circularizing the product from step (d) to create a second circular polynucleotide; wherein steps (c) through (e) are optionally repeated to insert a desired number of adaptors in the target sequence. - View Dependent Claims (29, 30, 31, 32)
-
-
33. A method for identifying a nucleotide sequence of a target sequence, the method comprising the steps of:
-
(a) providing a plurality of interspersed adaptors within a target sequence, each interspersed adaptor having at least one boundary with the target sequence; and (b) determining the identity of at least one nucleotide adjacent to at least one boundary of at least two interspersed adaptors, thereby identifying a nucleotide sequence of the target sequence.
-
- 34. A library of polynucleotides comprising more than one nucleic acid fragment, each fragment comprising a plurality of interspersed adaptor, wherein each interspersed adaptor has t least one end having different non-cross-hybridizable sequence with respect to the sequences of every other interspersed adaptor of the plurality.
-
38. A method of identifying a nucleotide sequence of a target sequence, the method comprising the steps of:
-
(a) providing an amplicon from each of a plurality of fragments of the target sequence, each fragment containing a plurality of interspersed adaptors at predetermined sites, and each amplicon comprising multiple copies of a fragment and the amplicons including a number of fragments that substantially covers the target sequence; (b) providing a random array of amplicons fixed to a surface at a density such that at least a majority of the amplicons are optically resolvable; (c) hybridizing one or more sequencing probes to the random array under conditions that permit the formation of perfectly matched duplexes between the one or more sequencing probes and complementary sequences on the interspersed adaptors, (d) identifying at least one nucleotide adjacent to at least one interspersed adaptor by extending the one or more sequencing probes in a sequence specific reaction; and (e) repeating steps (c) and (d) until a nucleotide sequence of the target sequence is identified.
-
-
39. A method of identifying a nucleotide sequence of a target sequence, the method comprising the steps of:
-
(a) providing a random array of concatemers fixed to a planar surface, wherein said surface has an array of optically resolvable discrete spaced apart regions, and wherein each discrete spaced apart region has an area of less than 1 μ
m2 and substantially all such regions have at most one of said concatemers attached, each concatemer comprising multiple copies of a fragment of the target sequence, each such fragment continuing a plurality of interspersed adaptors at predetermined sites, and the number of different concatemers such that their respective fragments substantially cover the target polynucleotide;(b) hybridizing one or more probes from a first set of probes to the random array under conditions that permit the formation of perfectly matched duplexes between the one or more probes and complementary sequences on the concatemers; (c) hybridizing one or more probes from a second set of probes to the random array under conditions that permit the formation of perfectly matched duplexes between the one or more probes and complementary sequences on the concatemers; (d) ligating probes from the first and second sets which are hybridized to a concatemer at contiguous sites; (e) identifying the sequences of the ligated probes; and (f) repeating steps (b) through (e) to identify the nucleotide sequence of the target sequence.
-
-
40. A method of identifying a nucleotide sequence of a target sequence, the method comprising the steps of:
-
(a) providing a plurality of concatemers from the target sequence, each concatemer comprising multiple copies of a fragment of the target sequence, each fragment containing a plurality of interspersed adaptors at predetermined sites; (b) providing a random array of concatemers fixed to a surface at a density such that at least a majority of the concatemers are optically resolvable; (c) identifying a sequence of at least a portion of each fragment adjacent to at least one interspersed adaptor in at least one concatemer, thereby identifying a nucleotide sequence of the target sequence. - View Dependent Claims (41, 42)
-
Specification