Iterative and regenerative DNA sequencing method
First Claim
1. A method for identifying a first nucleotide n and a second nucleotide n+x in a double stranded nucleic acid segment, comprising:
- a) digesting said double stranded nucleic acid segment with a restriction enzyme to produce a double stranded molecule having a single stranded overhang sequence corresponding to an enzyme cut site;
b) providing an adaptor having a cycle identification tag, a restriction enzyme recognition domain, a sequence identification region, and a detectable label;
c) hybridizing said adaptor to said double stranded nucleic acid having said single-stranded overhang sequence to form a ligated molecule;
d) identifying said nucleotide n by identifying said ligated molecule;
e) amplifying said ligated molecule from step (d) with a primer specific for said cycle identification tag of said adaptor; and
f) repeating steps (a) through (d) on said amplified molecule from step (e) to yield the identity of said nucleotide n+x, wherein x is less than or equal to the number of nucleotides between a recognition domain for a restriction enzyme and an enzyme cut site.
1 Assignment
0 Petitions
Accused Products
Abstract
An iterative and regenerative method for sequencing DNA is described. This method sequences DNA in discrete intervals starting at one end of a double stranded DNA segment. This method overcomes problems inherent in other sequencing methods, including the need for gel resolution of DNA fragments and the generation of artifacts caused by single-stranded DNA secondary structures. A particular advantage of this invention is that it can create offset collections of DNA segments and sequence the segments in parallel to provide continuous sequence information over long intervals. This method is also suitable for automation and multiplex automation to sequence large sets of segments.
-
Citations
191 Claims
-
1. A method for identifying a first nucleotide n and a second nucleotide n+x in a double stranded nucleic acid segment, comprising:
-
a) digesting said double stranded nucleic acid segment with a restriction enzyme to produce a double stranded molecule having a single stranded overhang sequence corresponding to an enzyme cut site;
b) providing an adaptor having a cycle identification tag, a restriction enzyme recognition domain, a sequence identification region, and a detectable label;
c) hybridizing said adaptor to said double stranded nucleic acid having said single-stranded overhang sequence to form a ligated molecule;
d) identifying said nucleotide n by identifying said ligated molecule;
e) amplifying said ligated molecule from step (d) with a primer specific for said cycle identification tag of said adaptor; and
f) repeating steps (a) through (d) on said amplified molecule from step (e) to yield the identity of said nucleotide n+x, wherein x is less than or equal to the number of nucleotides between a recognition domain for a restriction enzyme and an enzyme cut site. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27)
-
-
28. A method for sequencing an interval within a double stranded nucleic acid segment by identifying a first nucleotide n and a second nucleotide n+x in a plurality of staggered double stranded molecules produced from said double stranded nucleic acid segment, comprising:
-
a) attaching an enzyme recognition domain to different positions along said double stranded nucleic acid segment within an interval no greater than the distance between a recognition domain for a restriction enzyme and an enzyme cut site, such attachment occurring at one end of said double. stranded nucleic acid segment;
b) digesting said double stranded nucleic acid segment with a restriction enzyme to produce a plurality of staggered double stranded molecules each having a single stranded overhang sequence corresponding to said cut site;
c) providing an adaptor having a restriction enzyme recognition domain, a sequence identification region, and a detectable label;
d) hybridizing said adaptor to said double stranded nucleic acid having said single-stranded overhang sequence to form a ligated molecule;
e) identifying a nucleotide n within a staggered double stranded molecule by identifying said ligated molecule;
f) repeating steps (b) through (e) to yield the identity of said nucleotide n+x in each of said staggered double stranded molecules having said single strand overhang sequence thereby sequencing an interval within said double stranded nucleic acid segment, wherein x is greater than one and no greater than the number of nucleotides between a recognition domain for a restriction enzyme and an enzyme cut site. - View Dependent Claims (29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55)
-
-
56. A method for identifying a first nucleotide n and a second nucleotide n+x in a double stranded nucleic acid segment, comprising:
-
a) digesting said double stranded nucleic acid segment with a restriction enzyme to produce a double stranded molecule having a 5′
single stranded overhang sequence corresponding to an enzyme cut site;
b) identifying said nucleotide n by template-directed polymerization with a labeled nucleotide or nucleotide terminator;
c) providing an adaptor having a cycle identification tag and a restriction enzyme recognition domain;
d) ligating said adaptor to said double stranded nucleic acid to form a ligated molecule;
e) amplifying said ligated molecule from step (d) with a primer specific for said cycle identification tag of said adaptor; and
f) repeating steps (a) through (b) on said amplified molecule from step (e) to yield the identity of said nucleotide n+x, wherein x is less than or equal to the number of nucleotides between a recognition domain for a restriction enzyme and an enzyme cut site. - View Dependent Claims (57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86)
-
-
87. A method for sequencing an interval within a double stranded nucleic acid segment by identifying a first nucleotide n and a second nucleotide n+x in a plurality of staggered double stranded molecules produced from said double stranded nucleic acid segment, comprising:
-
a) attaching an enzyme recognition domain to different positions along said double stranded nucleic acid segment within an interval no greater than the distance between a recognition domain for a restriction enzyme and an enzyme cut site, such attachment occurring at one end of said double stranded nucleic acid segment;
b) digesting said double stranded nucleic acid segment with a restriction enzyme to produce a plurality of staggered double stranded molecules each having a 5′
single stranded overhang sequence corresponding to said cut site;
c) identifying a nucleotide n within a staggered double stranded molecule by template-directed polymerization with a labeled nucleotide or nucleotide terminator;
d) providing an adaptor having a restriction enzyme recognition domain;
e) ligating said adaptor to said double stranded nucleic acid to form a ligated molecule;
f) repeating steps (b) through (c) to yield the identity of said nucleotide n+x in each of said staggered double stranded molecules having said single strand overhang sequence thereby sequencing an interval within said double stranded nucleic acid segment, wherein x is greater than one and no greater than the number of nucleotides between a recognition domain for a restriction enzyme and an enzyme cut site. - View Dependent Claims (88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118)
-
-
119. A method for removing all or a part of a primer sequence from a primer extended product, comprising:
-
a) providing a primer sequence encoding a methylated portion of a restriction endonuclease recognition domain, wherein recognition of said domain by a restriction endonuclease requires at least one methylated nucleotide;
b) polymerizing by a template-directed primer extension using said primer and a nucleic acid segment to generate a primer extended product; and
c) digesting said primer extended product with a restriction endonuclease that recognizes the resulting double-stranded restriction endonuclease recognition domain encoded by said primer sequence in said primer extended product. - View Dependent Claims (120, 121, 122, 123, 124, 125, 126, 127)
-
-
128. A method for blocking a restriction endonuclease recognition domain in a primer extended product, comprising:
-
a) providing a primer with at least one modified nucleotide, wherein said modified nucleotide blocks an enzyme recognition domain, and at least a portion of said enzyme recognition domain sequence is encoded in said primer. b) polymerizing by a template-directed primer extension using said primer and a nucleic acid segment to generate a primer extended product; and
c) digesting said primer extended product with an enzyme that recognizes a double-stranded enzyme recognition domain in said primer extended product. - View Dependent Claims (129, 130, 131, 132, 133, 134, 135)
-
-
136. A method for automated sequencing of double-stranded DNA segments with nested single strand overhang templates, such method comprising the steps of
i) providing a support array having a plurality of sample holders arrayed in a matrix of positions on the support ii) immobilizing a plurality of double-stranded DNA segments at respective sample holders of said array, each DNA segment having an end comprising a single-strand overhang template sequence no long than about twenty nucleotides in length iii) simultaneously treating all sample holders with one or more reagents which selectively react with at least one nucleotide of said single-strand overhang template to effectively label the material at each holder iv) reading said array by automated scan detection to thereby determine at least one nucleotide of said single-strand overhang template, and v) reducing length of each strand of said DNA segment at each holder by a fixed number n> - 1 at said overhang end to produce a homologously ordered array of shorter and nested DNA segments, each with a single-strand overhang template sequence, and further performing steps iii) and iv) to determine at least one nucleotide at each single-strand overhang sequence, wherein the steps of treating, reading and reducing the length of the strands of the DNA segment at each holder by a number of n>
1 nucleotides are iteratively performed as automated process steps to produce nested and progressively shorter DNA segments and to sequence the plurality of DNA segments immobilized at the array of sample holders in situ. - View Dependent Claims (137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157)
- 1 at said overhang end to produce a homologously ordered array of shorter and nested DNA segments, each with a single-strand overhang template sequence, and further performing steps iii) and iv) to determine at least one nucleotide at each single-strand overhang sequence, wherein the steps of treating, reading and reducing the length of the strands of the DNA segment at each holder by a number of n>
-
158. A method for automated sequencing of double stranded DNA segments, such method being characterized by steps of
attaching a recognition domain to each segment to form a set of DNA segments having the recognition domain nested at an interval no greater than the distance between the recognition domain and its cut site for a given enzyme that recognizes said recognition domain treating the DNA segments with an enzyme that recognizes said attached recognition domain, and cuts each strand of each DNA segment to create an overhang template at a distance of > - 1 nucleotide along the DNA segment from said recognition domain, and thereby generating a set of nested overhang templates.
determining at least one nucleotide of each of said nested overhang templates, and thereafter reducing length of each strand at the end of the DNA segment with the overhang template by >
1 nucleotide to produce a corresponding set of shorter DNA segments each with an overhang template, said step of reducing being performed by removing a block of nucleotides, whereby each shorter DNA segment with an overhang template is a known subinterval of a previous DNA segment with overhang.
- 1 nucleotide along the DNA segment from said recognition domain, and thereby generating a set of nested overhang templates.
-
159. A method for automated sequencing of double-stranded DNA segments, such method comprising the steps of
i) providing a support array having a plurality of sample holders arrayed in a matrix of positions on the support ii) immobilizing a plurality of double-stranded DNA segments at respective sample holders of said array, each DNA segment having an end comprising a single-strand overhang template sequence no long than about twenty nucleotides in length iii) simultaneously treating all sample holders with one or more reagents which selectively react with at least one nucleotide of said single-strand overhang template to effectively label the material at each holder iv) reading said array by automated scan detection to thereby determine at least one nucleotide of said single-strand overhang template v) regenerating material at the respective sample holders by DNA amplification in vitro vi) reducing length of each strand of said DNA segment at each holder by a fixed number n≧ - 1 at said overhang end to produce a homologously ordered array of trimmed DNA segments, each with a single-strand overhang template sequence, and further performing step iii) to determine at least one nucleotide at each single-strand overhang sequence, wherein the steps of treating, reading, reducing lengths and product regeneration are iteratively performed as automated process steps to produce progressively trimmed DNA segments and to sequence the plurality of DNA segments immobilized at the array of sample holders in situ.
- View Dependent Claims (160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180)
-
181. A method for automated sequencing of double stranded DNA segments, such method being characterized by steps of
attaching a recognition domain to each segment to form DNA segments having the recognition domain. regenerating the template precursor by DNA amplification in vitro treating the DNA segments with an enzyme that recognizes said attached recognition domain, and cuts each strand of each DNA segment to create an overhang template at a distance of ≧ - 1 nucleotide along the DNA segment from said recognition domain
determining at least one nucleotide of said overhang template, and thereafter reducing length of each strand at the end of the DNA segment with the overhang template by ≧
1 nucleotide to produce a corresponding set of trimmed DNA segments each with an overhang template, said step of reducing being performed by removing a block of nucleotides, whereby each trimmed DNA segment with an overhang template is a known subinterval of a previous DNA segment with overhang.
- 1 nucleotide along the DNA segment from said recognition domain
-
182. A method for identifying a first nucleotide n and a second nucleotide n+x in a double stranded nucleic acid segment, comprising:
-
a) digesting said double stranded nucleic acid segment with a restriction enzyme to produce a double stranded molecule having a single stranded overhang sequence corresponding to an enzyme cut site;
b) providing an adaptor having a cycle identification tag, a restriction enzyme recognition domain and a sequence identification region;
c) hybridizing said adaptor to said double stranded nucleic acid having said single-stranded overhang sequence to form a ligated molecule;
d) amplifying said ligated molecule from step (c) with a labeled primer specific for said cycle identification tag, restriction enzyme recognition domain, and a portion of said sequence identification region of said adaptor;
e) identifying said nucleotide n by identifying said primer incorporated into the amplification product; and
f) repeating steps (a) through (e) on said amplified molecule from step (e) to yield the identity of said nucleotide n+x, wherein x is less than or equal to the number of nucleotides between a recognition domain for a restriction enzyme and an enzyme cut site.
-
-
183. A method for identifying a first nucleotide n and a second nucleotide n+x in a double stranded nucleic acid segment, comprising:
-
a) digesting said double stranded nucleic acid segment with a restriction enzyme, resulting in a trimmed end in said double stranded molecule;
b) providing an adaptor having a cycle identification tag and a restriction enzyme recognition domain;
c) ligating said adaptor to the trimmed end of said double stranded nucleic acid to form a ligated molecule;
d) amplifying said ligated molecule from step (c) with a labeled primer specific for said cycle identification tag and said restriction enzyme recognition domain of the adaptor, and for a nucleotide in said trimmed end in said double stranded molecule;
e) identifying said nucleotide n by identifying said primer incorporated into the amplification product; and
f) repeating steps (a) through (e) on said amplified molecule from step (e) to yield the identity of said nucleotide n+x, wherein x is less than or equal to the number of nucleotides between a recognition domain for a restriction enzyme and an enzyme cut site.
-
-
184. A method for removing all or part of a primer sequence from a primer extended product comprising:
-
a) providing a primer sequence comprising at least a portion of a restriction endonuclease recognition domain;
b) polymerizing by a template-directed primer extension using said primer, a methylated nucleotide, and a nucleic acid segment to generate a primer extended product during nucleic acid amplification in vitro, wherein the non-methylated nucleotide corresponding to the methylated nucleotide is contained within said portion of the recognition domain sequence in said primer sequence; and
c) digesting said primer extended product with a restriction endonuclease that recognizes the resulting hemi-methylated double-stranded restriction endonuclease recognition domain of said primer sequence in said primer extended product, and does not recognize the double-methylated products resulting from said nucleic acid amplification in vitro. - View Dependent Claims (185, 186, 187, 188, 189, 190, 191)
-
Specification