Methods and compositions for sequencing modified nucleic acids
First Claim
Patent Images
1. A method for identifying sequence differences between two nucleic acids from different sources, the method comprising:
- a) providing a first nucleic acid from a first sample;
b) providing a second nucleic acid from a second sample;
c) treating the first nucleic acid to produce a modified nucleic acid;
d) denaturing the modified nucleic acid and the second nucleic acid;
e) annealing the modified nucleic acid to the second nucleic acid, thereby producing hybrid nucleic acids that comprise a modified strand from the first sample and an unmodified strand from the second sample, and further wherein a subset of the hybrid nucleic acids comprise non-complementary regions of one or more base pairs due to sequence differences between the modified strand and the unmodified strand;
f) binding a non-complementary-region-specific binding agent selected from the group consisting of a single-strand-specific binding agent and a mismatch-repair agent to the subset of the hybrid nucleic acids, wherein the non-complementary-region-specific binding agent comprises a selectable tag;
g) capturing the subset of the hybrid nucleic acids bound to the non-complementary-region-specific binding agent using the selectable tag;
h) removing nucleic acids not bound to the non-complementary-region-specific binding agent;
i) subjecting the subset of the hybrid nucleic acids to a single type of sequencing reaction in which both the first and second strands of the hybrid nucleic acids are sequenced and both sequence data and modification data are provided in a single sequence read;
j) for each sequence read, analyzing the sequencing data to identify said non-complementary regions; and
k) for each sequence read, analyzing the modification data to determine which subset of said sequence read corresponds to the modified strand and which subset corresponds to the unmodified strand, thereby identifying sequence differences between the first nucleic acid from the first sample and the second nucleic acid from the second sample.
1 Assignment
0 Petitions
Accused Products
Abstract
Methods, compositions, and systems are provided for characterization of modified nucleic acids. In certain preferred embodiments, single molecule sequencing methods are provided for identification of modified nucleotides within nucleic acid sequences. Modifications detectable by the methods provided herein include chemically modified bases, enzymatically modified bases, abasic sites, non-natural bases, secondary structures, and agents bound to a template nucleic acid.
-
Citations
12 Claims
-
1. A method for identifying sequence differences between two nucleic acids from different sources, the method comprising:
-
a) providing a first nucleic acid from a first sample; b) providing a second nucleic acid from a second sample; c) treating the first nucleic acid to produce a modified nucleic acid; d) denaturing the modified nucleic acid and the second nucleic acid; e) annealing the modified nucleic acid to the second nucleic acid, thereby producing hybrid nucleic acids that comprise a modified strand from the first sample and an unmodified strand from the second sample, and further wherein a subset of the hybrid nucleic acids comprise non-complementary regions of one or more base pairs due to sequence differences between the modified strand and the unmodified strand; f) binding a non-complementary-region-specific binding agent selected from the group consisting of a single-strand-specific binding agent and a mismatch-repair agent to the subset of the hybrid nucleic acids, wherein the non-complementary-region-specific binding agent comprises a selectable tag; g) capturing the subset of the hybrid nucleic acids bound to the non-complementary-region-specific binding agent using the selectable tag; h) removing nucleic acids not bound to the non-complementary-region-specific binding agent; i) subjecting the subset of the hybrid nucleic acids to a single type of sequencing reaction in which both the first and second strands of the hybrid nucleic acids are sequenced and both sequence data and modification data are provided in a single sequence read; j) for each sequence read, analyzing the sequencing data to identify said non-complementary regions; and k) for each sequence read, analyzing the modification data to determine which subset of said sequence read corresponds to the modified strand and which subset corresponds to the unmodified strand, thereby identifying sequence differences between the first nucleic acid from the first sample and the second nucleic acid from the second sample. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method for identifying loci where two nucleic acids from two different sources are non-complementary, the method comprising:
-
a) providing a first nucleic acid from a first sample; b) providing a second nucleic acid from a second sample; c) denaturing the first nucleic acid and the second nucleic acid; d) annealing the first nucleic acid to the second nucleic acid, thereby producing hybrid nucleic acids that comprise a first strand from the first sample and a second strand from the second sample, and further wherein a subset of the hybrid nucleic acids comprise non-complementary regions of one or more base pairs due to sequence differences between the first strand and the second strand; e) performing mismatch repair on the hybrid nucleic acids using modified nucleotides comprising a selectable tag, thereby generating repaired hybrid nucleic acids; f) isolating the repaired hybrid nucleic acids using the selectable tag; g) subjecting the repaired hybrid nucleic acids to a single type of sequencing reaction in which both the first and second strands of the repaired hybrid nucleic acids are sequenced and both sequence data and modification data are provided in a single sequence read; and h) for each sequence read, analyzing the sequence data and modification data to determine which subset of said sequence read comprises the modified nucleotides, thereby identifying loci at which the first nucleic acid and second nucleic acid were non-complementary. - View Dependent Claims (9, 10, 11, 12)
-
Specification