Methods for genome assembly and haplotype phasing
First Claim
Patent Images
1. A method comprising:
- contacting a sample to a stabilizing agent, said sample comprising a nucleic acid molecule complexed to at least one nucleic acid binding protein;
cleaving the nucleic acid into a plurality of segments comprising at least a first segment and a second segment;
attaching the first segment and the second segment at a junction;
obtaining at least some sequence on each side of the junction to generate a first read pair;
mapping the first read pair to a set of contigs; and
determining a path through the set of contigs that represents an order and/or orientation to a genome, wherein the path through the set of contigs that represents an order and/or orientation to the genome is determined so that each contig is visited exactly once.
1 Assignment
0 Petitions
Accused Products
Abstract
The disclosure provides methods to assemble genomes of eukaryotic or prokaryotic organisms. The disclosure further provides methods for haplotype phasing and meta-genomics assemblies.
82 Citations
38 Claims
-
1. A method comprising:
-
contacting a sample to a stabilizing agent, said sample comprising a nucleic acid molecule complexed to at least one nucleic acid binding protein; cleaving the nucleic acid into a plurality of segments comprising at least a first segment and a second segment; attaching the first segment and the second segment at a junction; obtaining at least some sequence on each side of the junction to generate a first read pair; mapping the first read pair to a set of contigs; and determining a path through the set of contigs that represents an order and/or orientation to a genome, wherein the path through the set of contigs that represents an order and/or orientation to the genome is determined so that each contig is visited exactly once. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A method comprising:
-
contacting a sample to a stabilizing agent, said sample comprising a nucleic acid molecule complexed to at least one nucleic acid binding protein; cleaving the nucleic acid into a plurality of segments comprising at least a first segment and a second segment; attaching the first segment and the second segment at a junction; obtaining at least some sequence on each side of the junction to generate a first read pair; mapping the first read pair to a set of contigs; and determining a path through the set of contigs that represents an order and/or orientation to a genome, wherein determining a path through the set of contigs that represents an order and/or orientation to the genome comprises down-weighing contigs that represent promiscuous regions of the genome. - View Dependent Claims (21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38)
-
Specification