Genomic Sequencing
Abstract
Genomic sequencing is implemented for high-throughput applications that can include short reads. In one example, whole-genome sequencing involves a method in which a subset of fragments of a target genome is selected as a random function, and each fragment is replicated into clones. The clones are ordered into clone contigs based on sets of overlapping clones, and potential read overlaps are determined from clone read data. The method can also involve reading local assemblies of contigs from regions smaller than a clone length and assembling the local assemblies into read sets, combining the assembled read sets into clone-sized regions, and assembling the clone-sized regions into clone contigs. Overlapping sets of clones and their ordering can be determined computationally from read data, with a high depth of clone coverage providing a large number of boundaries on which the assemblies can be segmented into overlapping regions of pooled reads.
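As a rough illustration of how overlapping sets of clones might be determined computationally from read data, the Python sketch below links two clones whenever their pooled reads share enough k-mers, then groups linked clones into contigs. The shared k-mer criterion, the union-find grouping, and every name used (`kmers`, `clone_contigs`, `k`, `min_shared`) are assumptions made for illustration and are not taken from the patent text.

```python
from itertools import combinations


def kmers(seq, k):
    """Return the set of all length-k substrings of seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def clone_contigs(clone_reads, k=8, min_shared=1):
    """Group clones into contigs (hypothetical helper, illustration only).

    clone_reads maps a clone name to the list of reads pooled for that clone.
    Two clones are treated as overlapping when their reads share at least
    min_shared distinct k-mers; overlapping clones end up in the same group.
    """
    # Pool the k-mers of every read belonging to a clone.
    signatures = {
        name: set().union(*(kmers(read, k) for read in reads))
        for name, reads in clone_reads.items()
    }

    # Union-find over clone names.
    parent = {name: name for name in clone_reads}

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for a, b in combinations(clone_reads, 2):
        if len(signatures[a] & signatures[b]) >= min_shared:
            parent[find(a)] = find(b)

    groups = {}
    for name in clone_reads:
        groups.setdefault(find(name), []).append(name)
    return sorted(sorted(group) for group in groups.values())
```

The sketch only groups clones. Ordering the clones within a contig would need additional positional evidence, which is not modeled here.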
16 Claims
1. A method for genome sequencing, the method comprising:
as a random function, selecting a subset of fragments of a target genome;
replicating each fragment into clones;
ordering the clones into clone contigs based on sets of overlapping clones;
determining potential read overlaps from clone read data and validating base pairs of each read;
reading local assemblies of contigs from regions smaller than a clone length and assembling the local assemblies into read sets;
combining the assembled read sets into clone-sized regions; and
assembling the clone-sized regions into clone contigs.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
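The random selection and overlap-determination steps of claim 1 can be pictured with a short Python sketch. It assumes fragments and reads are plain strings, uses uniform sampling without replacement as the "random function", and treats an exact suffix-prefix match as a potential read overlap. These choices, and the names `select_fragments`, `overlap_length`, and `potential_overlaps`, are hypothetical; the claim does not specify them. Base-pair validation is not modeled.

```python
import random


def select_fragments(fragments, fraction, seed=None):
    """Pick a random subset of fragments (uniform, without replacement)."""
    rng = random.Random(seed)
    count = min(len(fragments), max(1, round(len(fragments) * fraction)))
    return rng.sample(fragments, count)


def overlap_length(left, right, min_len=3):
    """Length of the longest suffix of left that equals a prefix of right.

    Returns 0 when no exact match of at least min_len bases exists.
    """
    for n in range(min(len(left), len(right)), min_len - 1, -1):
        if left[-n:] == right[:n]:
            return n
    return 0


def potential_overlaps(reads, min_len=3):
    """Map (i, j) index pairs to overlap length for every ordered read pair."""
    overlaps = {}
    for i, a in enumerate(reads):
        for j, b in enumerate(reads):
            if i == j:
                continue
            n = overlap_length(a, b, min_len)
            if n > 0:
                overlaps[(i, j)] = n
    return overlaps
```

The pairwise comparison is quadratic in the number of reads, which is acceptable for a sketch but would presumably be replaced by an index-based approach at sequencing scale.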
9. A method for genome sequencing that uses validated clones generated from a subset of fragments of a target genome and ordered into clone contigs based on sets of overlapping clones, the method comprising:
reading local assemblies of contigs from validated clone regions smaller than a clone length and assembling the local assemblies into read sets;
combining the assembled read sets into clone-sized regions; and
assembling the clone-sized regions into clone contigs.
Dependent claims: 10, 11, 12, 13, 14, 15
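The three steps of claim 9 form a hierarchy: reads are assembled within sub-clone regions, the regional assemblies are combined per clone, and the clone-sized regions are then assembled into contigs. The Python sketch below mirrors that hierarchy using a simple greedy merge on exact suffix-prefix overlaps. The greedy strategy, the nested-list input layout, and the names `greedy_assemble` and `hierarchical_assemble` are assumptions chosen for illustration, not the patented procedure.

```python
def overlap_length(left, right, min_len):
    """Longest suffix of left equal to a prefix of right (0 if below min_len)."""
    for n in range(min(len(left), len(right)), min_len - 1, -1):
        if left[-n:] == right[:n]:
            return n
    return 0


def greedy_assemble(seqs, min_len=3):
    """Repeatedly merge the pair of sequences with the largest overlap."""
    seqs = list(dict.fromkeys(seqs))  # drop exact duplicates, keep order
    while True:
        best_n, best_i, best_j = 0, None, None
        for i, a in enumerate(seqs):
            for j, b in enumerate(seqs):
                if i != j:
                    n = overlap_length(a, b, min_len)
                    if n > best_n:
                        best_n, best_i, best_j = n, i, j
        if best_n == 0:
            return seqs  # nothing left to merge
        merged = seqs[best_i] + seqs[best_j][best_n:]
        seqs = [s for k, s in enumerate(seqs) if k not in (best_i, best_j)]
        seqs.append(merged)


def hierarchical_assemble(clones, min_len=3):
    """clones: list of clones; each clone is a list of regions (lists of reads)."""
    clone_sized = []
    for regions in clones:
        local = []
        for reads in regions:
            # Step 1: local assembly within a region smaller than a clone.
            local.extend(greedy_assemble(reads, min_len))
        # Step 2: combine the local assemblies into a clone-sized region.
        clone_sized.extend(greedy_assemble(local, min_len))
    # Step 3: assemble clone-sized regions into clone contigs.
    return greedy_assemble(clone_sized, min_len)
```

Restricting each merge to reads from one region keeps the search space small, which is one plausible reading of why the claim assembles locally before combining. The sketch ignores sequencing errors, reverse complements, and reads contained strictly inside another sequence.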
16. A storage device comprising data representing computer-executable instructions that, in response to being accessed and executed by a computer, cause performance of a method for genome sequencing that uses validated clones generated from a subset of fragments of a target genome and ordered into clone contigs based on sets of overlapping clones, the method including the steps of:
reading local assemblies of contigs from validated clone regions smaller than a clone length and assembling the local assemblies into read sets;
combining the assembled read sets into clone-sized regions; and
assembling the clone-sized regions into clone contigs.
Specification