Processing and Analysis of Complex Nucleic Acid Sequence Data
First Claim
Patent Images
1. A method of determining a sequence of a complex nucleic acid of one or more organisms, the method comprising:
- (a) receiving, at one or more computing devices, a plurality of reads of the complex nucleic acid; and
(b) producing, with the one or more computing devices, an assembled sequence of the complex nucleic acid from the reads, the assembled sequence comprising less than one false single nucleotide variant per megabase at a call rate of 70% or greater.
0 Assignments
0 Petitions
Accused Products
Abstract
The present invention is directed to logic for analysis of nucleic acid sequence data that employs algorithms that lead to a substantial improvement in sequence accuracy and that can be used to phase sequence variations, e.g., in connection with the use of the long fragment read (LFR) process.
133 Citations
39 Claims
-
1. A method of determining a sequence of a complex nucleic acid of one or more organisms, the method comprising:
-
(a) receiving, at one or more computing devices, a plurality of reads of the complex nucleic acid; and (b) producing, with the one or more computing devices, an assembled sequence of the complex nucleic acid from the reads, the assembled sequence comprising less than one false single nucleotide variant per megabase at a call rate of 70% or greater. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 36)
-
-
35. A method of determining a sequence of a human genome, the method comprising:
-
(a) receiving at one or more computing devices, a plurality of reads of the genome; and (b) producing, with the one or more computing devices, an assembled sequence of the genome from the reads, the assembled sequence comprising less than 600 false single nucleotide variants per gigabase at a genome call rate of 70% or greater. - View Dependent Claims (37)
-
-
38. A method of determining a sequence of a human genome, the method comprising:
-
(a) receiving, at one or more computing devices, a plurality of reads from each of a plurality of aliquots, each aliquot comprising a fragment of the human genome; and (b) producing, with the one or more computing devices, a phased, assembled sequence of the genome from the reads, the assembled sequence comprising less than 1000 false single nucleotide variants per gigabase at a genome call rate of 70% or greater. - View Dependent Claims (39)
-
Specification