×

Long fragment de novo assembly using short reads

  • US 10,726,942 B2
  • Filed: 08/25/2014
  • Issued: 07/28/2020
  • Est. Priority Date: 08/23/2013
  • Status: Active Grant
First Claim
Patent Images

1. A method of determining a sequence of a first chromosomal region of an organism, the method comprising:

  • receiving, at a computer system, sequence data from a sequencing of a plurality of nucleic acid molecules of the organism, wherein the sequence data for each of the plurality of nucleic acid molecules includes;

    one or more sequence reads of at least one portion of the nucleic acid molecule, anda label corresponding to the one or more sequence reads, the label indicating an origin of the nucleic acid molecule, wherein the sequence data includes at least 1,000 sequence reads;

    receiving, by the computer system, a first contig of the first chromosomal region;

    analyzing the at least 1,000 sequence reads to determine a group of sequence reads of the sequence data that overlap with an end sequence of the first contig, the group including sequence reads with a plurality of different labels indicating different origins of corresponding nucleic acid molecules, the different origins including different haplotypes; and

    extending, by the computer system, the first contig using the group of sequence reads of the sequence data that overlap with the end sequence of the first contig.

View all claims
  • 4 Assignments
Timeline View
Assignment View
    ×
    ×