×

Using cell-free DNA fragment size to determine copy number variations

  • US 10,095,831 B2
  • Filed: 12/16/2016
  • Issued: 10/09/2018
  • Est. Priority Date: 02/03/2016
  • Status: Active Grant
First Claim
Patent Images

1. A method, implemented using a computer system comprising one or more processors and system memory, for determining a copy number variation (CNV) of a nucleic acid sequence of interest in a test sample comprising cell-free nucleic acid fragments originating from two or more genomes, the method comprising:

  • (a) receiving, by the computer system, sequence reads obtained by sequencing the cell-free nucleic acid fragments in the test sample;

    (b) aligning, by the one or more processors, the sequence reads of the cell-free nucleic acid fragments or aligning fragments containing the sequence reads to bins of a reference genome comprising the sequence of interest, thereby providing test sequence tags, wherein the reference genome is divided into a plurality of bins;

    (c) determining fragment sizes of at least some of the cell-free nucleic acid fragments present in the test sample;

    (d) for cell-free nucleic acid fragments determined as being in a first size domain, calculating, by the one or more processors, first coverages of the sequence tags for the bins of the reference genome by, for each bin;

    (i) determining a number of sequence tags aligning to the bin, and(ii) normalizing the number of sequence tags aligning to the bin by accounting for bin-to-bin variations due to factors other than copy number variation;

    (e) for cell-free nucleic acid fragments determined as being in a second size domain, calculating, by the one or more processors, second coverages of the sequence tags for the bins of the reference genome by, for each bin;

    (i) determining a number of sequence tags aligning to the bin, and(ii) normalizing the number of sequence tags aligning to the bin by accounting for bin-to-bin variations due to factors other than copy number variation; and

    (f) determining a copy number variation in the sequence of interest using a likelihood ratio calculated from the first coverages and the second coverages.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×