×

Methods and systems for copy number variant detection

  • US 10,395,759 B2
  • Filed: 05/18/2015
  • Issued: 08/27/2019
  • Est. Priority Date: 05/18/2015
  • Status: Active Grant
First Claim
Patent Images

1. A method comprising:

  • receiving, by a computing device, a sample coverage data set comprising a plurality of genomic sequences obtained from sequencing of nucleic acid samples of a subject, and sample sequencing quality control (SSQC) metrics;

    grouping, by the computing device, sets of sequencing quality control (SQC) metrics into a multidimensional tree data structure according to similarity, wherein each set of SQC metrics is associated with a respective reference coverage data set that comprises a plurality of genomic regions and read depths;

    selecting, by the computing device, a reference panel of reference coverage data sets using the multidimensional tree data structure, wherein the selected reference coverage data sets have SQC metrics similar to the SSQC metrics;

    normalizing, by the computing device, the sample coverage data set and the reference panel;

    fitting, by the computing device, the normalized reference panel to a mixture model at each of the plurality of genomic regions to determine an expected coverage distribution at each of the plurality of genomic regions; and

    identifying one or more copy number variants (CNVs) by comparing, by the computing device, according to a Hidden Markov Model (HMM), the normalized sample coverage data set to the expected coverage distribution at each of the plurality of genomic regions from the mixture model.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×