×

Systems and methods for analyzing viral nucleic acids

  • US 10,724,110 B2
  • Filed: 02/03/2016
  • Issued: 07/28/2020
  • Est. Priority Date: 09/01/2015
  • Status: Active Grant
First Claim
Patent Images

1. A method for analyzing a virus, the method comprising using at least one hardware processor connected to a tangible memory subsystem to perform:

  • obtaining a plurality of sequence reads from a sample containing viral nucleic acid;

    creating, in the tangible memory subsystem, a viral DNA reference graph representing a plurality of known viral sequences, the viral DNA reference graph comprising a directed acyclic graph (DAG) comprising a plurality of vertices stored as objects in the tangible memory subsystem, wherein sequence strings of the known viral sequences that match each other when aligned are each represented by a single object and sequence strings that vary are represented as alternate objects, wherein at least one sequence string comprises a plurality of symbols, and wherein each object is stored in the tangible memory subsystem as a sequence string and a list of one or more pointers to adjacent objects, wherein each pointer identifies a physical location in the tangible memory subsystem at which an adjacent object is stored, such that the objects are linked to represent each of the known viral sequences as a path through the viral DNA reference graph;

    aligning the sequence reads to paths through the viral DNA reference graph, wherein the aligning comprises calculating match scores between a sequence read and a sequence string associated with each vertex, and looking backwards at each vertex to predecessor vertices if and only if a symbol comprises the first symbol of the sequence string associated with its vertex to select a path with an optimal score;

    retrieving, based on the aligned sequence reads to paths through the viral DNA reference graph, an identity of a virus for the one of the plurality of known viral sequences; and

    providing a report that includes the identity of the virus.

View all claims
  • 12 Assignments
Timeline View
Assignment View
    ×
    ×