×

Distributed data processing platform for metagenomic monitoring and characterization

  • US 10,127,352 B1
  • Filed: 12/30/2015
  • Issued: 11/13/2018
  • Est. Priority Date: 04/06/2015
  • Status: Active Grant
First Claim
Patent Images

1. A method comprising:

  • configuring a first processing node for communication with one or more additional processing nodes and with one or more of a plurality of geographically-distributed metagenomics sequencing centers via one or more networks;

    processing metagenomics sequencing results obtained from one or more of the metagenomics sequencing centers in the first processing node; and

    providing surveillance functionality relating to at least one designated biological issue on behalf of one or more requesting clients based at least in part on the processing of metagenomics sequencing results performed by the first processing node and related processing performed by one or more of the additional processing nodes;

    wherein each of the metagenomics sequencing centers is configured to perform metagenomics sequencing on biological samples from respective sample sources in a corresponding data zone;

    wherein processing the metagenomics sequencing results further comprises generating a hit abundance score vector for a given one of the biological samples wherein the hit abundance score vector comprises a plurality of entries corresponding to respective occurrence frequencies of at least one read of the given biological sample in respective target genomic sequences;

    wherein providing surveillance functionality further comprises;

    performing a preprocessing operation to reduce a biclustering sample space of a genomic comparison component;

    generating a hit abundance score matrix for the genomic comparison component comprising a plurality of the hit abundance score vectors wherein one of rows and columns of the hit abundance score matrix correspond to respective different ones of the biological samples and the other of the rows and columns of the hit abundance score matrix correspond to respective different ones of the target genomic sequences; and

    performing a biclustering operation on the hit abundance score matrix; and

    wherein the method is implemented by at least one processing device comprising a processor coupled to a memory.

View all claims
  • 9 Assignments
Timeline View
Assignment View
    ×
    ×