×

Drug discovery methods

  • US 10,453,553 B2
  • Filed: 06/10/2013
  • Issued: 10/22/2019
  • Est. Priority Date: 02/04/2002
  • Status: Active Grant
First Claim
Patent Images

1. A method for evaluating user-supplied genomics data, comprising:

  • defining a profile model based on a profile definition criterion, wherein a profile in the profile model includes a subset of data contained in a structured database, the structured database comprising findings data, wherein the profile definition criterion comprises any of a combination of genes or gene products that form all or part of a disease related pathway, cells or cellular components, anatomical parts, molecular, cellular, or disease processes and relationships between them, a threshold profile size, findings connectivity metrics, or a combination thereof;

    building a library of profiles according to the profile model using the structured database, wherein the findings data of the structured database are structured according to predetermined, causal relationships among gene or gene products, the structured database comprising a knowledge base having a frame-based knowledge representation data model structured according to an ontology having slots and facets that define relationships between different instances in the ontology, and the structured database defining biological relationships that are at least one step removed;

    wherein building the library of profiled using the structured database comprises;

    querying the structured database to find networks of findings within the structured database that meet the profile definition criterion;

    extracting said networks of findings from the structured database;

    generating graph data structures based on said networks of findings thereby forming an initial set of profiles;

    performing a gene expression sensitivity test on the initial set of profiles;

    changing criteria of a subset of the initial set of profiles, based on said gene expression sensitivity test, thereby enlarging one or more profiles in the subset of profiles; and

    repeating the steps of performing a gene expression sensitivity test on the enlarged profiles and changing criteria of a subset of the enlarged profiles based on said gene expression sensitivity test, in a recursive manner, based on the profile definition criterion, wherein the repeating results in the library of profiles;

    receiving user-supplied genomics data;

    scoring each profile of the library of profiles against the user-supplied genomics data using a profile-to-data scoring algorithm;

    identifying one or more profiles in the library of profiles that have respective subsets of data that overlap at least a portion of the user-supplied genomics data;

    determining, for each such overlapped profile, whether the overlap with the user-supplied genomics data is statistically significant based on a computed P-value, wherein the P-value is a probability measure that indicates the likelihood that the overlap is due to chance; and

    presenting a list of one or more overlapped profiles determined to have a statistically significant overlap, wherein the list is ranked according to the P-value computed for each of the one or more statistically significant profiles,wherein each of the defining, building, receiving, scoring, identifying, determining, and presenting steps are performed by a processing system.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×