×

Field-based similarity search system and method

  • US 8,306,755 B2
  • Filed: 05/23/2011
  • Issued: 11/06/2012
  • Est. Priority Date: 03/23/2001
  • Status: Expired due to Fees
First Claim
Patent Images

1. A similarity search method, comprising:

  • generating a feature database which stores data pertaining to a candidate molecule, as executed by a processor of a computer, said database comprising a hash table having entries which are generated based on;

    a set of descriptors generated from conformations of fragment graphs of said candidate molecule, said fragment graphs including plural fragment nodes connected by rotatable bond edges, a specific conformation of said fragment node comprising a fragment of said candidate molecule, and two neighboring fragments connected by a rotatable bond at a specific dihedral angle comprising a fragment pair; and

    a context-adapted descriptor-to-key mapping which maps said set of descriptors to a set of feature keys comprising indices that label grid cells in discriminant space;

    generating scoops, descriptors and keys for a query molecule;

    identifying a match between a query molecule fragment pair feature and a candidate molecule fragment pair feature by comparing said keys of said query molecule to said keys in said feature database, a correspondence comprising a scoop for said candidate molecule stored in said feature database having a same key as a scoop for said query molecule;

    using said match to align a fragment pair of said candidate molecule to said query molecule by overlaying internal coordinate axes of said scoops for said candidate and query molecules, said correspondence implying an alignment of said candidate molecule and query molecule fragment pairs;

    determining the number of candidate molecule fragment pairs the number of candidate molecule fragment pairs in the set, Cfp, being given by Cfp=(n−

    1)·

    (Cfrag)2·

    Crbe, where n is the number of fragments in said candidate molecule and n−

    1 is the number of rotatable bond edges connecting said n fragments, Cfrag is the number of conformations in a fragment of said fragments, and Crbe is the number of steps in which said rotatable bond edges are sampled;

    assembling the entirety of the fragment pairs of said candidate molecule to form an alignment thereof onto said query molecule;

    retrieving a candidate molecule fragment pair having at least a predetermined number of matching features; and

    displaying a result of said retrieving said candidate molecule fragment pair.

View all claims
  • 5 Assignments
Timeline View
Assignment View
    ×
    ×