×

Computer-based method for finding similar objects using a taxonomy

  • US 7,533,096 B2
  • Filed: 07/12/2006
  • Issued: 05/12/2009
  • Est. Priority Date: 07/12/2006
  • Status: Expired due to Fees
First Claim
Patent Images

1. A computer-based method of finding similar items labeled in a taxonomy comprising the steps of:

  • determining a label LA representing a set of concepts that a target object T and a candidate object C have in common, said target object T and candidate object C part of a taxonomy structure as acyclic graphs wherein at least one child class has multiple parents;

    determining information content I(LA) of label LA representing said set of common concepts;

    combining individual information content I(LT) and I(LC), where I(LT) and I(LC) represent individual information content of labels of target object and candidate object, respectively,finding similarity between said target object and said candidate object in said taxonomy as a function of I(LA) and I(LT)+I(LC), andwherein said information content I(LA), I(LT), and I(LC) are functions of inclusion probabilities p(LA), p(LT), and p(LC), respectively, said inclusion probability of label L defined as the probability that an ancestor graph of label L of an object o chosen at random from a corpus custom character

    contains L, and said similarity between said target object T and said candidate object C is found based on the following mathematical function;

    sim

    ( T , C )
    = 2

    I

    ( L A )
    I

    ( L T )
    + I

    ( L C )
    ,
    and said inclusion probability is given by;


    pi(L)=p(L custom character

    Terms(Anc(o))).

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×