×

Using relationships in candidate discovery

  • US 8,150,813 B2
  • Filed: 12/18/2008
  • Issued: 04/03/2012
  • Est. Priority Date: 12/18/2008
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method of resolving entities in an entity resolution system storing identity records related to a plurality of entities, the method comprising:

  • receiving a new identity record;

    identifying a set of candidate entities, from the plurality of entities, based upon a match between an attribute of the new identity record and corresponding attributes of one or more of the plurality of entities;

    identifying, from the plurality of entities not included in the set of candidate entities, a set of first-degree entities having a likeness score satisfying a threshold, wherein the likeness score for each first-degree entity is determined relative to a respective candidate entity;

    by operation of one or more processors, identifying, from the plurality of entities not included in the set of candidate entities and not included in the set of first-degree entities, a set of second-degree entities having a likeness score satisfying the threshold, wherein the likeness score for each second-degree entity is determined relative to a respective first-degree entity, wherein the threshold is based on a count of degrees of separation from a respective candidate entity, such that the threshold to be satisfied by the set of second-degree entities is stricter than the threshold to be satisfied by the set of first-degree entities;

    adding, to the set of candidate entities, the set of first-degree entities and the set of second-degree entities; and

    upon determining that the new identity record refers to a candidate entity in the set of candidate entities, including any added entities, conjoining the new identity record and the candidate entity to form a first conjoined entity, wherein the first conjoined entity is further conjoinable with a different entity of the plurality of entities to resolve an instance of data ambiguity.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×