Using vertex self-information scores for vertices in an entity graph to determine whether to perform entity resolution on the vertices in the entity graph

  • US 10,108,746 B2
  • Filed: 01/30/2015
  • Issued: 10/23/2018
  • Est. Priority Date: 07/09/2014
  • Status: Active Grant
First Claim

1. A method performed by a computer program executed by a processor to perform entity resolution of records in a database implemented in a computer storage device, comprising:

  • determining pairs of records in the database having a relationship value satisfying a threshold;

    generating an entity relationship graph having a vertex for each of the records of the pairs and an edge for each of the determined pairs between two vertices representing records in one of the determined pairs, wherein each vertex is associated with a self-information score based on content in the record represented by the vertex and is assigned an initial unique entity identifier and an entity information score, which is initially set to the self-information score of the vertex; and

    determining whether to update the entity information score and entity identifier for each subject vertex of the vertices by performing, for each subject vertex of the vertices:

    determining a target vertex directly connected to the subject vertex that has a highest entity information score of at least one vertex directly connected to the subject vertex that has an entity information score greater than the entity information score of the subject vertex; and

    setting the subject vertex entity identifier and entity information score to the entity identifier and entity information score of the target vertex in response to a target vertex self-information score satisfying a criteria to perform entity resolution for the record represented by the subject vertex.
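
The steps recited in claim 1 can be read as a single pass over an undirected graph. The Python sketch below is only an illustration of that reading, not the patented implementation. Several points are assumptions that the claim text does not state: the function name `resolve_entities` and its parameter names are hypothetical, "satisfying a threshold" and "satisfying a criteria" are both modeled as a simple `>=` comparison, self-information scores are supplied by the caller rather than computed from record content, and vertices are visited once in sorted order.

```python
from collections import defaultdict


def resolve_entities(self_info, scored_pairs, pair_threshold, resolve_threshold):
    """Illustrative single-pass reading of claim 1 (all names are hypothetical).

    self_info:         dict mapping record id -> self-information score
    scored_pairs:      iterable of (record_a, record_b, relationship_value)
    pair_threshold:    assumed minimum relationship value for a pair to be kept
    resolve_threshold: assumed minimum target self-information score
    """
    # Step 1: keep the pairs whose relationship value satisfies the threshold.
    pairs = [(a, b) for a, b, value in scored_pairs if value >= pair_threshold]

    # Step 2: build the graph, with one vertex per record that appears in a
    # kept pair and one edge per kept pair.
    neighbors = defaultdict(set)
    for a, b in pairs:
        neighbors[a].add(b)
        neighbors[b].add(a)

    # Each vertex starts with its own identifier as a unique entity identifier
    # and with an entity information score equal to its self-information score.
    entity_id = {v: v for v in neighbors}
    entity_score = {v: self_info[v] for v in neighbors}

    # Step 3: for each subject vertex, find the directly connected vertex with
    # the highest entity information score that exceeds the subject's score.
    for subject in sorted(neighbors):
        better = [
            n for n in sorted(neighbors[subject])
            if entity_score[n] > entity_score[subject]
        ]
        if not better:
            continue
        target = max(better, key=lambda n: entity_score[n])

        # Assumed criterion: the target's self-information score must reach
        # resolve_threshold before the subject adopts the target's entity.
        if self_info[target] >= resolve_threshold:
            entity_id[subject] = entity_id[target]
            entity_score[subject] = entity_score[target]

    return entity_id, entity_score


# Made-up data: three records in a chain r1 - r2 - r3; the r1/r3 pair falls
# below the pair threshold and produces no edge.
ids, scores = resolve_entities(
    self_info={"r1": 9.0, "r2": 5.0, "r3": 2.0},
    scored_pairs=[("r1", "r2", 0.9), ("r2", "r3", 0.8), ("r1", "r3", 0.2)],
    pair_threshold=0.5,
    resolve_threshold=4.0,
)
print(ids)  # {'r1': 'r1', 'r2': 'r1', 'r3': 'r1'}
```

In this made-up example all three records end up under the identifier of `r1`, the record with the highest self-information score. The outcome depends on the visiting order chosen in the sketch; the claim as quoted here does not specify an order or whether the pass is repeated.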
