×

USING VERTEX SELF-INFORMATION SCORES FOR VERTICES IN AN ENTITY GRAPH TO DETERMINE WHETHER TO PERFORM ENTITY RESOLUTION ON THE VERTICES IN THE ENTITY GRAPH

  • US 20160012149A1
  • Filed: 07/09/2014
  • Published: 01/14/2016
  • Est. Priority Date: 07/09/2014
  • Status: Active Grant
First Claim
Patent Images

1. A computer program product for entity resolution of records in a database, the computer program product comprising a computer readable storage medium having computer readable program code embodied therein that when executed performs operations, the operations comprising:

  • determining pairs of records in the database having a relationship value satisfying a threshold;

    generating an entity relationship graph having a vertex for each of the records of the pairs and an edge for each of the determined pairs between two vertices representing records in one of the determined pairs, wherein each vertex is associated with a self-information score based on content in the record represented by the vertex and is assigned an initial unique entity identifier and an entity information score;

    for each subject vertex of the vertices, performing;

    determining a target vertex directly connected to the subject vertex that has a highest entity information score of at least one vertex directly connected to the subject vertex that has an entity information score greater than the entity information score of the subject vertex; and

    determining whether to set the subject vertex entity identifier and entity information score to the entity identifier and entity information score of the target vertex based on the target vertex self-information score.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×