×

Automatic discovery of new entities using graph reconciliation

  • US 10,331,706 B1
  • Filed: 10/04/2017
  • Issued: 06/25/2019
  • Est. Priority Date: 10/04/2013
  • Status: Active Grant
First Claim
Patent Images

1. A computer system comprising:

  • memory storing;

    a target data graph, anda plurality of source data graphs generated from analysis of source documents, where each source data graph;

    is associated with a source document having an associated domain,has a source entity that does not exist in the target data graph, andincludes fact tuples, where a fact tuple identifies;

    a subject entity,an object entity, anda relationship connecting the subject entity to the object entity;

    at least one processor; and

    memory storing instructions that, when executed by the at least one processor, cause the computer system to perform operations including;

    generating a cluster of source data graphs, the cluster including source data graphs associated with a first source entity that shares at least two fact tuples that have the first source entity as the subject entity and a determinative relationship as the relationship connecting the subject entity to the object entity,iteratively splitting the cluster of source data graphs into a plurality of buckets of the source data graphs based on a plurality of fact tuples, wherein each iteration operates on a distinct determinative relationship and each of the buckets for the iteration includes source data graphs that share a value for the fact tuple,discarding one or more buckets from the cluster,generating a reconciled graph by merging the source data graphs remaining in the cluster, andgenerating a suggested new entity and entity relationships for the target data graph based on the reconciled graph.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×