×

SYSTEMS AND METHODS FOR AUTOMATIC CLUSTERING AND CANONICAL DESIGNATION OF RELATED DATA IN VARIOUS DATA STRUCTURES

  • US 20170052958A1
  • Filed: 08/10/2016
  • Published: 02/23/2017
  • Est. Priority Date: 08/19/2015
  • Status: Active Grant
First Claim
Patent Images

1. A system comprising:

  • a data store configured to store computer-executable instructions and a plurality of records, wherein each record of the plurality of records is associated with a respective entity and comprises one or more fields;

    a computing device including a processor in communication with the data store, the processor configured to execute the computer-executable instructions to at least;

    identify, based at least in part on a first field of the one or more fields, a first group of the plurality of records;

    divide the first group into one or more record pairs, each of the one or more record pairs comprising a respective first record and second record;

    determine, for each of the one or more record pairs, a respective match score, the respective match scores comprising probabilities that the respective first record and second record of the respective record pairs are associated with a respective same entity;

    identify a cluster of record pairs, wherein each pair in the cluster has a record in common with at least one other pair in the cluster, and wherein each pair in the cluster has a respective match score above a threshold; and

    output the cluster of record pairs to a client computing device.

View all claims
  • 8 Assignments
Timeline View
Assignment View
    ×
    ×