
System and method for automatic weight generation for probabilistic matching

  • US 8,332,366 B2
  • Filed: 06/01/2007
  • Issued: 12/11/2012
  • Est. Priority Date: 06/02/2006
  • Status: Active Grant
First Claim

1. A computer-implemented method of automatically generating weights for associating a plurality of data records from one or more data sources at one or more physical locations, comprising:

  • at one or more computer systems or devices coupled to the one or more databases at the one or more physical locations;

    generating unmatched probabilities for a set of candidate data records, wherein the unmatched probabilities are computed per attribute for each pair of data records in the set of candidate data records;

    determining default discrepancy probabilities per attribute for each pair of data records in the set of candidate data records based upon a data quality parameter;

    calculating initial weights per attribute based upon the unmatched probabilities and the default discrepancy probabilities; and

    iterating a process comprising the steps of:

    comparing each pair of data records in the set of candidate data records using the initial weights per attribute;

    determining a candidate matched set with results from the comparing step;

    generating true discrepancy probabilities with scoring information from the candidate matched set;

    calculating new weights per attribute based upon the unmatched probabilities and the true discrepancy probabilities to adjust performance of the association of data records; and

    testing for weight convergence and using the new weights if a difference between the current weights and the new weights is larger than a predetermined amount.
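The loop recited in claim 1 can be illustrated with a minimal sketch. The claim does not give formulas, so everything concrete here is an assumption: the log-ratio weights follow the common Fellegi-Sunter convention, attributes are compared by exact equality, unmatched probabilities are estimated as the agreement rate over all pairs, and the names `generate_weights`, `data_quality`, `threshold`, `tol`, and `max_iter` are hypothetical. The sketch also re-scores pairs with the most recently accepted weights on each pass, which is one reading of the iteration.

```python
import math
from itertools import combinations

EPS = 1e-3  # assumed floor/ceiling that keeps probabilities away from 0 and 1


def clamp(p):
    return min(max(p, EPS), 1 - EPS)


def unmatched_probabilities(pairs, attributes):
    # Assumption: most pairs are non-matches, so the agreement rate over all
    # pairs approximates the chance that two unrelated records agree.
    return {a: clamp(sum(r[a] == s[a] for r, s in pairs) / len(pairs))
            for a in attributes}


def weights(u, d):
    # Assumed Fellegi-Sunter-style weights: (agreement, disagreement) per
    # attribute, from unmatched probability u and discrepancy probability d.
    return {a: (math.log10((1 - d[a]) / u[a]), math.log10(d[a] / (1 - u[a])))
            for a in u}


def score(r, s, w):
    # Sum the agreement or disagreement weight of each attribute.
    return sum(w[a][0] if r[a] == s[a] else w[a][1] for a in w)


def generate_weights(records, attributes, data_quality=0.05,
                     threshold=0.0, tol=1e-3, max_iter=20):
    pairs = list(combinations(records, 2))  # needs at least two records
    u = unmatched_probabilities(pairs, attributes)
    # Default discrepancy probabilities from a single data quality parameter.
    current = weights(u, {a: data_quality for a in attributes})
    for _ in range(max_iter):
        # Compare every pair and keep those scoring above the threshold.
        matched = [(r, s) for r, s in pairs if score(r, s, current) > threshold]
        if not matched:
            break
        # "True" discrepancy probabilities observed in the candidate matched set.
        d = {a: clamp(sum(r[a] != s[a] for r, s in matched) / len(matched))
             for a in attributes}
        new = weights(u, d)
        delta = max(abs(new[a][i] - current[a][i])
                    for a in attributes for i in (0, 1))
        if delta <= tol:
            break  # converged: keep the current weights
        current = new  # difference exceeds tol: adopt new weights and repeat
    return current
```

Called as `generate_weights(records, ["name", "dob", "zip"])` on a list of dicts, this returns an `(agreement, disagreement)` weight pair per attribute. A production matcher would typically replace exact equality with fuzzy comparison functions and restrict comparisons to bucketed candidates instead of all pairs.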
