×

System and method for automatic weight generation for probabilistic matching

  • US 20080005106A1
  • Filed: 06/01/2007
  • Published: 01/03/2008
  • Est. Priority Date: 06/02/2006
  • Status: Active Grant
First Claim
Patent Images

1. A method of automatically generating weights for associating a plurality of data records, comprising:

  • generating unmatched probabilities for a set of candidate data records, wherein the unmatched probabilities are computed per attribute for each pair of data records in the set of candidate data records;

    determining default discrepancy probabilities per attribute for each pair of data records in the set of candidate data records based upon a data quality parameter;

    calculating initial weights per attribute based upon the unmatched probabilities and the default discrepancy probabilities; and

    iterating a process comprising the steps of;

    comparing each pair of data records in the set of candidate data records using the initial weights;

    determining a candidate matched set with results from the comparing step;

    generating true discrepancy probabilities with scoring information from the candidate matched set;

    calculating new weights based upon the unmatched probabilities and the true discrepancy probabilities; and

    testing for weight convergence.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×