×

STATISTICAL MEASURE AND CALIBRATION OF INTERNALLY INCONSISTENT SEARCH CRITERIA WHERE ONE OR BOTH OF THE SEARCH CRITERIA AND DATABASE IS INCOMPLETE

  • US 20100005057A1
  • Filed: 07/02/2009
  • Published: 01/07/2010
  • Est. Priority Date: 07/02/2008
  • Status: Active Grant
First Claim
Patent Images

1. A method of identifying, using an internally inconsistent search criteria, an entity representation in an electronic universal database that corresponds to an entity representation in an electronic foreign database, each database comprising a plurality of entity representations, each entity representation comprising a plurality of linked records, each record comprising a plurality of fields, each field capable of containing a field value, each field value associated with a field value weight, the method comprising:

  • receiving a plurality of search criteria field values, each search criteria field value associated with a field, wherein at least two search criteria field values are associated with a same field, wherein the at least two search criteria field values are not identical;

    receiving at least one match template specifying an ordered plurality of fields;

    forming and electronically storing, for each match template, a table comprising field value weights for a plurality of matches between a search criteria field value and a field value appearing in a record in the universal database, the field value weights in each table arranged according to entity representation, wherein the at least two search criteria field values match field values in two records in a same entity representation, and wherein at least one table comprises a sum of at least a portion of field value weights for the at least two search criteria field values that match field values in records in the same entity representation;

    merging the tables according to entity representation, resulting in a merged table;

    summing field value weights according to entity representation in the merged table, resulting in a plurality of summed weights, one summed weight for each entity representation;

    ranking entity representations in the merged table according to the plurality of summed weights;

    determining a highest ranked entity representation;

    calculating a confidence level reflecting a likelihood that the highest ranked entity representation corresponds to the plurality of query field values; and

    outputting, if the confidence level exceeds a predetermined threshold, an identifier for the highest ranked entity representation.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×