Efficient fuzzy match for evaluating data records

  • US 20040260694A1
  • Filed: 06/20/2003
  • Published: 12/23/2004
  • Est. Priority Date: 06/20/2003
  • Status: Active Grant
First Claim

1. A process for testing an evaluation data record having attribute fields containing data comprising:

  • providing a reference table having a number of reference records against which an evaluation data record is tested;

  • identifying reference table tokens contained within the reference records of the reference table and determining a count of tokens in the reference table classified according to attribute field; and

  • assigning a similarity score to said evaluation data record in relation to a reference record within the reference table based on a combination of:

    the number of common tokens of an evaluation field of the input data record and a corresponding field within a reference record from the reference table;

    the similarity of the tokens that are not the same in the evaluation field of the input data record and the corresponding field of the reference record from the reference table; and

    a weight of the tokens of the evaluation data record that is based on a count of the tokens from a corresponding field contained within the reference table.
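
The three ingredients of the claimed score (common tokens, similarity of the non-identical tokens, and count-based token weights) can be illustrated with a minimal Python sketch. It is not the patent's disclosed method: the whitespace tokenizer, the IDF-style weight formula, the use of `difflib.SequenceMatcher` for token similarity, the averaging across fields, and all function names and sample records are assumptions chosen for illustration.

```python
import math
from collections import Counter, defaultdict
from difflib import SequenceMatcher


def tokenize(value):
    # Assumption: tokens are lower-cased, whitespace-separated words.
    return value.lower().split()


def token_frequencies(reference_table):
    """Per attribute field, count how many reference records contain each token."""
    counts = defaultdict(Counter)
    for record in reference_table:
        for field, value in record.items():
            counts[field].update(set(tokenize(value)))
    return counts


def token_weight(token, field, counts, n_records):
    # Assumption: an IDF-style weight, so tokens that are rare in this
    # field of the reference table count for more than frequent ones.
    freq = counts[field].get(token, 0)
    return math.log((1 + n_records) / (1 + freq)) + 1.0


def field_similarity(input_value, ref_value, field, counts, n_records):
    """Weighted share of the input field's tokens matched in the reference field."""
    in_tokens = tokenize(input_value)
    ref_tokens = tokenize(ref_value)
    if not in_tokens:
        return 0.0
    total = 0.0
    matched = 0.0
    for tok in in_tokens:
        w = token_weight(tok, field, counts, n_records)
        total += w
        if tok in ref_tokens:
            # Common token: full credit.
            matched += w
        elif ref_tokens:
            # Non-identical token: partial credit from its closest reference token.
            best = max(SequenceMatcher(None, tok, r).ratio() for r in ref_tokens)
            matched += w * best
    return matched / total


def record_similarity(input_record, ref_record, counts, n_records):
    """Average the field similarities over the fields both records share."""
    fields = [f for f in input_record if f in ref_record]
    if not fields:
        return 0.0
    return sum(
        field_similarity(input_record[f], ref_record[f], f, counts, n_records)
        for f in fields
    ) / len(fields)


# Hypothetical sample data, for illustration only.
reference_table = [
    {"name": "Boeing Company", "city": "Seattle"},
    {"name": "Bon Corporation", "city": "Seattle"},
    {"name": "Acme Company", "city": "Tacoma"},
]
counts = token_frequencies(reference_table)
n_records = len(reference_table)

input_record = {"name": "Beoing Company", "city": "Seattle"}
scores = [
    record_similarity(input_record, ref, counts, n_records)
    for ref in reference_table
]
best_index = max(range(n_records), key=lambda i: scores[i])
print(reference_table[best_index], round(scores[best_index], 3))
```

In this sketch the misspelled token "beoing" still earns partial credit against "boeing", while the more frequent token "company" receives a lower weight than the rarer "boeing", so a match on a distinctive token matters more than a match on a common one.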
