×

Method and system for finding similar records in mixed free-text and structured data

  • US 20020152208A1
  • Filed: 03/06/2002
  • Published: 10/17/2002
  • Est. Priority Date: 03/07/2001
  • Status: Active Grant
First Claim
Patent Images

1. A method for determining whether records are similar in a database containing both structured and unstructured, free-text data, the method comprising the steps of:

  • accessing two of the records from the database for evaluation; and

    evaluating a match between the two records as a weighted match between each of a plurality of available fields, such that a matching process is selected as appropriate from among a group of matching processes including strict Boolean, ordinal, and vector-based matching processes, wherein;

    when a strict Boolean matching process is selected, applying a match function as an exact match test;

    when an ordinal matching process is selected, applying a match ffunction that makes use of information concerning the size and ordering of the data domain; and

    when a vector-based matching process is selected applying a match function that uses a vector space frequency test.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×