×

Methods and systems for implementing approximate string matching within a database

  • US 8,219,550 B2
  • Filed: 03/04/2011
  • Issued: 07/10/2012
  • Est. Priority Date: 12/31/2007
  • Status: Active Grant
First Claim
Patent Images

1. A computer-based method for character string matching of a candidate character string with a plurality of character string records stored within a database, said method comprising:

  • selecting one or more of the character strings in the plurality of character string records to form a set of reference character strings using a principal components factor analysis (PCFA) to identify a set of dissimilar reference character strings in the plurality of character string records;

    generating a binary index key for each of the character strings in the set of character strings, the binary index key comprising a plurality of bits of binary information, each bit indicating a degree of matching of the character string to the set of reference character strings;

    generating a binary index key for the candidate character string;

    determining a set of character string records stored within the database that include a binary index key that exactly matches the binary index key of the candidate character string;

    from the determined set of character string records stored within the database that include a binary index key that exactly matches the binary index key of the candidate character string, locating each character string record whose selected character string matches the respective character string of the candidate string record; and

    indexing the candidate character string record within the database based on the matching.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×