Text joins for data cleansing and integration in a relational database management system

  • US 20050027717A1
  • Filed: 04/21/2004
  • Published: 02/03/2005
  • Est. Priority Date: 04/21/2003
  • Status: Abandoned Application
First Claim

1. A system for string matching across multiple relations in a relational database management system comprising:

  • generating a set of strings from a set of characters,
  • decomposing each string into a subset of tokens,
  • establishing at least two relations within said strings,
  • establishing a similarity threshold for said relations,
  • sampling said at least two relations,
  • correlating said relations for said similarity threshold and
  • returning all of said tokens which meet the criteria of said similarity threshold.
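
The steps recited in the claim (decompose strings into tokens, set a similarity threshold, correlate two relations, return what clears the threshold) can be illustrated with a rough sketch. This is not the applicants' method: the listing does not say how tokens are formed or how similarity is measured, so the sketch assumes overlapping character q-grams as tokens and Jaccard overlap as the similarity measure. It also skips the sampling step and compares every pair directly. The names `qgrams`, `jaccard`, and `text_join` are hypothetical.

```python
def qgrams(text, q=3):
    """Decompose a string into its set of overlapping q-character tokens.

    Assumption: tokens are q-grams; the string is lowercased and padded
    with '#' and '$' so that prefixes and suffixes produce tokens too.
    """
    padded = "#" * (q - 1) + text.lower() + "$" * (q - 1)
    return {padded[i:i + q] for i in range(len(padded) - q + 1)}


def jaccard(a, b):
    """Jaccard similarity of two token sets (an assumed similarity measure)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def text_join(relation1, relation2, threshold, q=3):
    """Return (s1, s2, similarity) for every pair meeting the threshold.

    Naive nested-loop comparison; the sampling step named in the claim
    is deliberately omitted from this sketch.
    """
    tokens2 = [(s2, qgrams(s2, q)) for s2 in relation2]
    matches = []
    for s1 in relation1:
        t1 = qgrams(s1, q)
        for s2, t2 in tokens2:
            sim = jaccard(t1, t2)
            if sim >= threshold:
                matches.append((s1, s2, sim))
    return matches


if __name__ == "__main__":
    r1 = ["Microsoft Corp"]
    r2 = ["Microsoft Corp.", "Oracle"]
    for s1, s2, sim in text_join(r1, r2, threshold=0.5):
        print(f"{s1!r} ~ {s2!r} (similarity {sim:.3f})")
```

In this example only the pair of near-identical company names clears the 0.5 threshold; the unrelated string shares no tokens with it and is dropped. Inside an actual RDBMS the same idea would more plausibly be expressed as a join over a table of (string id, token) rows rather than a Python loop.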
