×

Robust detector of fuzzy duplicates

  • US 20060053129A1
  • Filed: 08/30/2004
  • Published: 03/09/2006
  • Est. Priority Date: 08/30/2004
  • Status: Active Grant
First Claim
Patent Images

1. One or more processor-readable program media having processor-executable instructions that, when executed by a processor, perform acts comprising:

  • obtaining a dataset comprising multiple tuples from a database;

    for each of the multiple tuples of the dataset, computing one or more nearest neighbor tuples in the dataset;

    defining multiple disjoint partitions of multiple tuples, wherein tuples in each partition comprise fuzzy duplicates of one another, such that each fuzzy duplicate tuple in a partition represents a common real world entity or phenomenon.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×