×

Bulk deduplication detection

  • US 10,152,497 B2
  • Filed: 02/24/2016
  • Issued: 12/11/2018
  • Est. Priority Date: 02/24/2016
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method comprising:

  • generating, by a database system, a first cluster of records from a group of records;

    generating, by the database system, a second cluster of records from the group of records;

    causing, by the database system, sets of duplicate records in the first cluster of records to be identified;

    causing, by the database system, sets of duplicate records in the second cluster of records to be identified;

    merging, by the database system, at least two sets of duplicate records associated with both the first cluster and the second cluster of records to form a merged set of duplicate records, wherein a set of duplicate records is implemented using a linked list having a head node and a body node for each record in the set of duplicate records and wherein the merging is performed based on the at least two sets of duplicate records having a common record and comprises merging a linked list associated with each set of duplicate records; and

    removing, by the database system, one or more duplicate records from the merged set of duplicate records.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×