Matching new customer records to existing customer records in a large business database using hash key
First Claim
1. The method for matching a new data set containing a record and a collection of fields to a database containing a plurality of records each having a collection of fields, the method comprising the steps of:
- reading the new data set;
validating each of the fields from the record in the new data set;
normalizing the validated fields in the record in the new data set into a standard form;
selecting a hash key for generating a candidate set of records from the database that likely matches the record from the new data set;
applying the hash key to the plurality of records in the database to generate the candidate set of records;
matching the record from the new data set to each of the records in the candidate set; and
updating the plurality of records in the database according to the match of the record from the new data set to the records in the candidate set.
1 Assignment
0 Petitions
Accused Products
Abstract
In this invention there is a method and system for matching new customer records to existing customer records in a database. The new customer records are validated for quality and normalized into a standard form. A hash key is selected to generate a candidate set of records from the existing records in the database that likely matches the new customer records. The new customer records are then matched to each of the records in the candidate set. Once the matching has been performed, a decision is made on whether to create a new customer record, update an existing record, or save the new record in a pending file for resolution at a later time. In another embodiment, there is a methodology for learning matching rules for matching records in a database. The matching rules are then used for matching a new customer record to existing records in a database.
-
Citations
18 Claims
-
1. The method for matching a new data set containing a record and a collection of fields to a database containing a plurality of records each having a collection of fields, the method comprising the steps of:
-
reading the new data set; validating each of the fields from the record in the new data set; normalizing the validated fields in the record in the new data set into a standard form; selecting a hash key for generating a candidate set of records from the database that likely matches the record from the new data set; applying the hash key to the plurality of records in the database to generate the candidate set of records; matching the record from the new data set to each of the records in the candidate set; and updating the plurality of records in the database according to the match of the record from the new data set to the records in the candidate set. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A system for matching a new data set containing a record and a collection of fields to a database containing a plurality of records each having a collection of fields, the system comprising:
-
means for reading the new data set; means for validating each of the fields from the record in the new data set; means for normalizing the validated fields in the record in the new data set into a standard form; means for selecting a hash key for generating a candidate set of records from the database that likely matches the record from the new data set; means for applying the hash key to the plurality of records in the database to generate the candidate set of records; means for matching the record from the new data set to each of the records in the candidate set; and means for updating the plurality of records in the database according to the match of the record from the new data set to the records in the candidate set. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
Specification