Configurable search tool for finding and scoring non-exact matches in a relational database
First Claim
1. A computer-implemented method of searching for exact and non-exact matches in a first database, comprising the steps of:
- transforming and standardizing values of selected attributes stored in the first database according to selected first transformation functions while maintaining the first database unchanged;
storing the transformed and standardized values of the selected attributes in a second database that is separate from the first database;
providing an input record against which to compare contents of the second database;
transforming and standardizing values of selected attributes of the provided input record according to selected second transformation functions;
searching only the second database for at least one of exact and non-exact matches to the transformed and standardized values of the selected attributes of the input record according to selected match rules, andassigning a score to each of the at least one exact and non-exact matches, the score assigning step including assigning a percentage weight to each of the first transformation functions.
4 Assignments
0 Petitions
Accused Products
Abstract
A computer-implemented method of searching for exact and non-exact matches in a first database includes steps of transforming and standardizing values of selected attributes stored in the first database according to selected first transformation functions while maintaining the first database unchanged; storing the transformed and standardized values of the selected attributes in a second database that is separate from the first database; providing an input record against which to compare contents of the second database; transforming and standardizing values of selected attributes of the provided input record according to selected second transformation functions, and searching only the second database for at least one of exact and non-exact matches to the transformed and standardized values of the selected attributes of the input record according to selected match rules.
-
Citations
26 Claims
-
1. A computer-implemented method of searching for exact and non-exact matches in a first database, comprising the steps of:
-
transforming and standardizing values of selected attributes stored in the first database according to selected first transformation functions while maintaining the first database unchanged; storing the transformed and standardized values of the selected attributes in a second database that is separate from the first database; providing an input record against which to compare contents of the second database; transforming and standardizing values of selected attributes of the provided input record according to selected second transformation functions; searching only the second database for at least one of exact and non-exact matches to the transformed and standardized values of the selected attributes of the input record according to selected match rules, and assigning a score to each of the at least one exact and non-exact matches, the score assigning step including assigning a percentage weight to each of the first transformation functions. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A computer-implemented method of maintaining records in a first database, the first database storing a plurality of attributes, the method comprising:
-
a first transforming and standardizing step to transform and standardize values of selected attributes stored in the first database according to first transformation and standardization functions while maintaining the first database unchanged; a storing step to store the transformed and standardized attribute values in a second database that is separate from the first database; a providing step to provide an input record against which to compare contents of the second database; a second transforming and standardizing step to transform and standardize values of the attributes of the provided input record according to second transformation and standardization functions; an applying step to apply match rules to compare the standardized attribute values of the provided input record to the standardized attribute values stored in the second database, and to identify a set of potential duplicate records based upon results of the application of the match rules; scoring the set of potential duplicate records, the scoring step including a step of assigning a percentage weight to each of the first transformation functions, and carrying out one of; merging the provided input record with the at least one record of the set of potential duplicate records; and preventing the provided input record from being incorporated in the first database. - View Dependent Claims (16, 17)
-
-
18. A database system, comprising:
-
a first database that stores attributes and corresponding values of the attributes; first means for reading, transforming and standardizing the attribute values stored in the first database according to selected first transformation functions such that an effect of typographical errors and variations in the stored attribute values is minimized, the first means maintaining the first database unchanged; a second database coupled to the first means, the second database storing the transformed and standardized attribute values, and second means for searching only the second database for exact and non-exact matches to an input record or for duplicates according to selected match rules, wherein the selected match rules include scoring criteria and wherein a score is assigned to each of the at least one exact and non-exact matches according to the scoring criteria, and wherein the second means is further configured to assign a percentage weight to each of the first transformation functions. - View Dependent Claims (19, 20, 21, 22, 23, 24, 25, 26)
-
Specification