Method and system for approximate matching of data records
Abstract
The present invention is directed to a method of matching a data record with a reference record. The method comprises receiving the data record; searching stored reference records for a reference record that is an exact match for the received data record; and if an exact matching reference record is not found, calculating a distance between the received data record and at least one stored reference record, and determining the at least one stored reference record to be an approximate match if the calculated distance is less than a pre-determined threshold.
The inventive method and system can be used in a number of applications, including invoice processing in an Accounts Payable system, record reconciliation in a securities trading system, and finding preferred bonds in a bond trading system.
Approximately matching records are ranked, and the closest (highest-ranking) match may be processed automatically if it is within a pre-determined threshold.
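The flow described in the abstract (exact match first, then distance-based approximate matching with ranking) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the names `match_record` and `mismatch_count`, the tuple representation of a record, and the toy distance function are all hypothetical and were introduced here for the example.

```python
from typing import Callable, Optional, Sequence

Record = tuple  # hypothetical representation: a tuple of field values


def match_record(
    incoming: Record,
    references: Sequence[Record],
    distance: Callable[[Record, Record], float],
    threshold: float,
) -> tuple[Optional[Record], list[tuple[float, Record]]]:
    """Return (best_match, ranked_candidates) for an incoming record."""
    # Step 1: look for an exact match among the stored reference records.
    for ref in references:
        if ref == incoming:
            return ref, [(0.0, ref)]

    # Step 2: no exact match, so score every reference record by distance
    # and keep those strictly below the pre-determined threshold.
    scored = [(distance(incoming, ref), ref) for ref in references]
    ranked = sorted((s for s in scored if s[0] < threshold), key=lambda s: s[0])

    # Step 3: the closest candidate, if any, is the highest-ranking match.
    best = ranked[0][1] if ranked else None
    return best, ranked


def mismatch_count(a: Record, b: Record) -> float:
    """Toy distance (an assumption): number of field positions that differ."""
    return float(sum(x != y for x, y in zip(a, b)) + abs(len(a) - len(b)))
```

A caller might then process `best` automatically when it is not `None` and route the remaining ranked candidates to manual review; that policy is also an assumption, since the abstract only says the closest match "may be" processed automatically.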
43 Claims
1. A method of matching a data record with a reference record in a system, comprising:
(A) receiving the data record;
(B) calculating a distance between the received data record and a reference record; and
(C) determining that the reference record approximately matches the received data record if the calculated distance is less than a pre-determined threshold.
Dependent claims: 2-12
13. A computer-readable medium containing instructions that cause a computer to process a data record in a matching system, said medium comprising:
a storage section for storing a plurality of data records;
a first code section containing instructions that cause the computer to receive the data record;
a second code section containing instructions that cause the computer to calculate a distance between the received data record and a reference record; and
a third code section containing instructions that cause the computer to determine that the reference record approximately matches the received data record if the calculated distance is less than a pre-determined threshold.
Dependent claims: 14-24
25. A method of determining a record distance between a reference record and an incoming record, wherein the reference record has at least one field and the incoming record has at least one field, comprising:
(A) determining correspondence between at least one field in the reference record and at least one field in the incoming data record;
(B) for each corresponding pair of fields determined in step (A), calculating a field distance; and
(C) calculating said record distance by aggregating said field distances.
Dependent claims: 26-35
36. A computer-readable medium for determining a record distance between a reference record and an incoming record, wherein the reference record has at least one field and the incoming record has at least one field, said medium comprising:
a first code section containing instructions that cause the computer to determine correspondence between at least one field in the reference record and at least one field in the incoming data record;
a second code section containing instructions that cause the computer to calculate a field distance for each corresponding pair of fields; and
a third code section containing instructions that cause the computer to calculate said record distance by aggregating said field distances.
Dependent claims: 37-43
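The three steps recited in claims 25 and 36 (field correspondence, per-field distance, aggregation) might be sketched as below. The claims shown here do not specify a particular field distance or aggregation rule, so this example assumes a normalized Levenshtein edit distance per field and a weighted sum as the aggregate. The names `edit_distance`, `field_distance`, `record_distance`, the dictionary record layout, and the `correspondence` and `weights` parameters are hypothetical.

```python
from typing import Mapping, Optional


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance via the standard two-row dynamic program."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution
        prev = curr
    return prev[-1]


def field_distance(a: str, b: str) -> float:
    """Edit distance scaled to [0, 1]; 0 means the values are identical."""
    longest = max(len(a), len(b))
    return edit_distance(a, b) / longest if longest else 0.0


def record_distance(
    reference: Mapping[str, str],
    incoming: Mapping[str, str],
    correspondence: Mapping[str, str],
    weights: Optional[Mapping[str, float]] = None,
) -> float:
    """Aggregate field distances into a record distance (weighted sum, assumed)."""
    total = 0.0
    # correspondence maps a reference field name to an incoming field name.
    for ref_field, inc_field in correspondence.items():
        w = 1.0 if weights is None else weights.get(ref_field, 1.0)
        total += w * field_distance(reference[ref_field], incoming[inc_field])
    return total
```

The resulting record distance could then be compared against the pre-determined threshold of claim 1. Other aggregations (maximum, weighted average) would fit the claim language equally well; the weighted sum is only one plausible choice.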
Specification