Fuzzy Database Matching
First Claim
1. A method of identifying possible matches between a sample record and a plurality of stored records, the method comprising:
- extracting from each of the stored records a plurality of index characteristics, said index characteristics falling within an index characteristic space;
maintaining a look-up table defining said index characteristic space, said look-up table having a plurality of rows, each row corresponding to a unique index characteristic within said index characteristic space;
maintaining a plurality of record occurrence lists, each said list being linked from a specific row in said look-up table corresponding to a specific index characteristic, and each said list identifying those stored records from which said specific index characteristic and index characteristics within a defined proximity to said specific index characteristics within said index characteristic space have been extracted;
extracting sample index characteristics from a sample record;
using said sample index characteristics as indexes to address said look-up table to look up a corresponding plurality of record occurrence lists which are associated with said sample index characteristics;
counting the number of occurrences of respective stored records identified within said record occurrence lists; and
identifying a given stored record as being a possible match with the sample if said count for said given stored record exceeds a required threshold.
0 Assignments
0 Petitions
Accused Products
Abstract
A method of improving the speed with which a sample such as a biometric sample can be fuzzily matched against records in a database, comprises extracting characteristics from the sample, and using those extracted characteristics as indexes (70) to address a lookup table (25). Each row within the lookup table points to an individual record occurrence list (28, 30, 32) which contain details of not only the stored records from which the given characteristic can be extracted, but also those records having an extracted characteristic which are within a defined proximity to the said characteristic. Characteristics are extracted from the sample record, and a given stored record is identified as being a possible match with the sample if it appears in a required number of record occurrence lists.
-
Citations
18 Claims
-
1. A method of identifying possible matches between a sample record and a plurality of stored records, the method comprising:
-
extracting from each of the stored records a plurality of index characteristics, said index characteristics falling within an index characteristic space; maintaining a look-up table defining said index characteristic space, said look-up table having a plurality of rows, each row corresponding to a unique index characteristic within said index characteristic space; maintaining a plurality of record occurrence lists, each said list being linked from a specific row in said look-up table corresponding to a specific index characteristic, and each said list identifying those stored records from which said specific index characteristic and index characteristics within a defined proximity to said specific index characteristics within said index characteristic space have been extracted; extracting sample index characteristics from a sample record; using said sample index characteristics as indexes to address said look-up table to look up a corresponding plurality of record occurrence lists which are associated with said sample index characteristics; counting the number of occurrences of respective stored records identified within said record occurrence lists; and identifying a given stored record as being a possible match with the sample if said count for said given stored record exceeds a required threshold. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A system for identifying possible matches between a sample record and a plurality of stored records, the system comprising:
-
a computer processor coupled to a database containing a plurality of index characteristics extracted from said stored records, said index characteristics falling within an index characteristic space; a look-up table defining said characteristic space, said look-up table having a plurality of rows, each row corresponding to a unique index characteristic within said index characteristic space; a plurality of record occurrence lists, each said list being linked from a specific row in said look-up table corresponding to a specific index characteristic, and each said list identifying those stored records from which said specific index characteristic and index characteristics within a defined proximity to said specific index characteristics within said index characteristic space have been extracted; and whereby the system is configured to; extract sample index characteristics from a sample record, and use said sample index characteristics as indexes to address said look-up table to look up a corresponding plurality of record occurrence lists which are associated with said sample index characteristics; count the number of occurrences of respective stored records identified by said record occurrence lists; and identify a given stored record as being a possible match with the sample record if said count for said given stored record exceeds a required threshold. - View Dependent Claims (16, 17, 18)
-
Specification