Method and System for Filtering False Positives
Abstract
Embodiments of systems and methods for reducing false positives during the linking of data records are disclosed herein. Broadly speaking, embodiments of the present invention may be used in the generation of an overall weight from the comparison of various attributes of data records, where the linking of the data records is dependent on the overall weight. More specifically, embodiments of the present invention may calculate a false positive penalty based on a set of results, each of the set of results based on a comparison of an attribute. The false positive penalty may be subtracted from the overall weight generated from the comparison of the attributes of data records to adjust the overall weight. By configuring which attributes of the data records are used as the set of attributes for generating the false positive penalty, and the penalties associated with a particular combination of results for the comparisons of these attributes, the incidence of false positives in the linking of data records may be significantly reduced.
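The adjustment described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the attribute names, per-attribute weights, penalty table, link threshold, and all function names below are hypothetical values chosen for illustration. The sketch compares a configured set of attributes, sums the per-attribute weights into an overall weight, looks up a penalty for the particular combination of results on the penalty attributes, and subtracts that penalty before deciding whether to link.

```python
from typing import Dict, Optional, Tuple

Record = Dict[str, Optional[str]]

# Hypothetical weights for each comparison result of each attribute.
ATTRIBUTE_WEIGHTS: Dict[str, Dict[str, float]] = {
    "name": {"match": 4.0, "mismatch": -2.0, "missing": 0.0},
    "birth_date": {"match": 3.0, "mismatch": -3.0, "missing": 0.0},
    "address": {"match": 2.0, "mismatch": -1.0, "missing": 0.0},
    "phone": {"match": 2.0, "mismatch": -1.0, "missing": 0.0},
}

# Hypothetical configuration: which attributes feed the penalty, and the
# penalty for each combination of results (combinations not listed cost 0).
PENALTY_ATTRIBUTES: Tuple[str, ...] = ("name", "birth_date")
PENALTY_TABLE: Dict[Tuple[str, ...], float] = {
    ("mismatch", "match"): 1.0,
    ("match", "mismatch"): 2.0,
    ("mismatch", "mismatch"): 5.0,
}

LINK_THRESHOLD = 5.0  # assumed threshold on the adjusted overall weight


def compare_attribute(a: Optional[str], b: Optional[str]) -> str:
    """Return a coarse comparison result for one attribute."""
    if a is None or b is None:
        return "missing"
    return "match" if a.strip().lower() == b.strip().lower() else "mismatch"


def compare_records(first: Record, second: Record) -> Dict[str, str]:
    """Compare every configured attribute of the two records."""
    return {
        attr: compare_attribute(first.get(attr), second.get(attr))
        for attr in ATTRIBUTE_WEIGHTS
    }


def overall_weight(results: Dict[str, str]) -> float:
    """Sum the weight associated with each attribute's comparison result."""
    return sum(ATTRIBUTE_WEIGHTS[attr][res] for attr, res in results.items())


def false_positive_penalty(results: Dict[str, str]) -> float:
    """Look up the penalty for the combination of penalty-attribute results."""
    key = tuple(results[attr] for attr in PENALTY_ATTRIBUTES)
    return PENALTY_TABLE.get(key, 0.0)


def should_link(first: Record, second: Record) -> Tuple[bool, float, float, float]:
    """Return (link decision, overall weight, penalty, adjusted weight)."""
    results = compare_records(first, second)
    weight = overall_weight(results)
    penalty = false_positive_penalty(results)
    adjusted = weight - penalty
    return adjusted >= LINK_THRESHOLD, weight, penalty, adjusted
```

With these assumed values, two records that share a name, address, and phone number but have different birth dates (for example, a parent and child with the same name) score an overall weight of 5.0, which would just meet the threshold. The penalty of 2.0 for the (name match, birth date mismatch) combination lowers the adjusted weight to 3.0, so the records are not linked.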
19 Claims
1. (canceled)
2. A method for association of data records, comprising:
- providing a system comprising an identity hub running an identity hub engine, the identity hub coupled to one or more external data sources through one or more networks, each external data source at a corresponding database;
- receiving a first data record or a second data record from the one or more external data sources at the identity hub;
- obtaining a first set of results, wherein each of the first set of results is a value generated based on a comparison between one of a first set of attributes from the first data record and the second data record, wherein the comparison between each of the first set of attributes is performed by the identity hub engine;
- determining a first overall weight for a comparison between the first data record and the second data record using the first set of results;
- generating a first false positive penalty based on the first set of results wherein the first false positive penalty is associated with the comparison of the first data record and the second record;
- adjusting the first overall weight to reduce the likelihood of the incorrect linking of the first data record and second data record; and
- determining whether the first data record and the second data record should be linked based on the adjusted first overall weight.
Dependent claims: 3, 4, 5, 6, 7
8. A computer readable storage medium, comprising instructions translatable for implementing an identity hub engine on an identity hub, the identity hub coupled to one or more external data sources through one or more networks, each external data source at a corresponding database, the identity hub engine operable for:
- receiving a first data record or a second data record from the one or more external data sources at the identity hub;
- obtaining a first set of results, wherein each of the first set of results is a value generated based on a comparison between one of a first set of attributes from the first data record and the second data record, wherein the comparison between each of the first set of attributes is performed by the identity hub engine;
- determining a first overall weight for a comparison between the first data record and the second data record using the first set of results;
- generating a first false positive penalty based on the first set of results wherein the first false positive penalty is associated with the comparison of the first data record and the second record;
- adjusting the first overall weight to reduce the likelihood of the incorrect linking of the first data record and second data record; and
- determining whether the first data record and the second data record should be linked based on the adjusted first overall weight.
Dependent claims: 9, 10, 11, 12, 13
14. A system for the linking of data records, comprising:
- one or more data sources, each data source at a corresponding database; and
- an identity hub linked to the one or more data sources through one or more networks, wherein the identity hub comprises a computer readable medium including instructions operable for implementing an identity hub engine for:
  - receiving a first data record or a second data record from the one or more external data sources at the identity hub;
  - obtaining a first set of results, wherein each of the first set of results is a value generated based on a comparison between one of a first set of attributes from the first data record and the second data record, wherein the comparison between each of the first set of attributes is performed by the identity hub engine;
  - determining a first overall weight for a comparison between the first data record and the second data record using the first set of results;
  - generating a first false positive penalty based on the first set of results wherein the first false positive penalty is associated with the comparison of the first data record and the second record;
  - adjusting the first overall weight to reduce the likelihood of the incorrect linking of the first data record and second data record; and
  - determining whether the first data record and the second data record should be linked based on the adjusted first overall weight.
Dependent claims: 15, 16, 17, 18, 19
Specification