SYSTEMS, METHODS, AND SOFTWARE FOR ENTITY RELATIONSHIP RESOLUTION
First Claim
1. A system comprising:
means, responsive to one or more data fields in a public record, for retrieving a set of candidate named entity records from a master named entity database based on one of a set of two or more blocking queries;
matching means for calculating similarity scores for one or more of the data fields in the public record and data fields in the candidate named entity records; and
means for determining a confidence rating for one or more of the set of similarity scores between the public record and the candidate public record into a confidence rating.
Abstract
To facilitate access to public records, the present inventors devised, among other things, an entity resolution system. The exemplary system includes a master records database of 300 million entities, which is partitioned into multiple distinct portions. The exemplary system extracts entity information from input public records and constructs one or more blocking queries against specific portions of the master records database to identify one or more sets of candidate records. Feature vectors are defined for the candidate records, and machine learning techniques, such as support vector machines, are used to determine which of the candidate records from the master records database match the input public records. Candidate records that match are logically associated with the public records, enabling ready access via direct or indirect queries.
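The flow described in the abstract (blocking queries against one portion of the master database, per-field similarity features, then a match decision) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the record layout, the two blocking rules, the `difflib`-based similarity, and the hand-set linear weights and threshold are all hypothetical. In particular, the fixed weights stand in for a trained classifier such as the support vector machine the abstract mentions.

```python
from difflib import SequenceMatcher

# Hypothetical master records, partitioned by entity type (assumed layout).
MASTER = {
    "person": [
        {"id": 1, "name": "John A. Smith", "city": "Denver", "state": "CO"},
        {"id": 2, "name": "Jon Smith", "city": "Boulder", "state": "CO"},
        {"id": 3, "name": "Jane Smythe", "city": "Austin", "state": "TX"},
    ],
}


def last_name(name):
    return name.split()[-1].lower()


def same_last_name(record, cand):
    # Blocking rule 1 (assumed): exact match on the last name token.
    return last_name(cand["name"]) == last_name(record["name"])


def same_state_and_initial(record, cand):
    # Blocking rule 2 (assumed): same state and same first initial.
    return (cand["state"] == record["state"]
            and cand["name"][0].lower() == record["name"][0].lower())


BLOCKING_QUERIES = [same_last_name, same_state_and_initial]


def retrieve_candidates(record, partition):
    """Union of the candidates returned by each blocking query on one partition."""
    found = {}
    for query in BLOCKING_QUERIES:
        for cand in MASTER[partition]:
            if query(record, cand):
                found[cand["id"]] = cand
    return list(found.values())


def similarity(a, b):
    """Case-insensitive string similarity in [0, 1]."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def feature_vector(record, cand):
    """One similarity score per compared field."""
    return [similarity(record[f], cand[f]) for f in ("name", "city", "state")]


# Hand-set weights and threshold: placeholders for a trained model (assumption).
WEIGHTS = [0.6, 0.3, 0.1]
THRESHOLD = 0.85


def confidence(features):
    """Combine the similarity scores into a single confidence rating."""
    return sum(w * x for w, x in zip(WEIGHTS, features))


def resolve(record, partition="person"):
    """Return (confidence, master id) pairs for candidates judged to match."""
    scored = [(confidence(feature_vector(record, c)), c["id"])
              for c in retrieve_candidates(record, partition)]
    return sorted((p for p in scored if p[0] >= THRESHOLD), reverse=True)


if __name__ == "__main__":
    public_record = {"name": "John Smith", "city": "Denver", "state": "CO"}
    print(resolve(public_record))
```

Blocking keeps the expensive pairwise comparison limited to a small candidate set rather than the full database; in a real system the final decision would presumably come from a model trained on labeled record pairs rather than from fixed weights.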
16 Claims
1. A system comprising:
means, responsive to one or more data fields in a public record, for retrieving a set of candidate named entity records from a master named entity database based on one of a set of two or more blocking queries;
matching means for calculating similarity scores for one or more of the data fields in the public record and data fields in the candidate named entity records; and
means for determining a confidence rating for one or more of the set of similarity scores between the public record and the candidate public record into a confidence rating.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
10. A method comprising:
retrieving a set of candidate named entity records from a master named entity database based on one of a set of two or more blocking queries, with each block query based on one or more data fields in a public record;
calculating similarity scores for one or more of the data fields in the public record and data fields in the candidate named entity records; and
means for determining a confidence rating for one or more of the set of similarity scores between the public record and the candidate public record into a confidence rating.
Dependent claims: 11, 12, 13, 14, 15, 16
Specification