Entity representation identification using entity representation level information
First Claim
Patent Images
1. A method of identifying an entity representation in an electronic universal database that corresponds to an entity representation in an electronic foreign database, each database comprising a plurality of entity representations, the method comprising:
- receiving a plurality of search criteria field values;
determining one or more entity representations in the foreign database corresponding to one or more of the search criteria field values, each entity representation in the electronic universal database and electronic foreign database comprising a plurality of linked records, each record comprising a plurality of fields, each field capable of containing a field value, each field value associated with a field value weight, wherein each field value weight comprises an expected value of a size of a field value cohort of a record in an arbitrary entity representation in the universal database;
forming a comprehensive search criteria, based on determining the one or more entity representations in the foreign database, the comprehensive search criteria comprising a plurality of field values from a plurality of records from the entity representation in the foreign database corresponding to at least some of the search criteria field values, wherein the comprehensive search criteria comprises at least two non-identical field values associated with a same field from the plurality of records in the entity representation in the foreign database;
determining a highest ranked entity representation in the universal database according to summed field value weights for field values matching the comprehensive search criteria;
calculating a confidence level reflecting a likelihood that the highest ranked entity representation corresponds to the plurality of search criteria field values; and
outputting, when the confidence level exceeds a predetermined threshold, an identifier for the highest ranked entity representation.
2 Assignments
0 Petitions
Accused Products
Abstract
Disclosed is a system for, and method of, searching for and identifying one or more entity representations using comprehensive search criteria built from known entity representations. The comprehensive search criteria are permitted to include inconsistent field values, that is, multiple different field values corresponding to the same field. The system and method may perform using search queries or batch files.
-
Citations
32 Claims
-
1. A method of identifying an entity representation in an electronic universal database that corresponds to an entity representation in an electronic foreign database, each database comprising a plurality of entity representations, the method comprising:
-
receiving a plurality of search criteria field values; determining one or more entity representations in the foreign database corresponding to one or more of the search criteria field values, each entity representation in the electronic universal database and electronic foreign database comprising a plurality of linked records, each record comprising a plurality of fields, each field capable of containing a field value, each field value associated with a field value weight, wherein each field value weight comprises an expected value of a size of a field value cohort of a record in an arbitrary entity representation in the universal database; forming a comprehensive search criteria, based on determining the one or more entity representations in the foreign database, the comprehensive search criteria comprising a plurality of field values from a plurality of records from the entity representation in the foreign database corresponding to at least some of the search criteria field values, wherein the comprehensive search criteria comprises at least two non-identical field values associated with a same field from the plurality of records in the entity representation in the foreign database; determining a highest ranked entity representation in the universal database according to summed field value weights for field values matching the comprehensive search criteria; calculating a confidence level reflecting a likelihood that the highest ranked entity representation corresponds to the plurality of search criteria field values; and outputting, when the confidence level exceeds a predetermined threshold, an identifier for the highest ranked entity representation. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method of identifying a plurality of entity representations in an electronic universal database that correspond to a plurality of entity representations in an electronic foreign database, each database comprising a plurality of entity representations, the method comprising:
-
receiving a plurality of search criteria field values corresponding to a plurality of entity representations in the foreign database; determining a plurality of one or more entity representations in the foreign database corresponding to one or more of the plurality of search criteria field values, each entity representation in the electronic universal database and electronic foreign database comprising a plurality of linked records, each record comprising a plurality of fields, each field capable of containing a field value, each field value associated with a field value weight, wherein each field value weight comprises an expected value of a size of a field value cohort of a record in an arbitrary entity representation in the universal database; forming a plurality of comprehensive search criteria based on determining the one or more entity representation in the foreign database, one comprehensive search criteria for each of the plurality of entity representations in the foreign database corresponding to at least some of the search criteria field values, each comprehensive search criteria comprising a plurality of field values from a plurality of records from the corresponding entity representation in the foreign database, wherein each comprehensive search criteria comprises at least two non-identical field values associated with a same field from the plurality of records in the corresponding entity representation in the foreign database; for each comprehensive search criteria, determining a corresponding highest ranked entity representation in the universal database according to summed field value weights for field values matching the comprehensive search criteria; for each highest ranked entity representation, calculating an associated confidence level reflecting a likelihood that the highest ranked entity representation is correct for the corresponding comprehensive search criteria; and for each highest ranked entity representation, outputting, when the associated confidence level exceeds a predetermined threshold, an identifier for the highest ranked entity representation. - View Dependent Claims (9, 10, 11, 12, 13, 14, 15)
-
-
16. A system for identifying an entity representation in an electronic universal database that corresponds to an entity representation in an electronic foreign database, the system comprising:
-
an electronic universal database comprising a plurality of entity representations, each entity representation comprising a plurality of linked records, each record comprising a plurality of fields, each field capable of containing a field value, each field value associated with a field value weight, wherein each field value weight comprises an expected value of a size of a field value cohort of a record in an arbitrary entity representation in the universal database; an electronic memory storing a plurality of search criteria field values; a processor programmed to determine one or more entity representations in the foreign database corresponding to one or more of the search criteria field values, each entity representation in the electronic universal database and electronic foreign database comprising a plurality of linked records, each record comprising a plurality of fields, each field capable of containing a field value, each field value associated with a field value weight; a processor programmed to form and store a comprehensive search criteria based on determining the one or more entity representation in the foreign database, the comprehensive search criteria comprising a plurality of field values from a plurality of records from the entity representation in the foreign database corresponding to at least some of the search criteria field values, wherein the comprehensive search criteria comprises at least two non-identical field values associated with a same field from the plurality of records in the entity representation in the foreign database; a processor programmed to determine a highest ranked entity representation in the universal database according to summed field value weights for field values matching the comprehensive search criteria; a processor programmed to calculate a confidence level reflecting a likelihood that the highest ranked entity representation corresponds to the plurality of search criteria field values; and an output configured to output, when the confidence level exceeds a predetermined threshold, an identifier for the highest ranked entity representation. - View Dependent Claims (17, 18, 19, 20, 21, 22, 23)
-
-
24. A system of identifying a plurality of entity representations in an electronic universal database that correspond to a plurality of entity representations in an electronic foreign database, each database comprising a plurality of entity representations, the system comprising:
-
an electronic universal database comprising a plurality of entity representations, each entity representation comprising a plurality of linked records, each record comprising a plurality of fields, each field capable of containing a field value, each field value associated with a field value weight, wherein each field value weight comprises an expected value of a size of a field value cohort of a record in an arbitrary entity representation in the universal database; an electronic memory storing a plurality of search criteria field values corresponding to a plurality of entity representations in the foreign database; a processor programmed to determine one or more entity representations in the foreign database corresponding to one or more of the plurality of search criteria field values, each entity representation in the electronic universal database and electronic foreign database comprising a plurality of linked records, each record comprising a plurality of fields, each field capable of containing a field value, each field value associated with a field value weight; a processor programmed to form and store a plurality of comprehensive search criteria based on determining the one or more entity representation in the foreign database, one comprehensive search criteria for each of the plurality of entity representations in the foreign database corresponding to at least some of the search criteria field values, each comprehensive search criteria comprising a plurality of field values from a plurality of records from the corresponding entity representation in the foreign database, wherein each comprehensive search criteria comprises at least two non-identical field values associated with a same field from the plurality of records in the corresponding entity representation in the foreign database; a processor programmed to, for each comprehensive search criteria, determine a corresponding highest ranked entity representation in the universal database according to summed field value weights for field values matching the comprehensive search criteria; a processor programmed to, for each highest ranked entity representation, calculate an associated confidence level reflecting a likelihood that the highest ranked entity representation is correct for the corresponding comprehensive search criteria; and an output configured to, for each highest ranked entity representation, output, if when the associated confidence level exceeds a predetermined threshold, an identifier for the highest ranked entity representation. - View Dependent Claims (25, 26, 27, 28, 29, 30, 31, 32)
-
Specification