STATISTICAL MEASURE AND CALIBRATION OF SEARCH CRITERIA WHERE ONE OR BOTH OF THE SEARCH CRITERIA AND DATABASE IS INCOMPLETE
First Claim
1. A method of identifying an entity representation in an electronic universal database that corresponds to an entity representation in an electronic foreign database, each database comprising a plurality of entity representations, each entity representation comprising a plurality of linked records, each record comprising a plurality of fields, each field capable of containing a field value, each field value associated with a field value weight, the method comprising:
- electronically storing a plurality of field tables, each field table corresponding to a particular field, each field table comprising field value weights for each unique pair consisting of an arbitrary entity representation from the universal database and a field value appearing in the particular field of a record in the arbitrary entity representation from the universal database;
receiving a plurality of search criteria field values identifying an entity representation in the foreign database;
for each search criteria field value, fetching a weight from an associated field table corresponding to the search criteria field value;
summing results of the step of fetching according to entity representations from the universal database, resulting in a plurality of summed weights, one summed weight for each of a plurality of entity representations from the universal database;
ranking entity representations according to the plurality of summed weights;
determining a highest ranked entity representation;
calculating a confidence level reflecting a likelihood that the highest ranked entity representation corresponds to the entity representation identified by the search criteria field values; and
outputting, if the confidence level exceeds a predetermined threshold, an identifier for the highest ranked entity representation.
2 Assignments
0 Petitions
Accused Products
Abstract
Disclosed is a system for, and method of, identifying an entity representation. In some embodiments, search criteria are used to identify an entity representation in a universal database, and this identification is then used to identify a corresponding entity representation in a foreign database. Certain embodiments provide assurance, with a know probability of error, that the entity representation identified in the universal database is correct.
102 Citations
28 Claims
-
1. A method of identifying an entity representation in an electronic universal database that corresponds to an entity representation in an electronic foreign database, each database comprising a plurality of entity representations, each entity representation comprising a plurality of linked records, each record comprising a plurality of fields, each field capable of containing a field value, each field value associated with a field value weight, the method comprising:
-
electronically storing a plurality of field tables, each field table corresponding to a particular field, each field table comprising field value weights for each unique pair consisting of an arbitrary entity representation from the universal database and a field value appearing in the particular field of a record in the arbitrary entity representation from the universal database; receiving a plurality of search criteria field values identifying an entity representation in the foreign database; for each search criteria field value, fetching a weight from an associated field table corresponding to the search criteria field value; summing results of the step of fetching according to entity representations from the universal database, resulting in a plurality of summed weights, one summed weight for each of a plurality of entity representations from the universal database; ranking entity representations according to the plurality of summed weights; determining a highest ranked entity representation; calculating a confidence level reflecting a likelihood that the highest ranked entity representation corresponds to the entity representation identified by the search criteria field values; and outputting, if the confidence level exceeds a predetermined threshold, an identifier for the highest ranked entity representation. - View Dependent Claims (2, 3, 4, 5, 6, 7, 14)
-
-
8. A method of identifying an entity representation in an electronic universal database that corresponds to an entity representation in an electronic foreign database, each database comprising a plurality of entity representations, each entity representation comprising a plurality of linked records, each record comprising a plurality of fields, each field capable of containing a field value, each field value associated with a field value weight, the method comprising:
-
receiving a plurality of search criteria field values identifying an entity representation in the foreign database; ranking entity representations in the universal database according to summed field value weights, wherein each summed field value weight comprises weights corresponding to field values, from records within a same entity representation from the universal database, that match a search criteria field value, and wherein, for each summed field value weight, each weight corresponding to field values is counted at most once for each search criteria field value; determining a highest ranked entity representation; calculating a confidence level reflecting a likelihood that the highest ranked entity representation corresponds to the entity representation identified by the search criteria field values; and outputting, if the confidence level exceeds a predetermined threshold, an identifier for the highest ranked entity representation. - View Dependent Claims (9, 10, 11, 12, 13)
-
-
15. A system for identifying an entity representation in an electronic universal database that corresponds to an entity representation in an electronic foreign database, each database comprising a plurality of entity representations, each entity representation comprising a plurality of linked records, each record comprising a plurality of fields, each field capable of containing a field value, each field value associated with a field value weight, the system comprising:
-
an electronic universal database comprising a plurality of electronically stored entity representations, each entity representation comprising a plurality of linked records, each record comprising a plurality of fields, each field capable of containing a field value, each field value associated with a field value weight; a processor programmed to form and store a plurality of field tables, each field table corresponding to a particular field, each field table comprising field value weights for each unique pair consisting of an arbitrary entity representation from the universal database and a field value appearing in the particular field of a record in the arbitrary entity representation from the universal database; an electronic memory storing a plurality of search criteria field values identifying an entity representation in the foreign database; a processor programmed to, for each search criteria field value, fetch a weight from an associated field table corresponding to the search criteria field value; a processor programmed to sum the fetched weights according to entity representations from the universal database, resulting in a stored plurality of summed weights, one summed weight for each of a plurality of entity representations from the universal database; a processor configured to rank entity representations according to the plurality of summed weights; a processor programmed to determine a highest ranked entity representation; a processor programmed to calculate a confidence level reflecting a likelihood that the highest ranked entity representation corresponds to the entity representation identified by the search criteria field values; and a processor programmed to output, if the confidence level exceeds a predetermined threshold, an identifier for the highest ranked entity representation. - View Dependent Claims (16, 17, 18, 19, 20, 21)
-
-
22. A system for identifying an entity representation in an electronic universal database that corresponds to an entity representation in an electronic foreign database, each database comprising a plurality of entity representations, each entity representation comprising a plurality of linked records, each record comprising a plurality of fields, each field capable of containing a field value, each field value associated with a field value weight, the system comprising:
-
an electronic universal database comprising a plurality of electronically stored entity representations, each entity representation comprising a plurality of linked records, each record comprising a plurality of fields, each field capable of containing a field value, each field value associated with a field value weight; an electronic memory storing a plurality of search criteria field values identifying an entity representation in the foreign database; a processor programmed to rank entity representations in the universal database according to summed field value weights, wherein each summed field value weight comprises weights corresponding to field values, from records within a same universal entity representation, that match a search criteria field value, and wherein, for each summed field value weight, each weight corresponding to field values is counted at most once for each search criteria field value; a processor programmed to determine a highest ranked universal entity representation; a processor programmed to calculate a confidence level reflecting a likelihood that the highest ranked entity representation corresponds to the entity representation identified by the search criteria field values; and a processor programmed to output, if the confidence level exceeds a predetermined threshold, an identifier for the highest ranked entity representation. - View Dependent Claims (23, 24, 25, 26, 27, 28)
-
Specification