Focused probabilistic entity resolution from multiple data sources
First Claim
1. A computer system, comprising:
- one or more computer readable storage mediums configured to store computer executable instructions; and
one or more computer processors in communication with the one or more computer readable storage mediums and configured to execute the computer executable instructions in order to cause the computer system to;
retrieve a plurality of data objects from a plurality of data stores, the data stores comprising at least a first data store containing data objects corresponding to entities, and a second data store containing data objects corresponding to events; and
create aggregated data objects for one or more entities, wherein creating an aggregated data object for an entity comprises;
determining a confidence value for an association between one or more of the retrieved data objects the entity based at least in part upon one or more attribute types and attribute values associated with the one or more data objects;
generating metadata associating the one or more data objects with the entity based at least in part on the confidence value, wherein the metadata is stored separately from the one or more data objects, such that the one or more data objects are unchanged; and
determining one or more additional attributes for the entity, based upon the data objects associated with the entity.
8 Assignments
0 Petitions
Accused Products
Abstract
Various systems and methods are provided for performing soft entity resolution. A plurality of data objects are retrieved from a plurality of data stores to create aggregated data objects for one or more entities. One or more retrieved data objects may be associated with the same entity, based at least in part upon one or more attribute types and attribute values of the data objects. In response to a determination that the one or more of the retrieved data objects should be associated with the same entity, metadata is generated that associates the data objects with the entity, the metadata being stored separately from the data objects, such that the underlying data objects remain unchanged. In addition, one or more additional attributes may be determined for the entity, based upon the data objects associated with the entity.
565 Citations
20 Claims
-
1. A computer system, comprising:
-
one or more computer readable storage mediums configured to store computer executable instructions; and one or more computer processors in communication with the one or more computer readable storage mediums and configured to execute the computer executable instructions in order to cause the computer system to; retrieve a plurality of data objects from a plurality of data stores, the data stores comprising at least a first data store containing data objects corresponding to entities, and a second data store containing data objects corresponding to events; and create aggregated data objects for one or more entities, wherein creating an aggregated data object for an entity comprises; determining a confidence value for an association between one or more of the retrieved data objects the entity based at least in part upon one or more attribute types and attribute values associated with the one or more data objects; generating metadata associating the one or more data objects with the entity based at least in part on the confidence value, wherein the metadata is stored separately from the one or more data objects, such that the one or more data objects are unchanged; and determining one or more additional attributes for the entity, based upon the data objects associated with the entity. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A computer-implemented method for performing probabilistic entity resolution, comprising:
-
retrieving a plurality of data objects from a plurality of data stores, the data stores comprising at least a first data store containing data objects corresponding to entities, and a second data store containing data objects corresponding to events; and creating aggregated data objects for one or more entities, wherein creating an aggregated data object for an entity comprises; determining a confidence value for an association between one or more of the retrieved data objects and the entity based at least in part upon one or more attribute types and attribute values associated with the one or more data objects; generating metadata associating the one or more data objects with the entity based at least in part on the confidence value, wherein the metadata is stored separately from the one or more data objects, such that the one or more data objects are unchanged; and determining one or more additional attributes for the entity, based upon the data objects associated with the entity. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20)
-
Specification