Method and system for acquisition, normalization, matching, and enrichment of data
First Claim
1. A method comprising:
- obtaining a first data set from a first data source and a second data set from a second data source, the first data set comprising a first plurality of entities and the second data set comprising a second plurality of entities;
identifying a verified same-as relationship between a first entity from the first plurality of entities and a second entity from the second plurality of entities;
comparing one or more entities of the first plurality of entities having respective indicated relationships with the first entity with one or more entities of the second plurality of entities having respective indicated relationships with the second entity;
determining that a third entity from the first plurality of entities has an inferred same-as relationship with a fourth entity from the second plurality of entities based on comparing the one or more entities of the first plurality of entities having respective indicated relationships with the first entity with the one or more entities of the second plurality of entities having respective indicated relationships with the second entity;
comparing one or more entities of the first plurality of entities having respective indicated relationships with the third entity with one or more entities of the second plurality of entities having respective indicated relationships with the fourth entity;
classifying the inferred same-as relationship into one of a plurality of accuracy classes based on comparing the one or more entities of the first plurality of entities having respective indicated relationships with the third entity with the one or more entities of the second plurality of entities having respective indicated relationships with the fourth entity; and
generating first output data indicating that the third entity has the classified inferred same-as relationship with the fourth entity.
1 Assignment
0 Petitions
Accused Products
Abstract
In one embodiment, a method includes obtaining a first data set from a first data source and a second data set from a second data source, the first data set including a first plurality of entities and the second data set including a second plurality of entities. The method also includes identifying a verified relationship between a first entity from the first plurality of entities and a second entity from the second plurality of entities and determining that a third entity from the first plurality of entities has a first same-as relationship with a fourth entity from the second plurality of entities based on one or more of the verified relationship or relationships between the first plurality of entities and the second plurality of entities. The method further includes generating first output data including the first same-as relationship.
42 Citations
19 Claims
-
1. A method comprising:
-
obtaining a first data set from a first data source and a second data set from a second data source, the first data set comprising a first plurality of entities and the second data set comprising a second plurality of entities; identifying a verified same-as relationship between a first entity from the first plurality of entities and a second entity from the second plurality of entities; comparing one or more entities of the first plurality of entities having respective indicated relationships with the first entity with one or more entities of the second plurality of entities having respective indicated relationships with the second entity; determining that a third entity from the first plurality of entities has an inferred same-as relationship with a fourth entity from the second plurality of entities based on comparing the one or more entities of the first plurality of entities having respective indicated relationships with the first entity with the one or more entities of the second plurality of entities having respective indicated relationships with the second entity; comparing one or more entities of the first plurality of entities having respective indicated relationships with the third entity with one or more entities of the second plurality of entities having respective indicated relationships with the fourth entity; classifying the inferred same-as relationship into one of a plurality of accuracy classes based on comparing the one or more entities of the first plurality of entities having respective indicated relationships with the third entity with the one or more entities of the second plurality of entities having respective indicated relationships with the fourth entity; and generating first output data indicating that the third entity has the classified inferred same-as relationship with the fourth entity. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A system comprising:
-
a processor; and a combination of logic and non-transitory memory including instructions, that when at least executed in part by the processor cause the system to; obtain a first data set from a first data source and a second data set from a second data source, the first data set comprising a first plurality of entities and the second data set comprising a second plurality of entities; identify a verified same-as relationship between a first entity from the first plurality of entities and a second entity from the second plurality of entities; compare one or more entities of the first plurality of entities having respective indicated relationships with the first entity with one or more entities of the second plurality of entities having respective indicated relationships with the second entity; determine that a third entity from the first plurality of entities has an inferred same-as relationship with a fourth entity from the second plurality of entities based on comparing the one or more entities of the first plurality of entities having respective indicated relationships with the first entity with the one or more entities of the second plurality of entities having respective indicated relationships with the second entity; compare one or more entities of the first plurality of entities having respective indicated relationships with the third entity with one or more entities of the second plurality of entities having respective indicated relationships with the fourth entity; classify the inferred same-as relationship into one of a plurality of accuracy classes based on comparing the one or more entities of the first plurality of entities having respective indicated relationships with the third entity with the one or more entities of the second plurality of entities having respective indicated relationships with the fourth entity; and generate first output data indicating that the third entity has the classified inferred same-as relationship with the fourth entity. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19)
-
Specification