Determining a likelihood that two entities are the same
First Claim
1. A computer-implemented method executed by processor circuitry for determining a likelihood that a first entity and a second entity are a same entity, the method comprising:
- retrieving, from a first database, a first data related to the first entity and retrieving, from a second database, a second data related to the second entity;
segmenting the retrieved first data into a first set of individual components;
segmenting the retrieved second data into a second set of individual components, wherein each component in the first set of individual components corresponds to a component in the second set of individual components;
determining a characteristic value for each component in the first and second sets of individual components;
determining, for each component in the first set of individual components, a distance value based on a distance of the determined characteristic value of the component from the determined characteristic value of the corresponding component in the second set of individual components;
determining the likelihood that the first entity and the second entity are the same entity based on the determined distance values; and
altering a record related to the first entity or the second entity upon determining the likelihood is greater than or equal to a first likelihood; and
adding a second record from the first data related to the first entity to the second data related to the second entity if the likelihood is greater than or equal to a second likelihood.
1 Assignment
0 Petitions
Accused Products
Abstract
The disclosed embodiments provide a system that determines a likelihood that a first entity and a second entity are the same entity. During operation, the system obtains financial data related to the first entity and obtains financial data related to the second entity. Next, the system determines the likelihood that the first entity and the second entity are the same entity based on the relationship between the financial data for the first entity and the financial data for the second entity. Then, the system alters a record related to the first entity or the second entity based on the likelihood.
46 Citations
19 Claims
-
1. A computer-implemented method executed by processor circuitry for determining a likelihood that a first entity and a second entity are a same entity, the method comprising:
-
retrieving, from a first database, a first data related to the first entity and retrieving, from a second database, a second data related to the second entity; segmenting the retrieved first data into a first set of individual components; segmenting the retrieved second data into a second set of individual components, wherein each component in the first set of individual components corresponds to a component in the second set of individual components; determining a characteristic value for each component in the first and second sets of individual components; determining, for each component in the first set of individual components, a distance value based on a distance of the determined characteristic value of the component from the determined characteristic value of the corresponding component in the second set of individual components; determining the likelihood that the first entity and the second entity are the same entity based on the determined distance values; and altering a record related to the first entity or the second entity upon determining the likelihood is greater than or equal to a first likelihood; and
adding a second record from the first data related to the first entity to the second data related to the second entity if the likelihood is greater than or equal to a second likelihood. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system for determining a likelihood that a first entity and a second entity are a same entity, comprising:
processing circuitry configured to; retrieve, from a first database, a first data related to the first entity and retrieve, from a second database, a second data related to the second entity; segment the retrieved first data into a first set of individual components; segment the retrieved second data into a second set of individual components, wherein each component in the first set of individual components corresponds to a component in the second set of individual components; determine a characteristic value for each component in the first and second sets of individual components; determine, for each component in the first set of individual components, a distance value based on a distance of the determined characteristic value of the component from the determined characteristic value of the corresponding component in the second set of individual components; determine the likelihood that the first entity and the second entity are the same entity based on the determined distance values; and alter a record related to the first entity or the second entity upon determining the likelihood is greater than or equal to a first likelihood; and
add a second record from the first data related to the first entity to the second data related to the second entity if the likelihood is greater than or equal to a second likelihood.- View Dependent Claims (9, 10, 11, 12, 13, 14)
-
15. A non transitory computer-readable storage medium storing instructions that when executed by a computer cause processing circuitry to perform a method for determining a likelihood that a first entity and a second entity are a same entity, the method comprising:
-
retrieving, from a first database, a first data related to the first entity and retrieving, from a second database, a second data related to the second entity; segmenting the retrieved first data into a first set of individual components; segmenting the retrieved second data into a second set of individual components, wherein each component in the first set of individual components corresponds to a component in the second set of individual components; determining a characteristic value for each component in the first and second sets of individual components; determining, for each component in the first set of individual components, a distance value based on a distance of the determined characteristic value of the component from the determined characteristic value of the corresponding component in the second set of individual components; determining the likelihood that the first entity and the second entity are the same entity based on the determined distance values; and altering a record related to the first entity or the second entity upon determining the likelihood is greater than or equal to a first likelihood; and
adding a second record from the first data related to the first entity to the second data related to the second entity if the likelihood is greater than or equal to a second likelihood. - View Dependent Claims (16, 17, 18, 19)
-
Specification