De-duping identities using network analysis and behavioral comparisons
First Claim
1. A method comprising:
- identifying, by a processing device, comparison subjects comprising a first subject and a second subject;
building, by the processing device, a first network of a first profile of the first subject and a second network of a second profile of the second subject, wherein the first network and the second network are multi-degree connections networks;
comparing, by the processing device, the first network and second network to produce a similarity score, wherein comparing comprises examining first-degree connections between the first network and the second network and accounting for centralities that rely on information about the entire first network and the entire second network; and
responsive to the similarity score exceeding a similarity threshold,merging, by the processing device, the first profile and the second profile into a common profile for the first subject and the second subject.
2 Assignments
0 Petitions
Accused Products
Abstract
A processing device identifies comparison subjects comprising a first subject and a second subject. The processing device builds a first network of a first profile of the first subject and a second network of a second profile of the second subject, wherein the first network and the second network are multi-degree connections networks. The processing device the first network and second network to produce a similarity score. The processing device examining first-degree connections between the first network and the second network and accounting for centralities that rely on information about the entire first network and the entire second network. Responsive to the similarity score exceeding a similarity threshold, the processing device merges the first profile and the second profile into a common profile for the first subject and the second subject.
16 Citations
20 Claims
-
1. A method comprising:
-
identifying, by a processing device, comparison subjects comprising a first subject and a second subject; building, by the processing device, a first network of a first profile of the first subject and a second network of a second profile of the second subject, wherein the first network and the second network are multi-degree connections networks; comparing, by the processing device, the first network and second network to produce a similarity score, wherein comparing comprises examining first-degree connections between the first network and the second network and accounting for centralities that rely on information about the entire first network and the entire second network; and responsive to the similarity score exceeding a similarity threshold, merging, by the processing device, the first profile and the second profile into a common profile for the first subject and the second subject. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A system comprising:
-
a memory; a processing device operatively couple to the memory to; identify comparison subjects comprising a first subject and a second subject; build a first network of a first profile of the first subject and a second network of a second profile of the second subject, wherein the first network and the second network are multi-degree connections networks; compare the first network and second network to produce a similarity score, wherein comparing comprises examining first-degree connections between the first network and the second network and accounting for centralities that rely on information about the entire first network and the entire second network; and responsive to the similarity score exceeding a similarity threshold, merge the first profile and the second profile into a common profile for the first subject and the second subject.
-
-
15. A method, comprising:
-
linking a phone number to a subject; determining that the phone number of the subject is no longer being used; building, by a processing device, a first network of nodes comprising calls made by the subject; identifying, by the processing device, the most commonly called telephone numbers of the subject by traversing the nodes in the first network; building, by a processing device, a second network of nodes of call data records (CDR) of telephone numbers of the subject; identifying, by the processing device, call frequencies of the telephone numbers in the CDRs of the subject by traversing the nodes in the second network; and responsive to determining that one or more nodes in the second network have call frequencies of a telephone number matching corresponding nodes having the telephone number in the first network, identifying a phone associated with the telephone number as a burner phone. - View Dependent Claims (16, 17, 18, 19)
-
-
20. A system comprising:
-
a memory; a processing device operatively couple to the memory to; link a phone number to a subject; determine that the phone number of the subject is no longer being used; build a first network of nodes comprising calls made by the subject; identify the most commonly called telephone numbers of the subject by traversing the nodes in the first network; build a second network of nodes of call data records (CDR) of telephone numbers of the subject; identify call frequencies of the telephone numbers in the CDRs of the subject by traversing the nodes in the second network; and responsive to determining that one or more nodes in the second network have call frequencies of a telephone number matching corresponding nodes having the telephone number in the first network, identify a phone associated with the telephone number as a burner phone.
-
Specification