Identifying multiple devices belonging to a single user
First Claim
1. In an environment in which analytics data is compiled from users using devices connected to a computer network, a method for grouping devices to facilitate collecting analytics data for particular users from multiple devices, the method comprising:
- excluding, from information relating to user identifications and devices on which the user identifications were received, one or more of;
(i) a particular user identification based on determining that data regarding the particular user identification is noisy, wherein determining that the particular user identification is noisy comprises determining that the particular user identification was received by more than a threshold number of devices, and(ii) a particular device based on determining that data regarding the particular device is noisy, wherein determining that the particular device is noisy comprises determining that the particular device received more than a threshold number of different user identifications;
identifying, based on the information excluding the one or more of the particular user identification and the particular device, a plurality of clusters including a first cluster comprising one or more devices and a second cluster comprising one or more devices, wherein the first cluster is initially identified by identifying devices on which a first user account authentication was received and the second cluster is initially identified by identifying devices on which a second user account authentication was received;
identifying that a particular device is in both the first cluster and the second cluster; and
based on identifying that the particular device is in both the first cluster and the second cluster, determining, based on a predetermined criteria, to;
merge the first cluster and the second cluster to create a single merged cluster;
orchange the first cluster or the second cluster so that the particular device is only in one of the first cluster or the second cluster,wherein analytics data from devices in the first cluster is associated with a first user and analytics data from devices in the second cluster is associated with a second user.
3 Assignments
0 Petitions
Accused Products
Abstract
Techniques are disclosed that provide more accurate clustering of devices by forming clusters of devices and merging or changing clusters based on predetermined criteria. The technique starts with a large number of clusters (e.g., one for each account) and refines the clusters, for example, by merging clusters or determining which cluster a given device should be in when the device is associated with multiple clusters. One technique iteratively adjusts clusters of devices by merging clusters determined to be associated with a single user until a cluster contains all of the devices and accounts expected to be associated with a single user.
-
Citations
16 Claims
-
1. In an environment in which analytics data is compiled from users using devices connected to a computer network, a method for grouping devices to facilitate collecting analytics data for particular users from multiple devices, the method comprising:
-
excluding, from information relating to user identifications and devices on which the user identifications were received, one or more of; (i) a particular user identification based on determining that data regarding the particular user identification is noisy, wherein determining that the particular user identification is noisy comprises determining that the particular user identification was received by more than a threshold number of devices, and (ii) a particular device based on determining that data regarding the particular device is noisy, wherein determining that the particular device is noisy comprises determining that the particular device received more than a threshold number of different user identifications; identifying, based on the information excluding the one or more of the particular user identification and the particular device, a plurality of clusters including a first cluster comprising one or more devices and a second cluster comprising one or more devices, wherein the first cluster is initially identified by identifying devices on which a first user account authentication was received and the second cluster is initially identified by identifying devices on which a second user account authentication was received; identifying that a particular device is in both the first cluster and the second cluster; and based on identifying that the particular device is in both the first cluster and the second cluster, determining, based on a predetermined criteria, to; merge the first cluster and the second cluster to create a single merged cluster;
orchange the first cluster or the second cluster so that the particular device is only in one of the first cluster or the second cluster, wherein analytics data from devices in the first cluster is associated with a first user and analytics data from devices in the second cluster is associated with a second user. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A computing system comprising:
-
a non-transitory computer-readable medium; and a processor communicatively coupled to the non-transitory computer-readable medium, wherein the processor is configured for executing instructions stored in the non-transitory computer-readable medium on one or more devices, wherein, when executed by the processor, the instructions perform operations comprising; excluding, from information relating to user identifications and devices on which the user identifications were received, one or more of; (i) a particular user identification based on determining that data regarding the particular user identification is noisy, wherein determining that the particular user identification is noisy comprises determining that the particular user identification was received by more than a threshold number of devices, and (ii) a particular device based on determining that data regarding the particular device is noisy, wherein determining that the particular device is noisy comprises determining that the particular device received more than a threshold number of different user identifications; identifying, based on the information excluding the one or more of the particular user identification and the particular device, a plurality of clusters including a first cluster comprising one or more devices and a second cluster comprising one or more devices, wherein the first cluster is initially identified by identifying devices on which a first user account authentication was received and the second cluster is initially identified by identifying devices on which a second user account authentication was received; identifying that a particular device is in both the first cluster and the second cluster; and based on identifying that the particular device in both the first cluster and the second cluster, determining, based on predetermined criteria, to; merge the first cluster and the second cluster to create a single merged cluster;
orchange the first cluster or the second cluster so that the particular device is only in one of the first cluster or the second cluster, wherein analytics data from devices in the first cluster is associated with a first user and analytics data from devices in the second cluster is associated with a second user. - View Dependent Claims (8, 9, 10, 11)
-
-
12. A non-transitory computer-readable medium embodying program code executable by a computer system, the non-transitory computer-readable medium comprising:
-
program code for excluding, from information relating to user identifications and devices on which the user identifications were received, one or more of; (i) a particular user identification based on determining that data regarding the particular user identification is noisy, wherein determining that the particular user identification is noisy comprises determining that the particular user identification was received by more than a threshold number of devices, and (ii) a particular device based on determining that data regarding the particular device is noisy, wherein determining that the particular device is noisy comprises determining that the particular device received more than a threshold number of different user identifications; program code for identifying, based on the information excluding the one or more of the particular user identification and the particular device, a plurality of clusters including a first cluster comprising one or more devices and a second cluster comprising one or more devices, wherein the first cluster is initially identified by identifying devices on which a first user account authentication was received and the second cluster is initially identified by identifying devices on which a second user account authentication was received; program code for identifying that a particular device is in both the first cluster and the second cluster; program code for based on identifying that the particular device is in both the first cluster and the second cluster, determining, based on predetermined criteria, to; merge the first cluster and the second cluster to create a single merged cluster;
orchange the first cluster or the second cluster so that the particular device is only in one of the first cluster or the second cluster, wherein analytics data from devices in the first cluster is associated with a first user and analytics data from devices in the second cluster is associated with a second user. - View Dependent Claims (13, 14, 15, 16)
-
Specification