Data fusion using behavioral factors
First Claim
1. A system comprising:
- one or more processing devices; and
one or more storage devices storing instructions that, when executed by the one or more processing devices, cause the one or more processing devices to perform the following operations;
access a first data set associated with a first group of users, the first data set including demographic data, online behavior data, and additional user data associated with the users in the first group;
access a second data set associated with a second group of users, the second data set including demographic data and online behavior data but not additional user data associated with the users in the second group;
group users in the first group and users in the second group into one or more subsets based on the demographic data included in the first data set and the demographic data included in the second data set;
create a vector of online behavior data for each user from the first group grouped in a first subset and for each user from the second group grouped into the first subset;
match a first user in the first group with a second user in the second group by;
determining a distance between the vector created for the second user and each of the vectors created for the users from the first group grouped in the first subset,determining that the distance between the vector created for the first user and the vector created for the second user is smaller than the distances between the vector created for the second user and the other users from the first group grouped in the subset, andmatching the first user with the second user in response to determining that the distance between the vector created for the first user and the vector created for the second user is smaller than the distances between the vector created for the second user and the other users from the first group grouped in the subset;
generate, based on matching the first user in the first group with the second user in the second group, an augmented second data set that includes additional user data associated with the second user in the second group; and
generate one or more reports based on the augmented second data set.
7 Assignments
0 Petitions
Accused Products
Abstract
A first data set associated with a first group of users is accessed. The first data set includes demographic data, online behavior data, and additional user data associated with the users in the first group. A second data set associated with a second group of users is accessed. The second data set includes demographic data and online behavior data but not additional user data associated with the users in the second group. One or more sets of matched users are determined based on the demographic data and online behavior data included in the first data set and the demographic data and online behavior data included in the second data set. Each set includes a user from the first group matched with a user from the second group. Based on the one or more sets of matched users, an augmented second data set that includes additional user data associated with the users in the second group is generated. One or more reports are generated based on the augmented second data set.
-
Citations
14 Claims
-
1. A system comprising:
-
one or more processing devices; and one or more storage devices storing instructions that, when executed by the one or more processing devices, cause the one or more processing devices to perform the following operations; access a first data set associated with a first group of users, the first data set including demographic data, online behavior data, and additional user data associated with the users in the first group; access a second data set associated with a second group of users, the second data set including demographic data and online behavior data but not additional user data associated with the users in the second group; group users in the first group and users in the second group into one or more subsets based on the demographic data included in the first data set and the demographic data included in the second data set; create a vector of online behavior data for each user from the first group grouped in a first subset and for each user from the second group grouped into the first subset; match a first user in the first group with a second user in the second group by; determining a distance between the vector created for the second user and each of the vectors created for the users from the first group grouped in the first subset, determining that the distance between the vector created for the first user and the vector created for the second user is smaller than the distances between the vector created for the second user and the other users from the first group grouped in the subset, and matching the first user with the second user in response to determining that the distance between the vector created for the first user and the vector created for the second user is smaller than the distances between the vector created for the second user and the other users from the first group grouped in the subset; generate, based on matching the first user in the first group with the second user in the second group, an augmented second data set that includes additional user data associated with the second user in the second group; and generate one or more reports based on the augmented second data set. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A computer-implemented method comprising:
-
accessing a first data set associated with a first group of users, the first data set including demographic data, online behavior data, and additional user data associated with the users in the first group; accessing a second data set associated with a second group of users, the second data set including demographic data and online behavior data but not additional user data associated with the users in the second group; grouping users in the first group and users in the second group into one or more subsets based on the demographic data included in the first data set and the demographic data included in the second data set creating a vector of online behavior data for each user from the first group grouped in a first subset and for each user from the second group grouped into the first subset matching a first user in the first group with a second user in the second group by; determining a distance between the vector created for the second user and each of the vectors created for the users from the first group grouped in the first subset, determining that the distance between the vector created for the first user and the vector created for the second user is smaller than the distances between the vector created for the second user and the other users from the first group grouped in the subset, and matching the first user with the second user in response to determining that the distance between the vector created for the first user and the vector created for the second user is smaller than the distances between the vector created for the second user and the other users from the first group grouped in the subset; generating, based on matching the first user in the first group with the second user in the second group, an augmented second data set that includes additional user data associated with the second user in the second group; and generating one or more reports based on the augmented second data set. - View Dependent Claims (7, 8, 9, 10)
-
-
11. A system comprising:
-
one or more processing devices; and one or more storage devices storing instructions that, when executed by the one or more processing devices, cause the one or more processing devices to perform the following operations; access a first set of demographic data for a first group of client systems, the first set of demographic data being determined based on survey data collected from each member of the first group of client systems; access a second set of demographic data for a second group of client systems, the second set of demographic data being determined based on profile data associated with each member of the second group of client systems; compare the first set of demographic data with the second set of demographic data; based on the comparison of the first set of demographic data with the second set of demographic data, group, into subsets, one or more members of the first group of client systems with one or more members of the second group of client systems such that members of each subset share matching demographic data; access online behavior data for each member of a subset; compare online behavior data of each member of the subset with online behavior data of other members of the subset by; creating a vector of online behavior data for each client system from the first group grouped in a first subset and for each client system from the second group grouped into the first subset, the first group grouped in the first subset including a first client system and the second group grouped in the first subset including a second client system, determining a distance between the vector created for the second client system and each of the vectors created for the client systems from the first group grouped in the first subset, and determining that the distance between the vector created for the first client system and the vector created for the second client system is smaller than the distances between the vector created for the second client system and the other client systems from the first group grouped in the first subset; based on the comparison the online behavior data of each member of the subset with online behavior data of the other members of the subset, associate, within the subset, each member of the second group of client systems with a member of the first group of client systems, wherein associating each member of the second group of client systems with a member of the first group of client systems includes matching the first client system with the second client system in response to determining that the distance between the vector created for the first client system and the vector created for the second client system is smaller than the distances between the vector created for the second client system and the other client systems from the first group grouped in the first subset; for each member of the second group of client systems within the subset, associate survey data collected from the member of the first group of client systems with the member of the second group of client systems with whom the member of the first group of client systems has been associated; and generate one or more reports based on the association of survey data. - View Dependent Claims (12, 13, 14)
-
Specification