Location estimation of social network users
First Claim
1. A system comprising:
- a memory;
a processor communicatively coupled to the memory;
a location estimator communicatively coupled to the memory and the processor, wherein the test location estimator is configured to;
receive a plurality of social media messages generated by a given user;
extract a plurality of location features from the social media messages;
compute, for each of the plurality of location features, a frequency of the location feature for at least one location;
determine, for each of the plurality of location features, a number of people in the at least one location who have used the location feature in their social networking messages;
determine, for each of the plurality of location features and based on the computed frequency and the determined number of people, if the location feature was included within social networking messages of a threshold percentage of people in the at least on location;
based on the location feature having been included within social networking messages of the threshold percentage of people;
adding the feature to the subset of features;
identify at least a subset of location features from the plurality of location features that are discriminative of at least one location at a location granularity level of interest;
process each of the subset of location features with at least one classifier from an ensemble of classifiers;
generate, by each of the classifiers, a location classification for each of the social media messages, each location classification comprising a location and a weight associated with that location; and
select one of the locations from the location classifications as the location of the given user based on a combination of the weights of the location classifications.
3 Assignments
0 Petitions
Accused Products
Abstract
Various embodiments of the invention relate to estimating the location of social network users. In one embodiment, a plurality of social media messages generated by a given user is received. A plurality of location features is extracted from the social media messages. Each of the location features is processed with at least one classifier from an ensemble of classifiers. A location classification is generated by each of the classifiers for each of the social media messages. Each classification comprises a location and a weight associated with that location. One of the locations is selected from the location classifications as the location of the given user based on a combination of the weights of the location classifications.
-
Citations
12 Claims
-
1. A system comprising:
-
a memory; a processor communicatively coupled to the memory; a location estimator communicatively coupled to the memory and the processor, wherein the test location estimator is configured to; receive a plurality of social media messages generated by a given user;
extract a plurality of location features from the social media messages;compute, for each of the plurality of location features, a frequency of the location feature for at least one location; determine, for each of the plurality of location features, a number of people in the at least one location who have used the location feature in their social networking messages; determine, for each of the plurality of location features and based on the computed frequency and the determined number of people, if the location feature was included within social networking messages of a threshold percentage of people in the at least on location; based on the location feature having been included within social networking messages of the threshold percentage of people; adding the feature to the subset of features; identify at least a subset of location features from the plurality of location features that are discriminative of at least one location at a location granularity level of interest; process each of the subset of location features with at least one classifier from an ensemble of classifiers; generate, by each of the classifiers, a location classification for each of the social media messages, each location classification comprising a location and a weight associated with that location; and select one of the locations from the location classifications as the location of the given user based on a combination of the weights of the location classifications. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A computer program product comprising:
-
a non-transitory computer readable storage medium having computer readable program code embodied therewith, the computer readable program code configured to; receive a plurality of social media messages generated by a given user;
extract a plurality of location features from the social media messages;process each of the location features with at least one classifier from an ensemble of classifiers, wherein processing each of the location features comprises determining, by a binary classifier associated with one of the classifiers in the ensemble of classifiers, if a location associated with a given user is predictable by the on classifier; and preventing the one classifier from generating the location classification if the binary classifier determines that the location is not predictable by the one classifier; generate, by each of the classifiers, a location classification for each of the social media messages, each location classification comprising a location and a weight associated with that location; and select one of the locations from the location classifications as the location of the given user based on a combination of the weights of the location classifications. - View Dependent Claims (8, 9, 10, 11, 12)
-
Specification