Method and apparatus to identify outliers in social networks
First Claim
1. A method, comprising:
- reducing, by a system comprising a processor, a sampling size of a total population of social network users of an on-line social network based on a comparison of seed information to a population of the social network users to obtain a reduced population of social network users of the on-line network;
sampling, by the system, the reduced population of the social network users utilizing a Metropolized randomized walk algorithm to obtain first sampled social network users;
sampling, by the system, the reduced population of the social network users utilizing a crawl algorithm that includes non-active users of the on-line social network to obtain second sampled social network users;
generating, by the system, a social network graph based on the first sampled social network users and the second sampled social network users, wherein the social network graph comprises an arrangement of the first sampled social network users and the second social network users based on relationships between members of the first sampled social network users and the second social network users arising in the on-line social network;
identifying, by the system, a cluster of social network users within the arrangement of the first sampled social network users and the second social network users of the social network graph;
characterizing, by the system, the reduced sampling of the social network users based on the cluster of the social network users of the social network graph; and
identifying, by the system, an outlier in the reduced sampling of the social network users based on the characterizing of the social network users, and based on collateral information combined with an intra-graph property of the social network graph, wherein the outlier does not conform to the social network graph.
1 Assignment
0 Petitions
Accused Products
Abstract
A system that incorporates teachings of the present disclosure may include, for example, a process that reduces a sampling size of a total population of on-line social network users based on a comparison of seed information to a population of on-line social network users. The reduced sampling of on-line social network users is compared to a social graph of the on-line social network users, wherein the social graph is obtained from an algorithm applied to the reduced sampling of the on-line social network users. An outlier is determined in the reduced sampling of on-line social network users based on relationships between the on-line social network users. The determining of the outlier is also based on one of ambient or publicly available information combined with an intra-graph property of the on-line social network users, wherein the outlier does not conform to the social graph. Additional embodiments are disclosed.
-
Citations
20 Claims
-
1. A method, comprising:
- reducing, by a system comprising a processor, a sampling size of a total population of social network users of an on-line social network based on a comparison of seed information to a population of the social network users to obtain a reduced population of social network users of the on-line network;
sampling, by the system, the reduced population of the social network users utilizing a Metropolized randomized walk algorithm to obtain first sampled social network users;
sampling, by the system, the reduced population of the social network users utilizing a crawl algorithm that includes non-active users of the on-line social network to obtain second sampled social network users;
generating, by the system, a social network graph based on the first sampled social network users and the second sampled social network users, wherein the social network graph comprises an arrangement of the first sampled social network users and the second social network users based on relationships between members of the first sampled social network users and the second social network users arising in the on-line social network;
identifying, by the system, a cluster of social network users within the arrangement of the first sampled social network users and the second social network users of the social network graph;
characterizing, by the system, the reduced sampling of the social network users based on the cluster of the social network users of the social network graph; and
identifying, by the system, an outlier in the reduced sampling of the social network users based on the characterizing of the social network users, and based on collateral information combined with an intra-graph property of the social network graph, wherein the outlier does not conform to the social network graph. - View Dependent Claims (2, 3, 4, 5, 6)
- reducing, by a system comprising a processor, a sampling size of a total population of social network users of an on-line social network based on a comparison of seed information to a population of the social network users to obtain a reduced population of social network users of the on-line network;
-
7. A device, comprising:
-
a memory that stores executable instructions; and a processor coupled to the memory, wherein execution of the instructions by the processor facilitates performance of operations comprising; obtaining a reduced sampling of social network users of a population of social network users of an on-line social network based on a comparison of seed information to the population of social network users; sampling the reduced population of the social network users utilizing a Metropolized randomized walk algorithm to obtain first sampled social network users; sampling the reduced population of the social network users utilizing a crawl algorithm that includes non-active users of the on-line social network to obtain second sampled social network users; generating a social graph based on the first sampled social network users and the second sampled social network users, wherein the social graph comprises an arrangement of the first sampled social network users and the second social network users based on relationships between members of the first sampled social network users and the second social network users arising in the on-line social network;
identifying a cluster of social network users within the arrangement of the first sampled social network users and the second social network users of the social graph;characterizing the reduced sampling of the social network users based on the cluster of the social network users of the social network users of the social graph; and identifying an outlier in the reduced sampling of the social network users based on the characterizing of the reduced sampling of the social network users, and based on collateral information combined with an intra-graph property of the social graph, wherein the outlier does not conform to the social graph. - View Dependent Claims (8, 9, 10, 11, 12, 13)
-
-
14. A non-transitory, machine-readable storage medium, comprising executable instructions that, when executed by a processor, facilitate performance of operations comprising:
-
reducing a sampling size of a population of social network users of an on-line social network based on a comparison of seed information to the population of the social network users to obtain a reduced sampling of the social network users;
sampling the reduced population of the social network users utilizing a Metropolized randomized walk algorithm to obtain first sampled social network users;sampling the reduced population of the social network users utilizing a crawl algorithm that includes non-active users of the on-line social network to obtain second sampled social network users; generating a social graph based on the first sampled social network users and the second sampled social network users, wherein the social graph comprises an arrangement of the first sampled social network users and the second social network users based on relationships between members of the first sampled social network users and the second social network users arising in the on-line social network; identifying a cluster of social network users within the arrangement of the first sampled social network users and the second social network users of the social graph; characterizing the reduced sampling of the social network users based on the cluster of the social network users of the social network users of the social graph; and determining an outlier in the reduced sampling of the social network users based on the characterizing of the reduced sampling of the social network users, and based on one of ambient or publicly available information combined with an intra-graph property of the social network users, wherein the outlier does not conform to the social graph. - View Dependent Claims (15, 16, 17, 18, 19, 20)
-
Specification