METHODS AND SYSTEMS OF CLASSIFYING SPAM URL
First Claim
1. A method comprising:
- identifying a feature dimension of a user action on a social networking system to detect anomalies;
extracting URL chunks from content associated with the user action;
aggregating a non-content feature of the user action along the feature dimension into a URL distribution store to produce a feature distribution for each of the URL chunks;
determining whether the feature distribution of a particular URL chunk within the URL chunks exceeds an expectation threshold for the feature dimension; and
classifying the particular URL chunk as an illegitimate URL when the feature distribution exceeds the expectation threshold to restrict access to a particular URL chunk on a social networking system.
2 Assignments
0 Petitions
Accused Products
Abstract
A method of operation of a URL spam detection system includes: identifying a feature dimension of a user action on a social networking system to detect anomalies; extracting URL chunks from a content associated with the user action; aggregating a non-content feature of the user action along the feature dimension into a URL distribution store to produce a feature distribution for each of the URL chunks; determining whether the feature distribution of a particular URL chunk within the URL chunks exceeds an expectation threshold for the feature dimension; and classifying the particular URL chunk as an illegitimate URL when the feature distribution exceeds the expectation threshold to restrict access to a particular URL chunk on a social networking system.
23 Citations
20 Claims
-
1. A method comprising:
-
identifying a feature dimension of a user action on a social networking system to detect anomalies; extracting URL chunks from content associated with the user action; aggregating a non-content feature of the user action along the feature dimension into a URL distribution store to produce a feature distribution for each of the URL chunks; determining whether the feature distribution of a particular URL chunk within the URL chunks exceeds an expectation threshold for the feature dimension; and classifying the particular URL chunk as an illegitimate URL when the feature distribution exceeds the expectation threshold to restrict access to a particular URL chunk on a social networking system. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A method comprising:
-
identifying a feature dimension of a user action on a social networking system, the feature dimension related to a user account responsible for the user action; extracting URL chunks from content associated with the user action; aggregating a sender feature of the user action along the feature dimension into a URL distribution store to produce a feature distribution for each of the URL chunks; detecting an anomaly in the feature distribution of a particular URL chunk within the URL chunks as compared to an expected distribution along the feature dimension; and raising a suspicion level of the particular URL chunk when the anomaly is detected. - View Dependent Claims (14, 15, 16, 17, 18, 19)
-
-
20. A processor-based system comprising:
-
a feature collector module stored on a non-transitory memory, when executed by a processor is configured to; identify a feature dimension of a user action on a social networking system, the feature dimension related to a user account responsible for the user action; extract URL chunks from content associated with the user action; aggregate a sender feature of the user action along the feature dimension into a URL distribution store to produce a feature distribution for each of the URL chunks; and a URL classifier module stored on a non-transitory memory, when executed by a processor is coupled to the feature collection module via the URL distribution store and configured to; detect an anomaly in the feature distribution of a particular URL chunk within URL chunks as compared to an expected distribution; and raise a suspicion level of the particular URL chunk when the anomaly is detected.
-
Specification