Anti-spam transient entity classification
First Claim
1. A method, comprising:
- receiving, by a processor, a first e-mail message associated with an IP address of a mail server that sent the first e-mail message;
concurrently fetching, by the processor, a plurality of features associated with the IP address, the plurality of features including one or more fast features and one or more of full features, wherein a fast feature comprises a feature that fetches in less than a predefined time threshold, and wherein a full feature fetches in greater than the predefined time threshold;
classifying, by the processor, the IP address with a fast classifier based on the one or more fast features, the fast classifier further classifying a user based on information obtained during user registration with an on-line entity;
determining, by the fast classifier, that the mail server is a spammer;
caching, by the processor, a result of the fast classifier in an entity classification cache;
receiving, by the processor, a second e-mail message associated with the IP address;
performing, by the processor, an action on the second e-mail message based on the cached result of the fast classifier, the action performed on the second-email message comprising an action type selected from a group of action types consisting of discarding the second e-mail message, delivering the second e-mail message to a spam box, and not delivering the second e-mail message;
classifying, by the processor, the IP address with a full classifier based on the one or more full features, the full classifier further classifying the user based on information derived from online user activity; and
caching, by the processor, a result of the full classifier in the entity classification cache.
6 Assignments
0 Petitions
Accused Products
Abstract
Embodiments are directed towards multi-level entity classification. An object associated with an entity is received. In one embodiment the object comprises and email and the entity comprises the IP address of a sending email server. If the entity has already been classified, as indicated by an entity classification cache, then a corresponding action is taken on the object. However, if the entity has not been classified, the entity is submitted to a fast classifier for classification. A feature collector concurrently fetches available features, including fast features and full features. The fast classifier classifies the entity based on the fast features, storing the result in the entity classification cache. Subsequent objects associated with the entity are processed based on the cached result of the fast classifier. Then, a full classifier classifies the entity based on at least the full features, storing the result in the entity classification cache.
98 Citations
16 Claims
-
1. A method, comprising:
-
receiving, by a processor, a first e-mail message associated with an IP address of a mail server that sent the first e-mail message; concurrently fetching, by the processor, a plurality of features associated with the IP address, the plurality of features including one or more fast features and one or more of full features, wherein a fast feature comprises a feature that fetches in less than a predefined time threshold, and wherein a full feature fetches in greater than the predefined time threshold; classifying, by the processor, the IP address with a fast classifier based on the one or more fast features, the fast classifier further classifying a user based on information obtained during user registration with an on-line entity; determining, by the fast classifier, that the mail server is a spammer; caching, by the processor, a result of the fast classifier in an entity classification cache; receiving, by the processor, a second e-mail message associated with the IP address; performing, by the processor, an action on the second e-mail message based on the cached result of the fast classifier, the action performed on the second-email message comprising an action type selected from a group of action types consisting of discarding the second e-mail message, delivering the second e-mail message to a spam box, and not delivering the second e-mail message; classifying, by the processor, the IP address with a full classifier based on the one or more full features, the full classifier further classifying the user based on information derived from online user activity; and caching, by the processor, a result of the full classifier in the entity classification cache. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A computing device, comprising:
-
a processor; a storage medium for tangibly storing thereon program logic for execution by the processor, the stored program logic comprising; first e-mail message receiving logic executed by the processor for receiving a first e-mail message associated with an IP address of a mail server that sent the first e-mail message; fetching logic executed by the processor for concurrently fetching a plurality of features associated with the IP address, the plurality of features including one or more fast features and one or more of full features, wherein a fast feature comprises a feature that fetches in less than a predefined time threshold, and wherein a full feature fetches in greater than the predefined time threshold; fast classifier classifying logic executed by the processor for classifying the IP address with a fast classifier based on the one or more fast features, the fast classifier further classifying a user based on information obtained during user registration with an on-line entity; spammer determining logic executed by the processor for determining, by the fast classifier, that the mail server is a spammer; caching logic executed by the processor for caching a result of the fast classifier in an entity classification cache; second e-mail message receiving logic executed by the processor for receiving a second e-mail message associated with the IP address; action performing logic executed by the processor for performing an action on the second e-mail message based on the cached result of the fast classifier, the action performed on the second-email message comprising an action type selected from a group of action types consisting of discarding the second e-mail message, delivering the second e-mail message to a spam box, and not delivering the second e-mail message; full classifier classifying logic executed by the processor for classifying the IP address with a full classifier based on the one or more full features, the full classifier further classifying the user based on information derived from online user activity; and result caching logic executed by the processor for caching a result of the full classifier in the entity classification cache. - View Dependent Claims (8, 9, 10, 11)
-
-
12. A non-transitory computer-readable storage medium comprising computer-executable instructions for execution by a processor, the instructions comprising:
-
receiving a first e-mail message associated with an IP address of a mail server that sent the first e-mail message; concurrently fetching a plurality of features associated with the IP address, the plurality of features including one or more fast features and one or more of full features, wherein a fast feature comprises a feature that fetches in less than a predefined time threshold, and wherein a full feature fetches in greater than the predefined time threshold; classifying the IP address with a fast classifier based on the one or more fast features, the fast classifier further classifying a user based on information obtained during user registration with an on-line entity; determining, by the fast classifier, that the mail server is a spammer; caching a result of the fast classifier in an entity classification cache; receiving a second e-mail message associated with the IP address; performing an action on the second e-mail message based on the cache result of the fast classifier, the action performed on the second-email message comprising an action type selected from a group of action types consisting of discarding the second e-mail message, delivering the second e-mail message to a spam box, and not delivering the second e-mail message; classifying the IP address with a full classifier based on the one or more full features, the full classifier further classifying the user based on information derived from online user activity; and caching a result of the full classifier in the entity classification cache. - View Dependent Claims (13, 14, 15, 16)
-
Specification