Advanced URL and IP features
First Claim
Patent Images
1. A spam detection system comprising:
- a component that receives an item and extracts a set of features associated with an origination of a message or part thereof and/or information that enables an intended recipient to contact, respond to, or act on the message, the features comprising at least one of IP address-based features and URL-based features;
an analysis component that analyzes at least a subset of the features; and
at least one filter that is trained on at least a subset of the features to facilitate distinguishing spam messages from good messages.
2 Assignments
0 Petitions
Accused Products
Abstract
Disclosed are systems and methods that facilitate spam detection and prevention at least in part by building or training filters using advanced IP address and/or URL features in connection with machine learning techniques. A variety of advanced IP address related features can be generated from performing a reverse IP lookup. Similarly, many different advanced URL based features can be created from analyzing at least a portion of any one URL detected in a message.
176 Citations
43 Claims
-
1. A spam detection system comprising:
-
a component that receives an item and extracts a set of features associated with an origination of a message or part thereof and/or information that enables an intended recipient to contact, respond to, or act on the message, the features comprising at least one of IP address-based features and URL-based features;
an analysis component that analyzes at least a subset of the features; and
at least one filter that is trained on at least a subset of the features to facilitate distinguishing spam messages from good messages. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 30)
-
-
11. A spam detection and filtering system comprising:
-
a component that uses traceroute to gather additional information about at least one message; and
a filtering component that employs the traceroute information to facilitate distinguishing between spam and good messages. - View Dependent Claims (12, 13, 14, 15, 16)
-
-
17. A spam detection and filtering system comprising:
-
a component that receives an incoming message; and
a filter that employs a combination of URL-based inputs detected in a message to facilitate determining whether the message is spam. - View Dependent Claims (18, 19, 20, 21)
-
-
22. A spam detection and filtering system comprising:
-
a component that receives an incoming message;
a component that detects URLs and redirected URLs; and
a machine learning filter that employs at least a portion of one or more redirected URLs detected in a message as inputs to facilitate determining whether the message is spam. - View Dependent Claims (23, 24, 25)
-
-
26. A spam detection and filtering system comprising:
-
a component that detects URLs in a message;
a contact process component comprising at least one of the following contact routes;
URL detected in the message including at least one of an IP address of the URL, a DNS server of the URL, a traceroute of the IP address of the host of the URL, an IP address of the DNS server of the URL, version information of the DNS server, and the traceroute of the IP address of the DNS server; and
a filter component that employs at least one of the contact routes to facilitate determining whether the message is spam. - View Dependent Claims (27, 28, 29)
-
-
31. A spam filtering method comprising:
-
extracting at least one of IP address-based data and URL-based data from a message;
generating at least one of IP address-based features and the URL-based features from the respective data to be used as inputs to at least one filter; and
employing at least one filter trained on at least a subset of the inputs to facilitate distinguishing spam messages from good messages. - View Dependent Claims (32, 33, 34, 35, 36, 37, 38, 39)
-
-
40. A spam detection and filtering method comprising:
-
receiving incoming messages;
examining a contact process of obtaining data from a URL to determine commonalities among a plurality of hostnames to facilitate generating features; and
employing at least one filter trained at least in part on at least a subset of the features to facilitate determining whether messages are spam. - View Dependent Claims (41)
-
-
42. A spam filtering system comprising:
-
means for extracting at least one of IP address-based data and URL-based data from a message;
means for generating at least one of IP address-based features and the URL-based features from the respective data to be used as inputs to at least one filter; and
means for employing at least one filter trained on at least a subset of the inputs to facilitate distinguishing spam messages from good messages.
-
-
43. A data packet adapted to be transmitted between two or more computer
processes facilitating improved detection of spam, the data packet comprising: - information associated with generating at least one of IP address-based features and the URL-based features from respective data to be used as inputs to at least one filter; and
employing at least one machine learning filter trained on at least a subset of the inputs to facilitate distinguishing spam messages from good messages.
- information associated with generating at least one of IP address-based features and the URL-based features from respective data to be used as inputs to at least one filter; and
Specification