Method and apparatus for filtering email spam using email noise reduction
First Claim
Patent Images
1. A method, comprising:
- a computer system detecting, in an email message, one or more character references added to the email message to avoid spam filtering, wherein each character reference specifies a position of a character within a first character set;
the computer system modifying content of the email message, including by converting at least one of the one or more character references to a character corresponding to the specified position within the first character set; and
the computer system comparing the modified content of the email message with content of a spam message;
wherein each of the one or more character references is an HTML character reference of the form “
&
#<
num>
”
, wherein <
num>
is a value that specifies a position of a character within the first character set.
3 Assignments
0 Petitions
Accused Products
Abstract
A method and system for filtering email spam using email noise reduction are described. In one embodiment, the method includes detecting, in an email message, data indicative of noise added to the email message to avoid spam filtering. The method further includes modifying the content of the email message to reduce the noise, and comparing the modified content of the email message with the content of a spam message.
-
Citations
22 Claims
-
1. A method, comprising:
-
a computer system detecting, in an email message, one or more character references added to the email message to avoid spam filtering, wherein each character reference specifies a position of a character within a first character set; the computer system modifying content of the email message, including by converting at least one of the one or more character references to a character corresponding to the specified position within the first character set; and the computer system comparing the modified content of the email message with content of a spam message; wherein each of the one or more character references is an HTML character reference of the form “
&
#<
num>
”
, wherein <
num>
is a value that specifies a position of a character within the first character set. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A system, comprising:
-
one or more processors; a memory having stored therein program instructions executable by the one or more processors to; detect, in an email message, one or more character references added to the email message to avoid spam filtering, wherein each character reference specifies a position of a character within a first character set; modify content of the email message, including by converting at least one of the one or more character references to a character corresponding to the specified position within the first character set; compare the modified content of the email message with content of a spam message; and wherein each of the one or more character references is an HTML character reference of the form “
&
#<
num>
”
, wherein <
num>
is a value that specifies a position of a character within the first character set. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A non-transitory tangible computer-readable medium having stored thereon program instructions executable by a computer system to:
-
detect, in an email message, one or more character references added to the email message to avoid spam filtering, wherein each character reference specifies a position of a character within a first character set; modify content of the email message, including by converting at least one of the one or more character references to a character corresponding to the specified position within the first character set; compare the modified content of the email message with content of a spam message; and wherein each of the one or more character references is an HTML character reference of the form “
&
#<
num>
”
, wherein <
num>
is a value that specifies a position of a character within the first character set. - View Dependent Claims (14, 15, 16, 17)
-
-
18. A non-transitory tangible computer-readable medium having stored thereon program instructions that are computer executable to:
-
detect, in an electronic message, one or more character references, wherein each character reference specifies a position of a character within a first character set; modify content of the electronic message, including by converting at least one of the one or more character references to a character corresponding to the specified position within the first character set; and compare the modified content of the electronic message with a spam message; wherein each of the one or more character references is an HTML character reference of the form “
&
#<
num>
”
, wherein <
num>
is a value that specifies a position of a character within the first character set. - View Dependent Claims (19)
-
-
20. An apparatus, comprising:
-
a processor; a memory having stored therein program instructions executable by the processor to; detect, in an electronic message, one or more character references, wherein each character reference specifies a position of a character within a first character set; modify content of the electronic message, including by converting at least one of the one or more character references to a character corresponding to the specified position within the first character set; and compare the modified content of the electronic message with a spam message; wherein each of the one or more character references is an HTML character reference of the form “
&
#<
num>
”
, wherein <
num>
is a value that specifies a position of a character within the first character set. - View Dependent Claims (21, 22)
-
Specification