Message rendering for identification of content features
First Claim
1. A system that facilitates message filtering, comprising:
- A computer readable storage medium comprising;
a pre-rendering component that receives a message and renders the message into a first format, the received message including one or more compressed images, for each of the one or more compressed images a first hash is generated, wherein the one or more compressed images are rendered in an uncompressed mode, for each of the one or more compressed images rendered in the uncompressed mode a second hash is generated, the first and second hashes are compared to determine if the message contains junk indicia;
a converting component that converts the message in the first format into a character-only message anda filtering component that processes the character-only message for predetermined content and routes the received message based upon the predetermined content.
2 Assignments
0 Petitions
Accused Products
Abstract
Architecture for detecting and removing obfuscating clutter from the subject and/or body of a message, e.g., e-mail, prior to filtering of the message, to identify junk messages commonly referred to as spam. The technique utilizes the powerful features built into an HTML rendering engine to strip the HTML instructions for all non-substantive aspects of the message. Pre-processing includes pre-rendering of the message into a final format, which final format is that which is displayed by the rendering engine to the user. The final format message is then converted to a text-only format to remove graphics, color, non-text decoration, and spacing that cannot be rendered as ASCII-style or Unicode-style characters. The result is essentially to reduce each message to its common denominator essentials so that the junk mail filter can view each message on an equal basis.
-
Citations
33 Claims
-
1. A system that facilitates message filtering, comprising:
A computer readable storage medium comprising; a pre-rendering component that receives a message and renders the message into a first format, the received message including one or more compressed images, for each of the one or more compressed images a first hash is generated, wherein the one or more compressed images are rendered in an uncompressed mode, for each of the one or more compressed images rendered in the uncompressed mode a second hash is generated, the first and second hashes are compared to determine if the message contains junk indicia; a converting component that converts the message in the first format into a character-only message and a filtering component that processes the character-only message for predetermined content and routes the received message based upon the predetermined content. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20)
-
21. A method of filtering a message, comprising:
-
pre-rendering the message into a final format, the message including one or more compressed images, for each of the one or more compressed images a first hash is generated, wherein the one or more compressed images are rendered in an uncompressed mode, for each of the one or more compressed images rendered in the uncompressed mode a second hash is generated, the first and second hashes are compared to determine if the message contains junk indicia; converting the message of the final format into a text-only message; processing the text-only message for predetermined content; and routing the message based upon the predetermined content. - View Dependent Claims (22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33)
-
Specification