MESSAGE RENDERING FOR IDENTIFICATION OF CONTENT FEATURES
Abstract
Architecture for detecting and removing obfuscating clutter from the subject and/or body of a message, e.g., e-mail, prior to filtering of the message, to identify junk messages commonly referred to as spam. The technique utilizes the powerful features built into an HTML rendering engine to strip the HTML instructions for all non-substantive aspects of the message. Pre-processing includes pre-rendering of the message into a final format, which final format is that which is displayed by the rendering engine to the user. The final format message is then converted to a text-only format to remove graphics, color, non-text decoration, and spacing that cannot be rendered as ASCII-style or Unicode-style characters. The result is essentially to reduce each message to its common denominator essentials so that the junk mail filter can view each message on an equal basis.
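The render-then-convert step described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: it assumes Python's standard `html.parser` as a stand-in for a full HTML rendering engine, and the names `TextOnlyRenderer` and `to_text_only` are hypothetical. It shows how clutter inserted inside a word (comments, empty tags, per-letter font changes) disappears once only reader-visible characters are kept.

```python
from html.parser import HTMLParser


class TextOnlyRenderer(HTMLParser):
    """Collects only the character data a reader would see (hypothetical helper)."""

    # Elements whose contents are never displayed as message text.
    HIDDEN_ELEMENTS = {"script", "style", "head", "title"}
    # Elements that introduce a visual break, approximated here by a space.
    BLOCK_ELEMENTS = {"p", "div", "br", "tr", "li", "h1", "h2", "h3"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self._chunks = []
        self._hidden_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.HIDDEN_ELEMENTS:
            self._hidden_depth += 1
        elif tag in self.BLOCK_ELEMENTS:
            self._chunks.append(" ")

    def handle_endtag(self, tag):
        if tag in self.HIDDEN_ELEMENTS and self._hidden_depth > 0:
            self._hidden_depth -= 1
        elif tag in self.BLOCK_ELEMENTS:
            self._chunks.append(" ")

    def handle_data(self, data):
        # Inline tags and comments contribute nothing, so text split by
        # them is rejoined simply by concatenating the data chunks.
        if self._hidden_depth == 0:
            self._chunks.append(data)

    def text(self):
        # Collapse runs of whitespace into single spaces.
        return " ".join("".join(self._chunks).split())


def to_text_only(html_message: str) -> str:
    """Reduce an HTML message to the plain characters a reader would see."""
    renderer = TextOnlyRenderer()
    renderer.feed(html_message)
    renderer.close()
    return renderer.text()


obfuscated = "<p>Buy V<!-- zq -->ia<b></b>gra n<font color='red'>o</font>w</p>"
print(to_text_only(obfuscated))  # Buy Viagra now
```

A real rendering engine would also account for styling that hides text (for example, matching foreground and background colors), which this sketch does not attempt.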
28 Claims
1. A message filtering system, comprising:
a computer-readable storage medium, comprising:
a pre-rendering component that receives a message containing unrendered text and a non-text element intended to thwart a junk filter by varying the unrendered text, and renders the message in a user-perceivable format;
a converting component for converting the rendered message into a text-only message, mitigating the effect of the non-text element; and
a filtering component that filters the text-only message based upon predetermined content related to a degree of visibility of user-perceivable content of the rendered message, the filtering component including a weighting component that adjusts a weighting parameter associated with text of the text-only message to determine if the text should be removed, wherein filtering entails a determination of whether to handle the message as a junk message or a legitimate message.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9.
10. A system for preparing a message for filtering, comprising:
a computer-readable storage medium, comprising:
means for receiving a message containing unrendered text and a non-text element intended to thwart a junk filter by varying the unrendered text;
means for pre-rendering the message into a first format corresponding to content intended to be user perceived;
means for adjusting a weighting parameter associated with text of the message to determine if the text should be removed related to a degree of visibility of the user-perceivable content of the rendered message;
means for converting the message of the first format into a character-only message; and
means for a determination of whether to handle the message as a junk message or a legitimate message based on the character-only message.
Dependent claims: 11, 12, 13, 14, 15.
16. A method of message pre-processing for a filter, comprising:
receiving a message containing unrendered text and a non-text element intended to thwart a junk filter by varying the unrendered text;
pre-rendering the message into a first format corresponding to content intended to be user perceived, mitigating the effect of the non-text element;
adjusting a weighting parameter associated with text of the message to determine if the text should be removed related to a degree of visibility of the user-perceivable content of the rendered message;
converting the message of the first format into a character-only message; and
determining whether to handle the message as a junk message or a legitimate message based on the character-only message.
Dependent claims: 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28.
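The weighting and determination steps recited in the method claim might look something like the sketch below. Everything here is an assumption made for illustration: the `RenderedSpan` type, the 0.0 to 1.0 visibility scale, the `min_visibility` cutoff, and the term-score table are hypothetical and are not taken from the claims, which do not specify how the weighting parameter is computed or how the final determination is made.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RenderedSpan:
    """A piece of rendered text plus an assumed visibility estimate."""
    text: str
    visibility: float  # 0.0 = invisible to the reader, 1.0 = fully visible


def visible_text(spans, min_visibility=0.5):
    """Drop spans the reader would barely perceive; join the rest as plain text."""
    kept = [s.text for s in spans if s.visibility >= min_visibility]
    return " ".join(" ".join(kept).split())


def classify(text, junk_terms, threshold=1.0):
    """Toy junk decision: sum per-term scores and compare with a threshold."""
    tokens = text.lower().split()
    score = sum(junk_terms.get(tok, 0.0) for tok in tokens)
    return "junk" if score >= threshold else "legitimate"


# Hypothetical rendered message: innocuous "chaff" text is nearly invisible
# (for example, white text on a white background).
spans = [
    RenderedSpan("cheap", 1.0),
    RenderedSpan("meeting agenda quarterly report", 0.02),
    RenderedSpan("pills", 0.9),
]
junk_terms = {"cheap": 0.6, "pills": 0.7}  # illustrative scores only

text = visible_text(spans)
print(text)                        # cheap pills
print(classify(text, junk_terms))  # junk
```

Removing low-visibility text before classification means hidden chaff cannot dilute the signal from the words the recipient actually sees.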
Specification