×

Email analysis using fuzzy matching of text

  • US 7,644,127 B2
  • Filed: 03/09/2005
  • Issued: 01/05/2010
  • Est. Priority Date: 03/09/2004
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method for analyzing character codes in text of a message to determine a probability that the message is spam, the method comprising:

  • (a) receiving a message that includes character codes in text of the message;

    (b) identifying character codes of the text that are likely being used to obfuscate a word or phrase of the text;

    (c) deobfuscating each word or phrase of the text that is identified at step (b) as likely being obfuscated, to produce deobfuscated text; and

    (d) determining an extent that the character codes of the text are identified as likely being used to obfuscate a word or phrase of the text by determining one or more of the followinga quantity of words or phrases identified at step (b) as likely being obfuscated;

    which particular words or phrases are identified at step (b) as likely being obfuscated, and(e) analyzing the deobfuscated text by comparing the deobfuscated text to text of one or more other messages known to be spam;

    (f) determining a probability that the message is spam based on both(i) results of the analyzing the deobfuscated text, by comparing the deobfuscated text to text of one or more other messages known to be spam, performed at step (e), and(ii) results of the determining the extent that the character codes of the text are likely being used to obfuscate a word or phrase of the text, performed at step (d);

    wherein at least steps (b), (c), (d), (e) and (f) are performed by one or more processors.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×