Signature generation using message summaries
First Claim
Patent Images
1. A method for processing electronic-mail messages, the method comprising:
- storing information in memory regarding a message previously classified as an unsolicited message, the message including a collection of words, wherein the collection of words is not helpful for the purpose of distinguishing the message, the collection of words including a plurality of words that can be found in a dictionary;
executing instructions stored in memory, wherein execution of the instructions by a processor;
identifies from the collection of words one or more dictionary words known to be commonly occurring or known to be associated with spam,removes the identified one or more dictionary words from the message,replaces one or more remaining words in the message with an associated canonical equivalent to generate a resulting summary, andgenerates a signature based on the resulting summary, andstoring the generated signature in memory for use in classifying and processing subsequently received messages, wherein a subsequently received message is classified based on the signature and processed based on the classification.
23 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods for processing a message are provided. A message may be processed to generate a message summary by removing or replacing certain words, phrases, sentences, punctuation, and the like. Message signatures based upon the message summary may be generated and stored in a signature database, which may be used to identify and/or classify spam messages. Subsequently received messages may be classified by signature and processed based on classification.
-
Citations
17 Claims
-
1. A method for processing electronic-mail messages, the method comprising:
-
storing information in memory regarding a message previously classified as an unsolicited message, the message including a collection of words, wherein the collection of words is not helpful for the purpose of distinguishing the message, the collection of words including a plurality of words that can be found in a dictionary; executing instructions stored in memory, wherein execution of the instructions by a processor; identifies from the collection of words one or more dictionary words known to be commonly occurring or known to be associated with spam, removes the identified one or more dictionary words from the message, replaces one or more remaining words in the message with an associated canonical equivalent to generate a resulting summary, and generates a signature based on the resulting summary, and storing the generated signature in memory for use in classifying and processing subsequently received messages, wherein a subsequently received message is classified based on the signature and processed based on the classification. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A method for processing electronic-mail messages, the method comprising:
-
receiving a message sent over a communication network, the message including a collection of words, wherein the collection of words is not helpful for the purpose of distinguishing the message, the collection of words including a plurality of words that can be found in a dictionary; and executing instructions stored in memory, wherein execution of the instructions by a processor; identifies from the collection of words one or more dictionary words known to be commonly occurring or known to be associated with spam, and removes the identified one or more dictionary words from the message to generate a resulting summary, classifies the message based on at least the resulting summary, and processes the message based on at least the classification of the message. - View Dependent Claims (13, 14)
-
-
15. A system for processing a message comprising:
-
memory for storing information regarding a message previously classified as an unsolicited message, the message including a collection of words, wherein the collection of words is not helpful for the purpose of distinguishing the message, the collection of words including a plurality of words that can be found in a dictionary; a message gateway including a summarization and signature generation engine stored in memory, the summarization and signature engine executable by a processor to; identify from the collection of words one or more dictionary words known to be commonly occurring or known to be associated with spam, remove the identified one or more dictionary words in the message, replace one or more remaining words in the message with an associated canonical equivalent to generate a resulting summary, and generate one or more signatures based on at least the resulting summary, wherein the one or more signatures are stored in a database configured to store information concerning the one or more signatures; and a server configured to process subsequently received messages based on at least signature information generated by the message gateway and stored in the database. - View Dependent Claims (16)
-
-
17. A non-transitory computer-readable storage medium having embodied thereon a program, the program being executable by a processor to perform a method for processing a message, the method comprising:
-
storing information regarding a message previously classified as an unsolicited message, the message including a collection of words, wherein the collection of words is not helpful for the purpose of distinguishing the message, the collection of words including a plurality of words that can be found in a dictionary; identifying from the collection of words one or more dictionary words known to be commonly occurring or known to be associated with spam; removing the identified one or more dictionary words from the message; replacing one or more remaining words in the message with an associated canonical equivalent to generate a resulting summary; generating a signature based on the resulting summary; classifying a subsequently received message based on the signature; and processing the subsequently received message based on the classification of the subsequently received message.
-
Specification