Identification of content
First Claim
1. A method for identifying content in electronic messages, the method comprising:
- receiving an electronic message having been classified as spam;
- executing instructions stored in memory, wherein execution of the instructions by a processor:
  - detects content included in the received electronic message,
  - identifies metadata for the detected content, the metadata including a numerical signature comprising a concatenation of a plurality of numbers, each number characterizing an aspect of the content,
  - generates at least one variation for the numerical signature, wherein the variation characterizes a plurality of aspects of content that is different from any content in the received electronic message, and
  - generates a thumbprint based on the numerical signature and a thumbprint based on the at least one variation; and
- storing the thumbprints in a database in memory, the thumbprints being accessible from the database for comparison to a thumbprint of a subsequently received message, wherein the subsequently received message is classified as spam or not spam based on the comparison to the stored thumbprints.
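The storing side of this claim can be illustrated with a minimal Python sketch. Everything concrete here is an assumption for illustration and is not taken from the patent text: the three example aspect numbers, the zero-padded concatenation, the plus-or-minus-one perturbation used to produce variations, SHA-256 as the thumbprint function, and a plain `set` standing in for the database. The function names are hypothetical.

```python
import hashlib
from itertools import product


def numerical_signature(aspects, width=5):
    """Concatenate the aspect numbers into one digit string.

    Zero-padding to a fixed width is an assumption; it keeps the
    individual numbers separable after concatenation.
    """
    return "".join(f"{n:0{width}d}" for n in aspects)


def variations(aspects, delta=1):
    """Yield aspect tuples that differ from the original in at least one number.

    Shifting each number by -delta, 0, or +delta is a hypothetical way to
    describe content that is close to, but not the same as, the received content.
    """
    for offsets in product((-delta, 0, delta), repeat=len(aspects)):
        if any(offsets):  # skip the all-zero offset, which is the original
            candidate = tuple(a + o for a, o in zip(aspects, offsets))
            if all(c >= 0 for c in candidate):
                yield candidate


def thumbprint(signature):
    """Derive a thumbprint from a signature (SHA-256 is an assumed choice)."""
    return hashlib.sha256(signature.encode("ascii")).hexdigest()


def store_spam_thumbprints(aspects, database):
    """Store thumbprints for the signature and for every variation."""
    database.add(thumbprint(numerical_signature(aspects)))
    for variant in variations(aspects):
        database.add(thumbprint(numerical_signature(variant)))


# Hypothetical aspect numbers for content found in a message already classified as spam.
spam_db = set()
store_spam_thumbprints((640, 480, 16), spam_db)
```

With three aspect numbers, this sketch stores one thumbprint for the original signature and 26 for its variations. A later message whose content differs slightly, for example `(641, 480, 16)`, would then match a stored thumbprint.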
Abstract
Systems and methods for identifying content in electronic messages are provided. An electronic message may include certain content. The content is detected and analyzed to identify any metadata. The metadata may include a numerical signature characterizing the content. A thumbprint is generated based on the numerical signature. The thumbprint may then be compared to thumbprints of previously received messages. The comparison allows for classification of the electronic message as spam or not spam.
12 Citations
22 Claims
1. A method for identifying content in electronic messages, the method comprising:
- receiving an electronic message having been classified as spam;
- executing instructions stored in memory, wherein execution of the instructions by a processor:
  - detects content included in the received electronic message,
  - identifies metadata for the detected content, the metadata including a numerical signature comprising a concatenation of a plurality of numbers, each number characterizing an aspect of the content,
  - generates at least one variation for the numerical signature, wherein the variation characterizes a plurality of aspects of content that is different from any content in the received electronic message, and
  - generates a thumbprint based on the numerical signature and a thumbprint based on the at least one variation; and
- storing the thumbprints in a database in memory, the thumbprints being accessible from the database for comparison to a thumbprint of a subsequently received message, wherein the subsequently received message is classified as spam or not spam based on the comparison to the stored thumbprints.
10. A non-transitory computer-readable storage medium having embodied thereon a program, the program being executable by a processor to perform a method for identifying content in electronic messages, the method comprising:
- receiving an electronic message classified as spam;
- detecting content included in the received electronic message;
- identifying metadata for the detected content, the metadata including a numerical signature comprising a concatenation of a plurality of numbers, each number characterizing an aspect of the content;
- generating at least one variation for the numerical signature, wherein the variation characterizes a plurality of aspects of content that is different from any content in the received electronic message;
- generating a thumbprint based on the numerical signature and a thumbprint based on the at least one variation; and
- storing the thumbprints in a database, the thumbprints being accessible from the database for comparison to a thumbprint of a subsequently received message, wherein the subsequently received message is classified as spam or not spam based on the comparison to the stored thumbprints.
11. A method for identifying content in electronic messages, the method comprising:
- receiving an electronic message; and
- executing instructions stored in memory, wherein execution of the instructions by a processor:
  - detects content included in the received electronic message,
  - identifies metadata for the detected content, the metadata including a numerical signature comprising a concatenation of a plurality of numbers, each number characterizing an aspect of the content,
  - generates at least one variation for the numerical signature, wherein the variation characterizes a plurality of aspects of content that is different from any content in the received electronic message,
  - generates a thumbprint based on the numerical signature of the metadata and a thumbprint based on the variation, and
  - compares the generated thumbprints to thumbprint information stored in a database storing thumbprint information associated with each of a plurality of content found in electronic messages previously received and classified as spam, wherein the electronic message is classified as spam or not spam based on the comparison.
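The comparison step of claim 11 can be sketched in Python as follows. This is an illustration under stated assumptions, not the patented implementation: the aspect numbers, the fixed-width concatenation, SHA-256 as the thumbprint function, the one-number-at-a-time variation scheme, and the "any match means spam" decision rule are all hypothetical choices, as are the function names.

```python
import hashlib


def thumbprint(aspects):
    """Thumbprint of the numerical signature formed from the aspect numbers.

    The zero-padded concatenation and SHA-256 are assumed choices.
    """
    signature = "".join(f"{n:05d}" for n in aspects)
    return hashlib.sha256(signature.encode("ascii")).hexdigest()


def candidate_thumbprints(aspects, delta=1):
    """Thumbprints for the signature and for simple variations of it.

    Each variation shifts one aspect number by +/- delta (a hypothetical scheme).
    """
    aspects = tuple(aspects)
    prints = {thumbprint(aspects)}
    for i, value in enumerate(aspects):
        for shifted in (value - delta, value + delta):
            if shifted >= 0:
                prints.add(thumbprint(aspects[:i] + (shifted,) + aspects[i + 1:]))
    return prints


def classify(aspects, spam_thumbprints):
    """Classify as spam if any generated thumbprint is already stored.

    Treating a single match as decisive is an assumption; the claim only
    says the classification is based on the comparison.
    """
    if candidate_thumbprints(aspects) & spam_thumbprints:
        return "spam"
    return "not spam"


# Hypothetical database holding one thumbprint from a previously classified spam message.
known_spam = {thumbprint((640, 480, 16))}
print(classify((641, 480, 16), known_spam))  # prints "spam"
print(classify((800, 600, 24), known_spam))  # prints "not spam"
```

In this sketch the variations are generated on the incoming message, so a near-duplicate can match even when the database holds only the original spam thumbprint.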
20. A non-transitory computer-readable storage medium having embodied thereon a program, the program being executable by a processor to perform a method for identifying content in electronic messages, the method comprising:
- receiving an electronic message;
- detecting content included in the received electronic message;
- identifying metadata for the detected content, the metadata including a numerical signature comprising a concatenation of a plurality of numbers, each number characterizing an aspect of the content;
- generating at least one variation for the numerical signature, wherein the variation characterizes a plurality of aspects of content that is different from any content in the received electronic message,
- generating a thumbprint based on the numerical signature of the metadata and a thumbprint based on the variation; and
- comparing the generated thumbprints to thumbprint information stored in a database storing thumbprint information associated with each of a plurality of content found in electronic messages previously received and classified as spam, wherein the electronic message is classified as spam or not spam based on the comparison.
21. A system for identifying content in electronic messages, the system comprising:
- a memory configured to store a database of thumbprint information associated with each of a plurality of content found in electronic messages previously received and classified as spam;
- an interface configured to receive an electronic message; and
- a processor configured to execute instructions stored in memory, wherein execution of the instructions:
  - detects content included in the received electronic message,
  - identifies metadata for the detected content, the metadata including a numerical signature comprising a concatenation of a plurality of numbers, each number characterizing an aspect of the content,
  - generates at least one variation for the numerical signature, wherein the variation characterizes a plurality of aspects of content that is different from any content in the received electronic message,
  - generates a thumbprint based on the numerical signature of the metadata and a thumbprint based on the variation, and
  - compares the thumbprint to the thumbprint information stored in the database, wherein the electronic message is classified as spam or not spam based on the comparison.
Specification