Identification of content by metadata
Abstract
Systems and methods for identifying content in electronic messages are provided. An electronic message may include certain content. The content is detected and analyzed to identify any metadata. The metadata may include a numerical signature characterizing the content. A thumbprint is generated based on the numerical signature. The thumbprint may then be compared to thumbprints of previously received messages. The comparison allows for classification of the electronic message as spam or not spam.
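The flow described in the abstract (numerical signature, then thumbprint, then comparison against earlier thumbprints) can be sketched roughly as follows. This is only an illustration under stated assumptions: the abstract does not say how the thumbprint is computed, so the quantize-then-hash scheme, the function names `make_thumbprint` and `classify_by_thumbprint`, and the `bucket` parameter are all hypothetical.

```python
import hashlib


def make_thumbprint(signature, bucket=1):
    """Turn a numerical signature into a short hex thumbprint.

    Assumption: each value is quantized into buckets of width `bucket`
    before hashing, so nearby signatures can share a thumbprint.
    The source does not specify this scheme.
    """
    quantized = [int(value) // bucket for value in signature]
    payload = ",".join(str(q) for q in quantized).encode("utf-8")
    return hashlib.sha256(payload).hexdigest()[:16]


def classify_by_thumbprint(signature, known_spam_thumbprints, bucket=1):
    """Compare the new thumbprint with thumbprints of earlier messages."""
    thumbprint = make_thumbprint(signature, bucket)
    return "spam" if thumbprint in known_spam_thumbprints else "not spam"


# Hypothetical signature: (width, height, byte size) of an attached image.
known = {make_thumbprint((640, 480, 20480), bucket=16)}
print(classify_by_thumbprint((641, 482, 20485), known, bucket=16))
print(classify_by_thumbprint((800, 600, 51200), known, bucket=16))
```

Bucketing is a blunt tool: two signatures that sit on opposite sides of a bucket boundary hash differently even when they are close, so a real system would likely need a more tolerant comparison.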
20 Claims
1. A method for filtering messages, the method comprising:
- receiving a first electronic message via a network interface, the first electronic message including a first document file;
- extracting a first metadata dataset characterizing the first document file;
- retrieving a second metadata dataset from a database, the second metadata dataset characterizing a second document file included in a second electronic message;
- identifying that the first metadata dataset matches the second metadata dataset within a previously specified margin of error, wherein the previously specified margin of error represents a previously specified range of variations between the first metadata dataset and the second metadata dataset; and
- classifying the first electronic message as spam in response to the identification that the first metadata dataset matches the second metadata dataset within the previously specified margin of error.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14
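The matching step of claim 1 can be illustrated with a minimal sketch. It assumes that a metadata dataset is a dictionary of numeric fields and that the margin of error is a per-field maximum absolute difference; the claim fixes neither representation, and the field names `page_count` and `byte_size` are hypothetical.

```python
def matches_within_margin(first, second, margin):
    """Return True if every field listed in `margin` differs by no more
    than its allowed amount between the two metadata datasets."""
    for field, allowed in margin.items():
        if field not in first or field not in second:
            return False  # a field that is missing cannot be compared
        if abs(first[field] - second[field]) > allowed:
            return False
    return True


def classify_message(first_metadata, stored_metadata, margin):
    """Classify as spam if any stored dataset matches within the margin."""
    for second_metadata in stored_metadata:
        if matches_within_margin(first_metadata, second_metadata, margin):
            return "spam"
    return "not spam"


# Hypothetical metadata fields and margin, chosen only for illustration.
margin = {"page_count": 0, "byte_size": 512}
stored = [{"page_count": 3, "byte_size": 48000}]
print(classify_message({"page_count": 3, "byte_size": 48300}, stored, margin))
print(classify_message({"page_count": 4, "byte_size": 48300}, stored, margin))
```

Fixing the margin before any message arrives mirrors the "previously specified" wording of the claim: the tolerance is an input to the comparison, not something derived from the message being tested.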
15. A system for filtering messages, the system comprising:
- a communication transceiver that receives a first electronic message over a communication network, the first electronic message including a first document file;
- a memory that stores instructions; and
- a processor, wherein execution of the instructions by the processor causes the system to:
  - extract a first metadata dataset characterizing the first document file,
  - retrieve a second metadata dataset from a database, the second metadata dataset characterizing a second document file included in a second electronic message,
  - identify that the first metadata dataset matches the second metadata dataset within a previously specified margin of error, wherein the previously specified margin of error represents a previously specified range of variations between the first metadata dataset and the second metadata dataset, and
  - classify the first electronic message as spam in response to the identification that the first metadata dataset matches the second metadata dataset within the previously specified margin of error.
Dependent claims: 16, 17, 18
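As a rough illustration of the instructions the processor executes in claim 15, the sketch below keeps previously seen metadata in an in-memory SQLite database and applies the margin of error inside the query. It covers only the retrieve, identify, and classify steps. The class name `MetadataFilter`, the table layout, and the two metadata fields are assumptions made for this example, and metadata extraction is assumed to have happened already.

```python
import sqlite3


class MetadataFilter:
    """Hypothetical filter holding metadata of earlier spam attachments."""

    def __init__(self, margin):
        # margin: allowed absolute difference per field, fixed in advance
        self.margin = margin
        self.db = sqlite3.connect(":memory:")
        self.db.execute(
            "CREATE TABLE spam_metadata (page_count INTEGER, byte_size INTEGER)"
        )

    def record_spam(self, page_count, byte_size):
        """Store the metadata dataset of a message already known to be spam."""
        self.db.execute(
            "INSERT INTO spam_metadata VALUES (?, ?)", (page_count, byte_size)
        )

    def classify(self, page_count, byte_size):
        """Look for any stored dataset within the margin of the new one."""
        row = self.db.execute(
            "SELECT 1 FROM spam_metadata "
            "WHERE ABS(page_count - ?) <= ? AND ABS(byte_size - ?) <= ? "
            "LIMIT 1",
            (page_count, self.margin["page_count"],
             byte_size, self.margin["byte_size"]),
        ).fetchone()
        return "spam" if row is not None else "not spam"


spam_filter = MetadataFilter({"page_count": 0, "byte_size": 512})
spam_filter.record_spam(page_count=3, byte_size=48000)
print(spam_filter.classify(page_count=3, byte_size=48300))
print(spam_filter.classify(page_count=5, byte_size=48300))
```

Pushing the tolerance check into the query means only matching rows are retrieved; a design that loads every stored dataset and compares in application code would satisfy the same steps at a higher cost.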
19. A method for filtering audiovisual content, the method comprising:
- receiving a first electronic message via a network interface, the first electronic message including a first video file;
- extracting a first metadata dataset characterizing the first video file;
- retrieving a second metadata dataset from a database, the second metadata dataset characterizing a second video file included in a second electronic message;
- identifying that the first metadata dataset matches the second metadata dataset within a previously specified margin of error, wherein the previously specified margin of error represents a previously specified range of variations between the first metadata dataset and the second metadata dataset; and
- classifying the first electronic message as spam in response to the identification that the first metadata dataset matches the second metadata dataset within the previously specified margin of error.
Dependent claims: 20
Specification