Identification of content by metadata
Abstract
Systems and methods for identifying content in electronic messages are provided. An electronic message may include certain content. The content is detected and analyzed to identify any metadata. The metadata may include a numerical signature characterizing the content. A thumbprint is generated based on the numerical signature. The thumbprint may then be compared to thumbprints of previously received messages. The comparison allows for classification of the electronic message as spam or not spam.
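As an illustration of the flow the abstract describes, the following minimal Python sketch hashes a numerical signature into a thumbprint and looks it up among thumbprints of previously received messages. The choice of SHA-256, the example metadata fields (width, height, bit depth, byte size), and the names `make_thumbprint`, `known_thumbprints`, and `classify` are assumptions made for this example; the source does not specify them.

```python
import hashlib


def make_thumbprint(signature):
    """Hash a sequence of integer metadata values into a hex digest.

    SHA-256 is an assumption for this sketch, not taken from the patent.
    """
    canonical = ",".join(str(int(v)) for v in signature)
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


# Hypothetical store: thumbprints of previously received messages and
# the classification each one was given.
known_thumbprints = {
    make_thumbprint((320, 240, 8, 15234)): "spam",
}


def classify(signature, store):
    """Return the stored classification for a matching thumbprint, if any."""
    return store.get(make_thumbprint(signature), "unknown")
```

An exact hash such as this matches only identical signatures, so a one-pixel change in image width would already produce a different thumbprint.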
18 Claims
1. A method for identifying content in electronic messages, the method comprising:
receiving a first electronic message over a communication network;
extracting metadata characterizing one or more images in the first electronic message, the extracted metadata comprising a plurality of numerical values, each numerical value characterizing a different aspect of the one or more images;
adjusting a subset comprising one or more of the plurality of numerical values from the extracted metadata, wherein each numerical value in the subset is adjusted by a numerical difference value that falls within a pre-specified range;
generating a variation thumbprint based on the extracted metadata, the variation thumbprint based on the adjusted numerical values, wherein the variation thumbprint is stored in association with the extracted metadata from the first electronic message;
searching for one or more thumbprints matching the variation thumbprint of the first electronic message; and
classifying one or more electronic messages associated with the matching thumbprints, the electronic messages classified based on a classification of the first electronic message.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
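The adjusting, generating, searching, and classifying steps of claim 1 can be sketched as follows. This is one possible reading, not the patented implementation: the hash function, the symmetric integer range `[-max_delta, max_delta]`, the dictionary index, and all function names are hypothetical choices made for the example.

```python
import hashlib
from itertools import product


def thumbprint(values):
    """Hash a list of integer metadata values (SHA-256 is an assumption)."""
    return hashlib.sha256(",".join(map(str, values)).encode("utf-8")).hexdigest()


def variation_thumbprints(values, subset, max_delta):
    """Yield one thumbprint per combination of adjustments.

    Each value whose index is in `subset` is shifted by a difference in
    [-max_delta, max_delta]; a difference of 0 is included, so the
    unadjusted thumbprint is among the results.
    """
    deltas = range(-max_delta, max_delta + 1)
    for combo in product(deltas, repeat=len(subset)):
        adjusted = list(values)
        for idx, delta in zip(subset, combo):
            adjusted[idx] += delta
        yield thumbprint(adjusted)


def propagate_classification(first_values, first_label, index,
                             subset=(0, 1), max_delta=2):
    """Label every indexed message whose thumbprint matches a variation.

    `index` is a hypothetical dict mapping thumbprint -> list of message ids.
    """
    labels = {}
    for tp in variation_thumbprints(first_values, subset, max_delta):
        for msg_id in index.get(tp, []):
            labels[msg_id] = first_label
    return labels
```

With two adjusted values and `max_delta=2`, the sketch produces 5 × 5 = 25 variation thumbprints, so the number of lookups grows quickly as the subset or the range widens.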
9. A system for identifying content in electronic messages, the system comprising:
a hardware processor coupled to a memory, wherein the processor executes:
a content filter executable to detect one or more images within a first electronic message received over a communication network,
a metadata analyzer executable to extract metadata characterizing the one or more images within the first electronic message, the extracted metadata comprising a plurality of numerical values, each numerical value characterizing a different aspect of the one or more images,
a thumbprint generator executable to:
adjust a subset comprising one or more of the plurality of numerical values from the extracted metadata, wherein each numerical value in the subset is adjusted by a numerical difference value that falls within a pre-specified range,
generate a variation thumbprint based on the extracted metadata, and
search for thumbprints matching the variation thumbprint of the first electronic message, and
classify one or more electronic messages associated with the matching thumbprints, the electronic messages classified based on a classification of the first electronic message; and
wherein the memory stores the variation thumbprint in association with the extracted metadata from the first electronic message.
Dependent claims: 10, 11, 12, 13, 14, 15, 16, 17
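The component structure of claim 9 (content filter, metadata analyzer, thumbprint generator, and a memory holding thumbprints alongside metadata) might be arranged as in the sketch below. It uses Python's standard `email` package to find image parts. The two metadata values (byte length and number of distinct byte values), the SHA-256 hash, the choice to adjust only the first value, and every function name are assumptions for illustration only.

```python
import hashlib
from email.message import EmailMessage


def content_filter(msg):
    """Detect image parts in a message by MIME main type."""
    return [part for part in msg.walk()
            if part.get_content_maintype() == "image"]


def metadata_analyzer(part):
    """Extract numerical values for one image part.

    Byte length and count of distinct byte values are stand-ins chosen
    for this sketch; the claim does not name specific metadata fields.
    """
    data = part.get_payload(decode=True) or b""
    return (len(data), len(set(data)))


def thumbprint_generator(values, max_delta=1):
    """Return variation thumbprints, adjusting only the first value."""
    result = []
    for delta in range(-max_delta, max_delta + 1):
        adjusted = (values[0] + delta,) + tuple(values[1:])
        result.append(hashlib.sha256(repr(adjusted).encode("utf-8")).hexdigest())
    return result


def process(msg, memory):
    """Store each variation thumbprint in association with its metadata.

    `memory` is a plain dict (thumbprint -> metadata tuple) standing in
    for the claimed memory.
    """
    for part in content_filter(msg):
        values = metadata_analyzer(part)
        for tp in thumbprint_generator(values):
            memory[tp] = values
    return memory
```

A message built with `EmailMessage.add_attachment(image_bytes, maintype="image", subtype="png")` can be passed to `process` together with an empty dict to populate the store.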
18. A non-transitory computer-readable storage medium, having embodied thereon a program executable by a processor to perform a method for identifying content in electronic messages, the method comprising:
receiving a first electronic message over a communication network;
extracting metadata characterizing one or more images in the first electronic message, the extracted metadata comprising a plurality of numerical values, each numerical value characterizing a different aspect of the one or more images;
adjusting a subset comprising one or more of the plurality of numerical values from the extracted metadata, wherein each numerical value in the subset is adjusted by a numerical difference value that falls within a pre-specified range;
generating a variation thumbprint based on the extracted metadata, the variation thumbprint based on the adjusted numerical values, wherein the variation thumbprint is stored in association with the extracted metadata from the first electronic message,
searching for thumbprints matching the variation thumbprint of the first electronic message; and
classifying one or more electronic messages associated with the matching thumbprints, the electronic messages classified based on a classification of the first electronic message.
Specification