Filtering Electronic Messages
First Claim
Patent Images
1. A computer-implemented method for filtering electronic messages, the method comprising:
- receiving an electronic message for transmission to a recipient;
generating a fingerprint for the electronic message, the fingerprint being a fixed length of appended bits selected from hash values determined from a plurality of hash functions applied to separate textual words included in the electronic message;
determining if the electronic message is associated with a known cluster of previously transmitted electronic messages; and
filtering the electronic message based on the determining.
3 Assignments
0 Petitions
Accused Products
Abstract
Technologies are described herein for filtering of electronic messages. A method for filtering messages includes receiving an electronic message for transmission to a recipient, generating a fingerprint for the electronic message, determining if the electronic message is associated with a known cluster of previously transmitted electronic messages, and filtering the electronic message based on the determining. The fingerprint is a fixed length of appended bits selected from hash values determined by applying hash functions to separate textual words included in the electronic message.
59 Citations
20 Claims
-
1. A computer-implemented method for filtering electronic messages, the method comprising:
-
receiving an electronic message for transmission to a recipient; generating a fingerprint for the electronic message, the fingerprint being a fixed length of appended bits selected from hash values determined from a plurality of hash functions applied to separate textual words included in the electronic message; determining if the electronic message is associated with a known cluster of previously transmitted electronic messages; and filtering the electronic message based on the determining. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A computer-readable storage medium having computer executable instructions stored thereon which, when executed by a computer, cause the computer to:
-
receive an electronic message for transmission to a recipient; generate a fingerprint for the electronic message, the fingerprint being a fixed length of appended bits selected from hash values determined from a plurality of hash functions applied to separate textual words included in the electronic message; determine if the electronic message is associated with a known cluster of previously transmitted electronic messages; classify the known cluster based on message features of the known cluster in response to determining the electronic message is associated with the known cluster; and publish an electronic mail filter configured to filter future messages received based on the classification and the known cluster. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A mail processing system configured to distribute electronic messages from a plurality of client computers to a plurality of recipients, the system comprising:
-
at least one computer executing an electronic messaging service configured to receive the electronic messages from the plurality of client computers, the electronic messaging service further configured to divide each message into a plurality of shingles absent noisy characters, perform a plurality of hash functions on each shingle of the plurality of shingles to create a plurality of hash values associated with each shingle, and generate a message fingerprint for each message based on the plurality of hash functions; at least one computer executing a clustering service configured to receive each message fingerprint from the electronic messaging service, the clustering service further configured to, divide each fingerprint into a plurality of bit sequences, compare each bit sequence of the plurality of bit sequences to an associated bin of bit sequences for known clusters of previously transmitted electronic messages, and determine if a similarity threshold between each fingerprint and the known clusters has been met; and at least one computer executing a filtering agent configured to filter the electronic messages based on filter information received from the clustering service.
-
Specification