Spam filtering with probabilistic secure hashes
First Claim
1. One or more program storage mediums readable by at least a computer having at least one memory and at least one processor, the one or more mediums tangibly embodying one or more programs of instructions executable by the at least one computer to implement a signature-based message filtering system that facilitates spam prevention and protection comprising:
- a signature assignment component that determines and assigns one or more signatures for an incoming message based in part on at least one hash function; and
a signature analysis component that determines a probability that at least a subset of the message'"'"'s signatures are indicative of spam based in part on a count of presumed spam signatures that match and a count of the overall volume of messages sent to the system per signature,wherein the signature analysis component determines a complaint rate by taking a product of a percentage of recipient complaints by a number of read messages and dividing the product by an overall volume of incoming messages, and based upon how old the read messages are.
3 Assignments
0 Petitions
Accused Products
Abstract
Disclosed are signature-based systems and methods that facilitate spam detection and prevention at least in part by calculating hash values for an incoming message and then determining a probability that the hash values indicate spam. In particular, the signatures generated for each incoming message can be compared to a database of both spam and good signatures. A count of the number of matches can be divided by a denominator value. The denominator value can be an overall volume of messages sent to the system per signature for example. The denominator value can be discounted to account for different treatments and timing of incoming messages. Furthermore, secure hashes can be generated by combining portions of multiple hashing components. A secure hash can be made from a combination of multiple hashing components or multiple combinations thereof. The signature based system can also be integrated with machine learning systems to optimize spam prevention.
-
Citations
44 Claims
-
1. One or more program storage mediums readable by at least a computer having at least one memory and at least one processor, the one or more mediums tangibly embodying one or more programs of instructions executable by the at least one computer to implement a signature-based message filtering system that facilitates spam prevention and protection comprising:
-
a signature assignment component that determines and assigns one or more signatures for an incoming message based in part on at least one hash function; and a signature analysis component that determines a probability that at least a subset of the message'"'"'s signatures are indicative of spam based in part on a count of presumed spam signatures that match and a count of the overall volume of messages sent to the system per signature, wherein the signature analysis component determines a complaint rate by taking a product of a percentage of recipient complaints by a number of read messages and dividing the product by an overall volume of incoming messages, and based upon how old the read messages are. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29)
-
-
30. A signature-based method that facilitates filtering messages for spam detection by a computer processor comprising:
-
assigning, by the computer processor, one or more signatures to an incoming message based in part upon at least one hash function; determining a probability that at least a subset of the message'"'"'s signatures are indicative of spam based in part on a count of presumed spam signatures that match and a count of the overall volume of messages sent to the system per signature; and determining a complaint rate by taking a product of a percentage of recipient complaints by a number of read messages and dividing the product by an overall volume of incoming messages, and based upon how old the read messages are. - View Dependent Claims (31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41)
-
-
42. One or more program storage mediums readable by at least a computer having at least one memory and at least one processor, the one or more mediums tangibly embodying one or more programs of instructions executable by the at least one computer to implement a signature-based system that facilitates filtering messages for spam detection comprising:
-
means for assigning one or more signatures to an incoming message based in part upon at least one hash function; and means for determining a probability that at least a subset of the message'"'"'s signatures are indicative of spam based in part on a match count to presumed spam signatures that match and the overall volume of messages sent to the system per signature, wherein a complaint rate is determined by taking a product of a percentage of recipient complaints by a number of read messages and dividing the product by an overall volume of incoming messages, and based upon how old the read messages are. - View Dependent Claims (43)
-
-
44. A computer storage medium having stored thereon a data packet adapted to be transmitted between two or more computer processes facilitating improved detection of spam, the data packet comprising:
- information which when receiving by a computer processor determines a probability that at least a subset of a message'"'"'s signatures are indicative of spam based in part on a match count to the overall volume of messages sent to the system per signature, wherein a complaint rate is determined by taking a product of a percentage of recipient complaints by a number of read messages and dividing the product by an overall volume of incoming messages, and based upon how old the read messages are.
Specification