Statistical message classifier
Abstract
A system and method are disclosed for improving a statistical message classifier. A message may be tested with a machine classifier, wherein the machine classifier is capable of making a classification on the message. In the event the message is classifiable by the machine classifier, the statistical message classifier is updated according to the reliable classification made by the machine classifier. The message may also be tested with a first classifier. In the event that the message is not classifiable by the first classifier, it is tested with a second classifier, wherein the second classifier is capable of making a second classification. In the event that the message is classifiable by the second classifier, the statistical message classifier is updated according to the second classification.
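The cascade described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the class `StatisticalClassifier`, the function `train_on_message`, the label names, the whitespace tokenizer, and the two example classifiers are all hypothetical and do not come from the source. A reliable classifier is modeled as a function that returns a label, or `None` when it cannot classify the message.

```python
from collections import Counter
from typing import Callable, Iterable, Optional, Sequence

# Hypothetical label names; the source speaks only of junk/spam and good messages.
JUNK, GOOD = "junk", "good"

# A reliable classifier returns a label, or None when the message is not classifiable.
Classifier = Callable[[str], Optional[str]]


class StatisticalClassifier:
    """Per-feature junk and good counts (an assumed storage layout)."""

    def __init__(self) -> None:
        self.junk_counts: Counter = Counter()
        self.good_counts: Counter = Counter()

    def update(self, features: Iterable[str], label: str) -> None:
        # Count each distinct feature once per message.
        counts = self.junk_counts if label == JUNK else self.good_counts
        counts.update(set(features))


def train_on_message(
    message: str,
    stat: StatisticalClassifier,
    classifiers: Sequence[Classifier],
) -> Optional[str]:
    """Try each classifier in order; update the counts with the first label produced."""
    for classify in classifiers:
        label = classify(message)
        if label is not None:
            # Features here are lowercased whitespace tokens (an assumption).
            stat.update(message.lower().split(), label)
            return label
    return None  # no classifier could classify the message; counts are unchanged


# Two toy classifiers, purely illustrative.
def known_sender(message: str) -> Optional[str]:
    return GOOD if "from: alice@example.com" in message.lower() else None


def known_junk_phrase(message: str) -> Optional[str]:
    return JUNK if "free pills" in message.lower() else None
```

In this sketch a message that neither classifier can label leaves the statistical classifier untouched, which mirrors the abstract's condition that updates happen only when a classification is made.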
28 Claims
1. A method for improving a statistical message classifier, the method comprising:
receiving a message over a network communication interface;
identifying by a processor executing instructions out of a memory a feature in the message that is associated with a junk count;
testing the message with a machine classifier by the processor performing a calculation according to a logarithmic function including the junk count, wherein the machine classifier outputs a value for the statistical message classifier for storage in the memory;
updating the statistical message classifier stored in the memory according to the logarithmic calculation made by the machine classifier, wherein the stored statistical message classifier corresponds to a probability that the feature in the message is spam;
identifying that the message is spam based on the probability that the feature in the message is spam; and
quarantining the message.
Dependent claims: 2-17
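Claim 1 does not state which logarithmic function is used. As one possible reading, the sketch below assumes a smoothed log-odds score built from a feature's junk count and good count, converts it to a probability with the logistic function, and quarantines above a threshold. The function names, the add-one smoothing, and the 0.9 threshold are assumptions made for illustration only.

```python
import math


def feature_log_score(junk_count: int, good_count: int) -> float:
    """Assumed logarithmic function: smoothed log-odds that a feature indicates spam."""
    # Add-one smoothing keeps the logarithm defined when a count is zero.
    return math.log(junk_count + 1) - math.log(good_count + 1)


def spam_probability(log_score: float) -> float:
    """Map a log-odds score to a probability with the logistic function."""
    return 1.0 / (1.0 + math.exp(-log_score))


def route(probability: float, threshold: float = 0.9) -> str:
    """Quarantine at or above the (assumed) threshold, otherwise deliver."""
    return "quarantine" if probability >= threshold else "deliver"
```

Under these assumptions, a feature seen in 9 junk messages and 0 good messages has a score of log(10) and a spam probability of 10/11, so the message would be quarantined; a feature with equal counts scores 0 and maps to a probability of 0.5.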
18. A method for improving a statistical message classifier comprising:
receiving a message over a network communication interface;
identifying by a processor executing instructions out of a memory a feature in the message that is associated with a junk count;
testing the message with a first classifier by the processor performing a calculation according to a logarithmic function including the junk count, wherein the first classifier outputs a value for the statistical message classifier for storage in the memory and a first classification;
in the event that the message is classifiable by the first classifier, updating the statistical message classifier according to the logarithmic calculation;
in the event that the first classifier does not make the first classification, testing the message with a second classifier, wherein the second classifier is capable of making a second classification by the processor; and
updating the statistical message classifier stored in the memory according to the second classification by the processor, wherein the statistical message classifier corresponds to a probability that the feature in the message is spam;
identifying that the message is not spam based on the probability that the feature in the message is spam; and
delivering the message to a recipient based on identifying that the message is not spam.
Dependent claims: 19
20. A method for improving a statistical message classifier comprising:
receiving a message over a network communication interface;
identifying by a processor executing instructions out of a memory a feature in the message that is associated with a good count;
testing the message with a first classifier wherein the first classifier is a reliable good classifier capable of making a first classification by the processor performing a calculation according to a logarithmic function including the good count, wherein the first classifier outputs a value for the statistical message classifier for storage in the memory;
in the event that the message is classifiable by the first classifier, updating the statistical message classifier stored in the memory according to the first classification by the processor performing the logarithmic calculation;
in the event that the first classifier does not make the first classification, testing the message with a second classifier, wherein the second classifier is capable of making a second classification; and
updating the statistical message classifier stored in the memory according to the second classification by the processor, wherein the statistical message classifier corresponds to a probability that the feature in the message is spam;
identifying that the message is not spam based on the probability that the feature in the message is spam; and
delivering the message to a recipient based on identifying that the message is not spam.
Dependent claims: 22
21. A method for improving a statistical message classifier comprising:
receiving a message over a network communication interface;
identifying by a processor executing instructions out of a memory a feature in the message that is associated with a junk count;
testing the message with a first classifier by the processor, wherein the first classifier is a reliable junk classifier capable of making a first classification by the processor performing a calculation according to a logarithmic function including the junk count, wherein the first classifier outputs a value for the statistical message classifier for storage in the memory;
in the event that the message is classifiable by the first classifier, updating the statistical message classifier stored in the memory according to the first classification by the processor performing the logarithmic calculation;
in the event that the first classifier does not make the first classification, testing the message with a second classifier, wherein the second classifier is capable of making a second classification; and
updating the statistical message classifier stored in the memory according to the second classification by the processor, wherein the statistical message classifier corresponds to a probability that the feature in the message is spam;
identifying that the message is spam based on the probability that the feature in the message is spam; and
quarantining the message.
Dependent claims: 23, 24
25. A system for classifying a message, comprising:
a memory;
a processor; and
a network interface, wherein:
a message is received over the network communication interface; and
the processor, executing instructions out of the memory:
identifies a feature in the message that is associated with a junk count, wherein a statistical message classifier stored in the memory corresponds to a probability that the feature in the message is spam;
tests the message with a machine classifier by performing a calculation according to a logarithmic function including the junk count, the machine classifier being coupled to the statistical message classifier and configured to test the message and output a value for the statistical message classifier stored in the memory, wherein the statistical message classifier is updated according to the logarithmic calculation by the machine classifier and corresponds to a probability that the feature in the message is spam;
identifies that the message is spam based on the probability that the feature in the message is spam; and
quarantines the message.
26. A system for improving a statistical message classifier, comprising:
a memory;
a processor; and
a network interface, wherein:
a message is received over the network communication interface; and
the processor, executing instructions out of the memory:
identifies a feature in the message that is associated with a junk count;
tests the message using a first classifier making a first classification according to a calculation according to a logarithmic function including the junk count, wherein the first classifier further outputs a value for the statistical message classifier for storage in the memory and updates the statistical message classifier according to the logarithmic calculation, wherein the statistical message classifier corresponds to a probability that the feature in the message is spam; and
a second classifier coupled to the first classifier, capable of reliably making a second classification, and configured to further test the message in the event that the message is not classifiable by the first classifier;
identifies that the message is spam based on the probability that the feature in the message is spam; and
quarantines the message.
27. A non-transitory computer readable storage medium having embodied thereon a program, the program being executable by a processor to perform a method for improving a statistical message classifier, the method comprising:
receiving a message over a network communication interface;
identifying by a processor executing instructions out of a memory a feature in the message that is associated with a junk count;
testing the message with a machine classifier by the processor performing a calculation according to a logarithmic function including the junk count, wherein the machine classifier is capable of making a classification of the message;
updating the statistical message classifier stored in the memory according to the reliable classification made by the machine classifier, wherein the statistical message classifier stored in the memory corresponds to a probability that the feature in the message is spam;
identifying that the message is spam based on the probability that the feature in the message is spam; and
quarantining the message.
28. A non-transitory computer readable storage medium having embodied thereon a program, the program being executable to perform a method for improving a statistical message classifier, the method comprising:
receiving a message over a network communication interface;
identifying by a processor executing instructions out of a memory a feature in the message that is associated with a junk count;
testing the message with a first classifier, wherein the first classifier is capable of making a first classification by the processor performing a calculation according to a logarithmic function including the junk count;
in the event that the first classifier makes the first classification, updating the statistical message classifier stored in the memory according to the first classification, wherein the statistical message classifier stored in the memory corresponds to a probability that the feature in the message is spam;
in the event that the first classifier does not make the first classification, testing the message with a second classifier, wherein the second classifier is capable of reliably making a second classification; and
updating the statistical message classifier stored in the memory according to the second classification;
identifying that the message is spam based on the probability that the feature in the message is spam; and
quarantining the message.
Specification