Trees of classifiers for detecting email spam
Abstract
Decision trees populated with classifier models are leveraged to provide enhanced spam detection utilizing separate email classifiers for each feature of an email. This provides a higher probability of spam detection through tailoring of each classifier model to facilitate in more accurately determining spam on a feature-by-feature basis. Classifiers can be constructed based on linear models such as, for example, logistic-regression models and/or support vector machines (SVM) and the like. The classifiers can also be constructed based on decision trees. “Compound features” based on internal and/or external nodes of a decision tree can be utilized to provide linear classifier models as well. Smoothing of the spam detection results can be achieved by utilizing classifier models from other nodes within the decision tree if training data is sparse. This forms a base model for branches of a decision tree that may not have received substantial training data.
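The idea in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes a tree whose nodes each hold a logistic-regression-style linear model, and it interprets "smoothing" as backing off to the nearest ancestor model when a node received little training data. All names (`LinearModel`, `Node`, `classify`, `min_train`, `has_link`, `has_attachment`) and all weights are hypothetical and chosen only for illustration.

```python
import math
from dataclasses import dataclass, field
from typing import Callable, Dict, Optional, Tuple

Email = Dict[str, float]  # hypothetical feature map: feature name -> value


@dataclass
class LinearModel:
    """Logistic-regression-style scorer with fixed, already-trained weights."""
    weights: Dict[str, float]
    bias: float = 0.0
    n_train: int = 0  # assumed count of training emails that reached this node

    def spam_probability(self, email: Email) -> float:
        z = self.bias + sum(w * email.get(name, 0.0)
                            for name, w in self.weights.items())
        return 1.0 / (1.0 + math.exp(-z))


@dataclass
class Node:
    """Decision-tree node holding its own classifier model."""
    model: LinearModel
    split: Optional[Callable[[Email], str]] = None  # returns a branch label
    children: Dict[str, "Node"] = field(default_factory=dict)


def classify(root: Node, email: Email, min_train: int = 50,
             threshold: float = 0.5) -> Tuple[bool, float]:
    """Walk the tree and score with the deepest model on the path that saw
    at least `min_train` training emails; otherwise keep an ancestor's model
    (a simple back-off form of smoothing, assumed here)."""
    node, best = root, root.model
    while node.split is not None:
        child = node.children.get(node.split(email))
        if child is None:
            break
        node = child
        if node.model.n_train >= min_train:
            best = node.model
    p = best.spam_probability(email)
    return p >= threshold, p


# Illustrative tree: the root splits on whether the email has an attachment.
root = Node(
    model=LinearModel({"has_link": 1.0}, bias=-2.0, n_train=1000),
    split=lambda e: "attach" if e.get("has_attachment", 0.0) > 0 else "plain",
    children={
        "attach": Node(LinearModel({"has_link": 3.0}, n_train=200)),
        "plain": Node(LinearModel({"has_link": -5.0}, n_train=5)),  # sparse
    },
)
```

In this sketch, an email with an attachment is scored by the well-trained "attach" model, while an email without one falls back to the root model because the "plain" branch saw only five training emails. Other smoothing schemes, such as blending the child and ancestor scores, would fit the same structure.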
191 Citations
20 Claims
1. A system that facilitates classification of electronic mail, comprising:
a feature detection component that detects at least one feature of at least one obtained email; and
a decision tree classification component that classifies the obtained email based on, at least in part, the features of the obtained email and at least one feature classifier employed via a decision tree.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 20
10. A method for facilitating classification of electronic mail, comprising:
obtaining an email from a source; and
utilizing a decision tree of classifier models to detect whether the email is spam.
Dependent claims: 11, 12, 13, 14, 15, 16, 17, 19
18. A system that facilitates classification of electronic mail, comprising:
means for detecting features of at least one obtained email; and
means for classifying the obtained email based on, at least in part, the features of the obtained email and at least one feature classifier employed via a decision tree.
Specification