Classifying knowledge aging in emails using Naïve Bayes Classifier
First Claim
1. A method to be performed on a computing device, the computing device comprising a central processing unit and a memory, the method comprising:
- using the central processing unit to obtain one or more counts generated by a statistical classifier, the one or more counts being based on historical verified data, comprising one or more data entries, that is consumed by the statistical classifier;
selecting one or more aging factors reflecting a correlation between the historical verified data'"'"'s age and its relevance to new data, wherein the selecting the one or more aging factors comprises providing a user interface for selecting the one or more aging factors;
aging the one or more counts with the one or more aging factors;
classifying new data using the statistical classifier employing the aged one or more counts.
2 Assignments
0 Petitions
Accused Products
Abstract
The Naïve Bayes Classifier predicts the classification of a set of data based on the features of that data and a series of counts reflecting the information obtained from prior data sets, with one count per feature per class. An external boost can be applied to the counts generated by the NBC to account for external information. Such a boost is added to the counts generated by the NBC, and the boosted counts are then used by the NBC. A boost can be applied to some or all of the counts and the boost for each count can be applied independently. Likewise, the counts can be periodically aged by multiplying the counts with an aging factor of between 0 and 1 per period. Aging factors can be applied uniformly across all counts, or can be individually applied, enabling some counts to age more than others.
-
Citations
11 Claims
-
1. A method to be performed on a computing device, the computing device comprising a central processing unit and a memory, the method comprising:
-
using the central processing unit to obtain one or more counts generated by a statistical classifier, the one or more counts being based on historical verified data, comprising one or more data entries, that is consumed by the statistical classifier; selecting one or more aging factors reflecting a correlation between the historical verified data'"'"'s age and its relevance to new data, wherein the selecting the one or more aging factors comprises providing a user interface for selecting the one or more aging factors; aging the one or more counts with the one or more aging factors; classifying new data using the statistical classifier employing the aged one or more counts. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A statistical classification system comprising:
-
a central processing unit coupled to a system memory, the system memory comprising; a statistical model comprising a series of counts, with each count counting a number of occurrences of a feature value per class amongst a set of data entries; a statistical classifier for generating the statistical model and using the statistical model to classify new data entries; and a count modification mechanism, external to the statistical classifier, for modifying the series of counts, wherein the count modification mechanism comprises a user interface for viewing or selecting modification data, wherein the count modification mechanism comprises a boosting mechanism for selecting one or more boost values for one or more counts of the series of counts, and, an aging mechanism for selecting one or more aging factors for one or more counts of the series of counts. - View Dependent Claims (9, 10, 11)
-
Specification