Adaptive junk message filtering system
First Claim
1. A data filtering system, comprising:
- a first filter configured to tag messages as junk based at least in part on junk information associated with the messages, the first filter having associated therewith a false positive rate and a false negative rate;
one or more second filters configured to tag the messages as junk based at least in part on junk information associated with the messages, the one or more second filters initially associated with the false positive rate and the false negative rate of the first filtera filter output configured to receive tagged and untagged messages from the first filter and the one or more second filters;
a user correction component configured to receive user actions relating to the tagged and untagged messages sent to the filter output and to output false positive data and false negative data based on the user actions relating to the tagged and untagged messages sent to the filter output; and
a filter control configured to;
receive the false positive data and the false negative data;
adjust the false positive rate or the false negative rate or both of at least one of the one or more second filters based on its false positive data or its false negative data or both; and
route subsequently received messages between the first filter and the one or more second filters according to a threshold and their respective false positive rates, false negative rates or both.
2 Assignments
0 Petitions
Accused Products
Abstract
The invention relates to a system for filtering messages—the system includes a seed filter having associated therewith a false positive rate and a false negative rate. A new filter is also provided for filtering the messages, the new filter is evaluated according to the false positive rate and the false negative rate of the seed filter, the data used to determine the false positive rate and the false negative rate of the seed filter are utilized to determine a new false positive rate and a new false negative rate of the new filter as a function of threshold. The new filter is employed in lieu of the seed filter if a threshold exists for the new filter such that the new false positive rate and new false negative rate are together considered better than the false positive and the false negative rate of the seed filter.
169 Citations
19 Claims
-
1. A data filtering system, comprising:
-
a first filter configured to tag messages as junk based at least in part on junk information associated with the messages, the first filter having associated therewith a false positive rate and a false negative rate; one or more second filters configured to tag the messages as junk based at least in part on junk information associated with the messages, the one or more second filters initially associated with the false positive rate and the false negative rate of the first filter a filter output configured to receive tagged and untagged messages from the first filter and the one or more second filters; a user correction component configured to receive user actions relating to the tagged and untagged messages sent to the filter output and to output false positive data and false negative data based on the user actions relating to the tagged and untagged messages sent to the filter output; and a filter control configured to; receive the false positive data and the false negative data; adjust the false positive rate or the false negative rate or both of at least one of the one or more second filters based on its false positive data or its false negative data or both; and route subsequently received messages between the first filter and the one or more second filters according to a threshold and their respective false positive rates, false negative rates or both. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 16, 17, 18)
-
-
12. A method of facilitating data filtering, comprising:
-
automatically filtering incoming messages according to a false positive rate and a false negative rate of a seed filter; receiving user-correction data relating to at least one filtered message; determining an accuracy of the seed filter based on the user-correction data relating to the at least one filtered message; training a new filter using the user-correction data; determining a false positive rate and a false negative rate of the new filter; determining an accuracy of the new filter based on the false positive rate and the false negative rate of the new filter; and employing the new filter in lieu of the seed filter if the accuracy of the new filter is better than that of the seed filter. - View Dependent Claims (13, 14, 15)
-
-
19. A data filtering system, comprising:
-
first means for filtering messages, the first means for filtering messages having associated therewith a false positive rate and a false negative rate; new means for filtering the messages, the new means for filtering the messages trained according to the false positive rate and the false negative rate associated with the first means for filtering the messages; means for determining a new false positive rate and a new false negative rate associated with the new means for filtering the messages as a function of threshold; means for determining a threshold of the new means for filtering the messages; means for employing the new means for filtering the messages in lieu of the first means for filtering the messages if a threshold exists for the new means for filtering the messages such that the new false positive rate and new false negative rate associated with the new means for filtering the messages are together considerd better than the false positive rate and the false negative rate associated with the first means for filtering the messages.
-
Specification