ONLINE ADAPTIVE FILTERING OF MESSAGES

US 20140344387A1
Filed: 08/05/2014
Published: 11/20/2014
Est. Priority Date: 07/21/2003
Status: Active Grant

Abstract

In general, a spam filtering system with two or more stages is used to filter spam in an e-mail system. One stage includes a global e-mail classifier that classifies e-mail as it enters the e-mail system. The parameters of the global e-mail classifier generally may be determined by the policies of the e-mail system owner and generally are set to classify as spam only those e-mails that are likely to be considered spam by a significant number of users of the e-mail system. Another stage includes personal e-mail classifiers at the individual mailboxes of the e-mail system users. The parameters of the personal e-mail classifiers generally are set by the users through retraining, such that each personal e-mail classifier is refined to track the subjective perceptions of its user as to which e-mails are spam. Retraining data for the personal e-mail classifiers may be aggregated, and a subset of the aggregate may be chosen for use in retraining the global e-mail classifier.
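
The two stages described in the abstract can be sketched as a small routing function. This is a minimal illustration under stated assumptions, not the patent's implementation: the names `route_message`, `toy_global`, and `toy_personal`, the keyword-based scorers, and the threshold values 0.95 and 0.60 are all hypothetical and chosen only so the example runs. The one property taken from the claims is that the gateway (global) threshold is higher than the mailbox (personal) threshold.

```python
from typing import Callable

# Hypothetical threshold values; the claims only require the gateway
# (global) threshold to be higher than the mailbox (personal) threshold.
GLOBAL_THRESHOLD = 0.95
PERSONAL_THRESHOLD = 0.60

Scorer = Callable[[str], float]  # maps message text to a spam score in [0, 1]


def route_message(text: str, global_score: Scorer, personal_score: Scorer) -> str:
    """Return where a message ends up after both filtering stages."""
    # Stage 1: the global classifier runs as the message enters the system.
    if global_score(text) > GLOBAL_THRESHOLD:
        return "blocked_at_gateway"
    # Stage 2: the personal classifier runs at the user's mailbox.
    if personal_score(text) > PERSONAL_THRESHOLD:
        return "spam_folder"
    return "inbox"


# Toy scorers standing in for trained models (assumptions for illustration).
def toy_global(text: str) -> float:
    return 0.99 if "lottery winner" in text.lower() else 0.10


def toy_personal(text: str) -> float:
    return 0.80 if "newsletter" in text.lower() else 0.05


if __name__ == "__main__":
    for msg in ["You are a LOTTERY WINNER", "Weekly newsletter", "Lunch at noon?"]:
        print(msg, "->", route_message(msg, toy_global, toy_personal))
```

Because the gateway threshold is the stricter one, a message such as the newsletter passes the gateway for every user and is filtered only in the mailboxes whose owners have trained their personal classifier against it.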

58 Claims

1-15. (canceled)

16. A method of operating a spam filtering system in a messaging system that includes a message gateway and individual message boxes for users of the system, the method comprising:
- aggregating personal retraining data used to retrain personal, scoring e-mail classifiers that classify messages delivered to the individual message boxes as spam when a score for the messages exceeds a first threshold for classifying the messages as spam, wherein personal retraining data for an individual message box is based on a user's feedback about the classes of messages in the user's individual message box;
  
  selecting a subset of the aggregated personal retraining data as global retraining data for retraining a global, scoring e-mail classifier that classifies messages received at a message gateway as spam when a score for the messages exceeds a second threshold for classifying the messages as spam, the second threshold being higher than the first threshold; and
  
  retraining the global, scoring e-mail classifier based on the global retraining data so as to adjust which messages are classified as spam.
- Dependent claims (17, 18, 19, 20, 21, 23, 24, 25, 26, 27, 28, 29):
  - 17. The method of claim 16 wherein the user feedback is explicit.
  - 18. The method of claim 17 wherein the explicit user feedback comprises one or more of the following:
    - a user reporting a message as spam;
      
      moving a message from an Inbox folder in the individual message box to a Spam folder in the individual message box;
      
      or moving a message from a Spam folder in the individual message box to an Inbox folder in the individual message box.
  - 19. The method of claim 16 wherein the feedback is implicit.
  - 20. The method of claim 19 wherein the implicit feedback comprises one or more of the following:
    - keeping a message as new after the message has been read;
      
      forwarding a message;
      
      replying to a message;
      
      printing a message;
      
      adding a sender of a message to an address book;
      
      or not explicitly changing a classification of a message.
  - 21. The method of claim 16 wherein the aggregated personal retraining data comprises messages.
  - 23. The method of claim 16 wherein the feedback comprises changing a message's class.
  - 24. The method of claim 23 wherein selecting a subset of the aggregated personal retraining data comprises selecting a message as global retraining data when a particular number of users change the message's classification.
  - 25. The method of claim 16 wherein the messages are e-mails.
  - 26. The method of claim 16 wherein the messages are instant messages.
  - 27. The method of claim 16 wherein the messages are SMS messages.
  - 28. The method of claim 16 wherein, to classify a message, the global, scoring e-mail classifier uses an internal model to determine a probability measure for the message and compares the probability measure to a classification threshold.
  - 29. The method of claim 28 wherein, to classify a message, the personal, scoring e-mail classifier uses an internal model to determine a probability measure for the message and compares the probability measure to a classification threshold, the method further comprising initializing the personal, scoring e-mail classifier's internal model using the internal model for the global, scoring e-mail classifier.
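
The three steps of claim 16 (aggregate, select, retrain), combined with the selection rule of claim 24, can be sketched as follows. This is an illustrative sketch only: the `FeedbackRecord` structure, the function names, and the `min_users` parameter are assumptions, and `retrain_global` is a stand-in that merely tallies the selected examples per class where a real system would refit the global classifier's model.

```python
from collections import Counter, defaultdict
from dataclasses import dataclass
from typing import Dict, Iterable, List, Set, Tuple


@dataclass(frozen=True)
class FeedbackRecord:
    """One piece of personal retraining data (hypothetical structure)."""
    user_id: str
    message_id: str
    new_class: str  # "spam" or "legit": the class the user assigned


def aggregate(per_mailbox: Iterable[Iterable[FeedbackRecord]]) -> List[FeedbackRecord]:
    """Step 1: pool the retraining data collected at the individual mailboxes."""
    return [rec for mailbox in per_mailbox for rec in mailbox]


def select_global_subset(
    records: Iterable[FeedbackRecord], min_users: int
) -> List[Tuple[str, str]]:
    """Step 2: keep (message_id, class) pairs chosen by at least `min_users`
    distinct users. `min_users` is an assumed parameter standing in for the
    "particular number of users" of claim 24."""
    voters: Dict[Tuple[str, str], Set[str]] = defaultdict(set)
    for rec in records:
        voters[(rec.message_id, rec.new_class)].add(rec.user_id)
    return sorted(key for key, users in voters.items() if len(users) >= min_users)


def retrain_global(training_pairs: List[Tuple[str, str]]) -> Counter:
    """Step 3 (stand-in): count selected examples per class. A real system
    would use these examples to refit the global classifier."""
    return Counter(label for _, label in training_pairs)


if __name__ == "__main__":
    mailboxes = [
        [FeedbackRecord("alice", "m1", "spam"), FeedbackRecord("alice", "m2", "legit")],
        [FeedbackRecord("bob", "m1", "spam")],
        [FeedbackRecord("carol", "m1", "spam"), FeedbackRecord("carol", "m2", "spam")],
    ]
    pooled = aggregate(mailboxes)
    selected = select_global_subset(pooled, min_users=3)
    print(selected)
    print(retrain_global(selected))
```

Counting distinct users rather than raw events keeps one user's repeated reclassification of the same message from promoting it into the global retraining data.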

22. (canceled)

30-43. (canceled)

44. A non-transitory computer-usable medium storing a computer program for operating a spam filtering system in a messaging system that includes a message gateway and individual message boxes for users of the system, the computer program comprising instructions for causing at least one processor to:
- aggregate personal retraining data used to retrain personal, scoring e-mail classifiers that classify messages delivered to the individual message boxes as spam when a score for the messages exceeds a first threshold for classifying the messages as spam, wherein personal retraining data for an individual message box is based on a user's feedback about the classes of messages in the user's individual message box;
  
  select a subset of the aggregated personal retraining data as global retraining data for retraining a global, scoring e-mail classifier that classifies messages received at a message gateway as spam when a score for the messages exceeds a second threshold for classifying the messages as spam, the second threshold being higher than the first threshold; and
  
  retrain the global, scoring e-mail classifier based on the global retraining data so as to adjust which messages are classified as spam.
- Dependent claims (45, 46, 47, 48, 49, 51, 52, 53, 54, 55, 56):
  - 45. The medium of claim 44 wherein the user feedback is explicit.
  - 46. The medium of claim 45 wherein the explicit user feedback comprises one or more of the following:
    - a user reporting a message as spam;
      
      moving a message from an Inbox folder in the individual message box to a Spam folder in the individual message box;
      
      or moving a message from a Spam folder in the individual message box to an Inbox folder in the individual message box.
  - 47. The medium of claim 44 wherein the feedback is implicit.
  - 48. The medium of claim 47 wherein the implicit feedback comprises one or more of the following:
    - keeping a message as new after the message has been read;
      
      forwarding a message;
      
      replying to a message;
      
      printing a message;
      
      adding a sender of a message to an address book;
      
      or not explicitly changing a classification of a message.
  - 49. The medium of claim 44 wherein the aggregated personal retraining data comprises messages.
  - 51. The medium of claim 44 wherein the feedback comprises changing a message's class.
  - 52. The medium of claim 51 wherein to select a subset of the aggregated personal retraining data, the computer program further comprises instructions for causing a processor to select a message as global retraining data when a particular number of users change the message's classification.
  - 53. The medium of claim 44 wherein the messages are e-mails.
  - 54. The medium of claim 44 wherein the messages are instant messages.
  - 55. The medium of claim 44 wherein the messages are SMS messages.
  - 56. The medium of claim 44 wherein, to classify a message, the global, scoring e-mail classifier uses an internal model to determine a probability measure for the message and compares the probability measure to a classification threshold.
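
The explicit and implicit feedback events listed in claims 18 and 20 (and again in claims 46 and 48) could be turned into retraining labels along the following lines. This is a sketch under assumptions: the `Event` names and the `label_from_event` function are hypothetical. The claims list the implicit events but do not state which class each one implies, so treating replying, forwarding, printing, keeping as new, and adding the sender to an address book as evidence that a message is not spam is an assumption made here, as is treating "not explicitly changing a classification" as confirming the current class.

```python
from enum import Enum
from typing import Tuple


class Event(Enum):
    # Explicit feedback (claims 18 and 46)
    REPORT_SPAM = "report_spam"
    MOVE_INBOX_TO_SPAM = "move_inbox_to_spam"
    MOVE_SPAM_TO_INBOX = "move_spam_to_inbox"
    # Implicit feedback (claims 20 and 48)
    KEEP_AS_NEW = "keep_as_new"
    FORWARD = "forward"
    REPLY = "reply"
    PRINT = "print"
    ADD_SENDER_TO_ADDRESS_BOOK = "add_sender_to_address_book"
    NO_CHANGE = "no_change"


# Explicit events state the class directly.
EXPLICIT_LABELS = {
    Event.REPORT_SPAM: "spam",
    Event.MOVE_INBOX_TO_SPAM: "spam",
    Event.MOVE_SPAM_TO_INBOX: "legit",
}

# Assumption: these actions suggest the user wanted the message.
IMPLICIT_LEGIT = {
    Event.KEEP_AS_NEW,
    Event.FORWARD,
    Event.REPLY,
    Event.PRINT,
    Event.ADD_SENDER_TO_ADDRESS_BOOK,
}


def label_from_event(event: Event, current_class: str) -> Tuple[str, bool]:
    """Return (label, is_explicit) for one feedback event on a message
    currently classified as `current_class` ("spam" or "legit")."""
    if event in EXPLICIT_LABELS:
        return EXPLICIT_LABELS[event], True
    if event in IMPLICIT_LEGIT:
        return "legit", False
    # NO_CHANGE: assumed to confirm whatever class the message already has.
    return current_class, False


if __name__ == "__main__":
    print(label_from_event(Event.REPORT_SPAM, "legit"))
    print(label_from_event(Event.REPLY, "spam"))
    print(label_from_event(Event.NO_CHANGE, "spam"))
```

Returning the `is_explicit` flag alongside the label lets a retraining step weight explicit feedback more heavily than implicit feedback, if that is desired.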

50. (canceled)

57. (canceled)

58. An apparatus for operating a spam filtering system in a messaging system that includes a message gateway and individual message boxes for users of the system, the apparatus comprising:
- a network interface configured to receive personal retraining data for an individual message box used to retrain personal, scoring e-mail classifiers that classify messages delivered to the individual message boxes as spam when a score for the messages exceeds a first threshold for classifying the messages as spam, wherein the personal retraining data is based on a user's feedback about the classes of messages in the user's individual message box over one or more network connections; and
  
  at least one processor configured by a set of instructions to (i) aggregate the received personal retraining data, (ii) select a subset of the aggregated personal retraining data as global retraining data for retraining a global, scoring e-mail classifier that classifies messages received at a message gateway as spam when a score for the messages exceeds a second threshold for classifying the messages as spam, the second threshold being higher than the first threshold, and (iii) retrain the global, scoring e-mail classifier based on the global retraining data so as to adjust which messages are classified as spam.

Current Assignee: Yahoo Assets LLC
Original Assignee: AOL Inc. (Apollo Global Management, Inc.)
Inventors: ALSPECTOR, Joshua; KOLCZ, Aleksander
Granted Patent: US 9,270,625 B2

Field of Search
US Class Current

709/206
CPC Class Codes

G06F 16/353   into predefined classes

G06Q 10/107   Computer-aided management o...

H04L 51/212   using filtering or selectiv...

H04L 51/48   Message addressing, e.g. ad...
