Systems and methods for generating a message topic training dataset from user interactions in message clients
First Claim
1. A method of classifying messages, the method comprising:
- at a computer system having one or more processors, and memory storing one or more programs configured for execution by the one or more processors;
classifying, according to a first message classifier, each message in a first plurality of messages, thereby independently identifying a respective original message category in a set of message categories for each message in the first plurality of messages;
delivering the first plurality of messages to a plurality of recipients with a designation of the respective original message category of each message in the first plurality of messages;
collecting a plurality of recipient initiated message interaction events for messages in the first plurality of messages from the plurality of recipients, including a plurality of priority designation assignments and a plurality of message category change events in which a recipient has changed the category of a message from the original message category assigned by the first message classifier to an updated message category in the set of message categories;
identifying a subset of the first plurality of messages as having trusted message category assignments based on the plurality of recipient initiated message interaction events;
constructing a message categorization dataset from the identified subset, comprising, for each message from the identified subset, information identifying the message, information corresponding to the currently assigned message category for the message, and information corresponding to a level of confidence in the message category for the message;
training a second message classifier using the message categorization dataset;
after training the second message classifier, classifying each message in a second plurality of messages using the second message classifier; and
delivering the second plurality of messages to the plurality of recipients with a designation of the respective message category of each message in the second plurality of messages, as determined by the second message classifier;
wherein;
the message categorization dataset comprises a subset of the first plurality of messages;
the classifying is performed by the first message classifier that was constructed at a time prior to the classifying;
the plurality of recipient initiated message interaction events include events in which the recipient changes the category of a message from the original message category assigned by the first message classifier to an updated message category in the set of message categories; and
the constructing includes selecting, for the subset of the first plurality of messages, those messages in the first plurality of messages that have undergone a recipient initiated message interaction event.
2 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods for classifying messages are provided. Each message in a plurality of messages is classified, thereby independently identifying a message category in a set of message categories for each respective message in the plurality of messages. The plurality of messages is delivered to a plurality of recipients with a designation of the message category of each respective message in the first plurality of messages. A plurality of recipient initiated message interaction events for messages in the first plurality of messages over a predetermined period of time is collected from the plurality of recipients. A message categorization dataset is then constructed from (i) the first plurality of messages, (ii) the designation of the message category of each respective message in the subset of the first plurality of messages, and (iii) the plurality of recipient initiated message interaction events. This message categorization dataset is used to train or evaluate a message classifier.
7 Citations
16 Claims
-
1. A method of classifying messages, the method comprising:
- at a computer system having one or more processors, and memory storing one or more programs configured for execution by the one or more processors;
classifying, according to a first message classifier, each message in a first plurality of messages, thereby independently identifying a respective original message category in a set of message categories for each message in the first plurality of messages; delivering the first plurality of messages to a plurality of recipients with a designation of the respective original message category of each message in the first plurality of messages; collecting a plurality of recipient initiated message interaction events for messages in the first plurality of messages from the plurality of recipients, including a plurality of priority designation assignments and a plurality of message category change events in which a recipient has changed the category of a message from the original message category assigned by the first message classifier to an updated message category in the set of message categories; identifying a subset of the first plurality of messages as having trusted message category assignments based on the plurality of recipient initiated message interaction events; constructing a message categorization dataset from the identified subset, comprising, for each message from the identified subset, information identifying the message, information corresponding to the currently assigned message category for the message, and information corresponding to a level of confidence in the message category for the message; training a second message classifier using the message categorization dataset; after training the second message classifier, classifying each message in a second plurality of messages using the second message classifier; and delivering the second plurality of messages to the plurality of recipients with a designation of the respective message category of each message in the second plurality of messages, as determined by the second message classifier; wherein; the message categorization dataset comprises a subset of the first plurality of messages; the classifying is performed by the first message classifier that was constructed at a time prior to the classifying; the plurality of recipient initiated message interaction events include events in which the recipient changes the category of a message from the original message category assigned by the first message classifier to an updated message category in the set of message categories; and the constructing includes selecting, for the subset of the first plurality of messages, those messages in the first plurality of messages that have undergone a recipient initiated message interaction event. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 16)
- at a computer system having one or more processors, and memory storing one or more programs configured for execution by the one or more processors;
-
14. A computing system, comprising:
-
one or more processors; memory storing one or more programs to be executed by the one or more processors, wherein the one or more programs comprise instructions for; classifying, according to a first message classifier, each message in a first plurality of messages, thereby independently identifying a respective original message category in a set of message categories for each respective message in the first plurality of messages; delivering the first plurality of messages to a plurality of recipients with a designation of the respective original message category of each message in the first plurality of messages; collecting a plurality of recipient initiated message interaction events for messages in the first plurality of messages from the plurality of recipients, including a plurality of priority designation assignments and a plurality of message category change events in which a recipient has changed the category of a message from the original message category assigned by the first message classifier to an updated message category in the set of message categories; identifying a subset of the first plurality of messages as having trusted message category assignments based on the plurality of recipient initiated message interaction events; constructing a message categorization dataset from the identified subset, comprising, for each message from the identified subset, information identifying the message, information corresponding to the currently assigned message category for the message, and information corresponding to a level of confidence in the message category for the message; training a second message classifier using the message categorization dataset; after training the second message classifier, classifying each message in a second plurality of messages using the second message classifier; and delivering the second plurality of messages to the plurality of recipients with a designation of the respective message category of each message in the second plurality of messages, as determined by the second message classifier; wherein; the message categorization dataset comprises a subset of the first plurality of messages; the classifying is performed by the first message classifier that was constructed at a time prior to the classifying; the plurality of recipient initiated message interaction events include events in which the recipient changes the category of a message from the original message category assigned by the first message classifier to an updated message category in the set of message categories; and the constructing includes selecting, for the subset of the first plurality of messages, those messages in the first plurality of messages that have undergone a recipient initiated message interaction event.
-
-
15. A non-transitory computer readable storage medium storing one or more programs configured for execution by a computer, the one or more programs comprising instructions for:
-
classifying, according to a first message classifier, each message in a first plurality of messages, thereby independently identifying a respective original message category in a set of message categories for each respective message in the first plurality of messages; delivering the first plurality of messages to a plurality of recipients with a designation of the respective original message category of each message in the first plurality of messages; collecting a plurality of recipient initiated message interaction events for messages in the first plurality of messages from the plurality of recipients, including a plurality of priority designation assignments and a plurality of message category change events in which a recipient has changed the category of a message from the original message category assigned by the first message classifier to an updated message category in the set of message categories; identifying a subset of the first plurality of messages as having trusted message category assignments based on the plurality of recipient initiated message interaction events; constructing a message categorization dataset from the identified subset, comprising, for each message from the identified subset, information identifying the message, information corresponding to the currently assigned message category for the message, and information corresponding to a level of confidence in the message category for the message; training a second message classifier using the message categorization dataset; after training the second message classifier, classifying each message in a second plurality of messages using the second message classifier; and delivering the second plurality of messages to the plurality of recipients with a designation of the respective message category of each message in the second plurality of messages, as determined by the second message classifier; wherein; the message categorization dataset comprises a subset of the first plurality of messages; the classifying is performed by the first message classifier that was constructed at a time prior to the classifying; the plurality of recipient initiated message interaction events include events in which the recipient changes the category of a message from the original message category assigned by the first message classifier to an updated message category in the set of message categories; and the constructing includes selecting, for the subset of the first plurality of messages, those messages in the first plurality of messages that have undergone a recipient initiated message interaction event.
-
Specification