Summarization of email on a client computing device based on content contribution to an email thread using classification and word frequency considerations
First Claim
1. A method comprising:
- classifying, by a computing device executing an email application, a baseline set of email messages received and sent during a predefined baseline period of time, wherein each email message in the baseline set is classified as either important or not-important;
wherein a given email message in the baseline set is more likely to be classified as important rather than not-important, when the given email message was addressed personally to a user of the computing device executing the email application, and not to a group comprising the user;
by the computing device executing the email application, generating a first word-frequency table comprising words in email messages classified as important within the baseline set, wherein each word in the first word-frequency table is associated with a measure of frequency of occurrence of the word in the email messages classified as important;
by the computing device executing the email application, generating a second word-frequency table comprising words in email messages classified as not-important within the baseline set, which are not in the first word-frequency table, wherein each word in the second word-frequency table is associated with a measure of frequency of occurrence of the word in the email messages classified as not-important;
by the computing device executing the email application, generating a content summary of an email-thread comprising a plurality of email messages which are distinct from and created after the baseline set of email messages, wherein the content summary of the email-thread is generated based on, for each email message in the email-thread;
(i) an email-weight assigned to the email message in the email-thread, based at least in part on a proximity of the email message in the email-thread to a chronological median of the email-thread, and(ii) a word-weight assigned to some words in the email message in the email-thread, wherein for any given word the word-weight is based at least in part on;
(a) a measure of frequency of the given word obtained from one of the first word-frequency table and the second word-frequency table,(b) whether the given word is in the first word-frequency table versus in the second word-frequency table, and(c) the email-weight of the email message in the email-thread comprising the given word.
2 Assignments
0 Petitions
Accused Products
Abstract
Systems, methods, and computer-readable media are disclosed for enhancing an email application to automatically analyze an email thread and generate a compact content summary. The content summary is based on relative content contributions provided by the constituent email messages in the email thread. The content summary may be presented in a special window without disturbing or modifying the email thread or its constituent email messages. The distinctive content summary disclosed herein comprises certain sentences that are automatically gleaned from the email thread, analyzed relative to other sentences, and presented in a chronological sequence so that the user can quickly determine what the email thread is about and/or the current status of the conversation. The content summary is based on email weights, word weights, and intersecting sentence pairs.
128 Citations
20 Claims
-
1. A method comprising:
-
classifying, by a computing device executing an email application, a baseline set of email messages received and sent during a predefined baseline period of time, wherein each email message in the baseline set is classified as either important or not-important; wherein a given email message in the baseline set is more likely to be classified as important rather than not-important, when the given email message was addressed personally to a user of the computing device executing the email application, and not to a group comprising the user; by the computing device executing the email application, generating a first word-frequency table comprising words in email messages classified as important within the baseline set, wherein each word in the first word-frequency table is associated with a measure of frequency of occurrence of the word in the email messages classified as important; by the computing device executing the email application, generating a second word-frequency table comprising words in email messages classified as not-important within the baseline set, which are not in the first word-frequency table, wherein each word in the second word-frequency table is associated with a measure of frequency of occurrence of the word in the email messages classified as not-important; by the computing device executing the email application, generating a content summary of an email-thread comprising a plurality of email messages which are distinct from and created after the baseline set of email messages, wherein the content summary of the email-thread is generated based on, for each email message in the email-thread; (i) an email-weight assigned to the email message in the email-thread, based at least in part on a proximity of the email message in the email-thread to a chronological median of the email-thread, and (ii) a word-weight assigned to some words in the email message in the email-thread, wherein for any given word the word-weight is based at least in part on; (a) a measure of frequency of the given word obtained from one of the first word-frequency table and the second word-frequency table, (b) whether the given word is in the first word-frequency table versus in the second word-frequency table, and (c) the email-weight of the email message in the email-thread comprising the given word. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A non-transitory computer-readable medium storing instructions that, when executed by a computing device comprising one or more processors and computer-readable memory, cause the computing device to perform operations comprising:
-
executing an email application; classifying a baseline set of email messages received and sent during a predefined baseline period of time, wherein each email message in the baseline set is classified as either important or not-important; wherein a given email message in the baseline set is more likely to be classified as important rather than not-important when the given email message was addressed personally to a user of the computing device executing the email application, and not to a group comprising the user; generating a first word-frequency table comprising words in email messages classified as important within the baseline set, wherein each word in the first word-frequency table is associated with a measure of frequency of occurrence of the word in the email messages classified as important; generating a second word-frequency table comprising words in email messages classified as not-important within the baseline set, which are not in the first word-frequency table, wherein each word in the second word-frequency table is associated with a measure of frequency of occurrence of the word in the email messages classified as not-important; generating a content summary of an email-thread comprising a plurality of email messages which are distinct from and created after the baseline set of email messages, wherein the content summary of the email-thread is generated based on, for each email message in the email-thread; (i) an email-weight assigned to the email message in the email-thread, based at least in part on a proximity of the email message in the email-thread to a chronological median of the email-thread, and (ii) a word-weight assigned to some words in the email message in the email-thread, wherein for any given word the word-weight is based at least in part on; (a) a measure of frequency of the given word obtained from one of the first word-frequency table and the second word-frequency table, and (b) whether the given word is in the first word-frequency table versus in the second word-frequency table, and (c) the email-weight of the email message in the email-thread comprising the given word. - View Dependent Claims (12, 13, 14, 15, 16)
-
-
17. A computing device comprising one or more processors and computer-readable memory, wherein the computing device is configured to:
-
execute an email application; classify a baseline set of email messages received and sent during a predefined baseline period of time, wherein each email message in the baseline set is classified as either important or not-important; wherein a given email message in the baseline set is more likely to be classified as important rather than not-important when the given email message in the baseline set was addressed personally to a user of the computing device and not to a group comprising the user; generate a first word-frequency table comprising words in email messages classified as important within the baseline set, wherein each word in the first word-frequency table is associated with a measure of frequency of occurrence of the word in the email messages classified as important; generate a second word-frequency table comprising words in email messages classified as not-important within the baseline set, which are not in the first word-frequency table, wherein each word in the second word-frequency table is associated with a measure of frequency of occurrence of the word in the email messages classified as not-important; generate a content summary of an email-thread comprising a plurality of email messages which are distinct from and created after the baseline set of email messages, wherein the content summary of the email-thread is generated based on, for each email message in the email-thread; (i) an email-weight assigned to the email message in the email-thread, based at least in part on a proximity of the email message in the email-thread to a chronological median of the email-thread, and (ii) a word-weight assigned to some words in the email message in the email-thread, wherein for any given word the word-weight is based on one or more measures of frequency of the given word in one or more email messages in the email-thread. - View Dependent Claims (18, 19, 20)
-
Specification