METHOD AND SYSTEM FOR SUMMARIZING EMAILS AND EXTRACTING TASKS
Abstract
The disclosed invention performs a set of operations on an email to analyze the text and generate a coherent summary. Email summaries are generated by applying a coherence layer after a ranking process. Analyzing how sentences relate to each other via discourse markers and other linguistic devices aids in enhancing coherence of the email summaries. Output summaries are more coherent and easier to understand because they mimic the flow of ideas contained in the original message instead of merely being a collection of extracted sentences. Tasks may also be extracted from the text of the email to assist users in keeping track of tasks that they receive via email.
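The idea of applying a coherence layer after ranking can be illustrated with a minimal Python sketch. It assumes the sentences have already been ranked and a set of indices selected. The marker list and the rule itself (when a selected sentence opens with a discourse marker, its preceding sentence is pulled into the summary) are hypothetical illustrations, not the post-processing rules of the disclosed invention.

```python
# Hypothetical discourse markers; the source does not enumerate a list.
DISCOURSE_MARKERS = ("however", "therefore", "also", "but", "so", "otherwise")


def starts_with_marker(sentence):
    """Return True if the sentence opens with one of the assumed markers."""
    words = sentence.strip().split()
    if not words:
        return False
    first = words[0].strip(",.;:").lower()
    return first in DISCOURSE_MARKERS


def apply_coherence_rule(sentences, selected):
    """Expand the selected sentence indices so that any selected sentence
    opening with a discourse marker brings its predecessor along, then
    return the summary sentences in their original order."""
    keep = set(selected)
    for i in sorted(selected):
        if i > 0 and starts_with_marker(sentences[i]):
            keep.add(i - 1)
    return [sentences[i] for i in sorted(keep)]


email = [
    "The demo is scheduled for Friday.",
    "However, the build is still failing.",
    "Lunch was great.",
    "Please send me the logs by Thursday.",
]
# Sentences 1 and 3 were selected by ranking; sentence 0 is added for context.
summary = apply_coherence_rule(email, {1, 3})
```

Keeping the output in original order is what lets the summary follow the flow of the message rather than the order of the scores.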
22 Claims
1. A method for summarizing emails, comprising:
parsing, by an email zone identifier module, different zones in which an email is organized;
processing text within the email, the processing comprising:
    tokenizing the text;
    lemmatizing the text;
    tagging the text with parts of speech;
    analyzing syntactic dependencies within the text; and
    extracting named entities;
classifying, by a speech act classifier module, each sentence of the text into one of a plurality of speech acts;
classifying, by a questions classifier module, sentences of the text that contain a question;
summarizing, by a conversational email summarizer module, the email into a compact and coherent summary, the summarizing comprising:
    ranking each sentence in the text with an importance score, and adding sentences to an initial email summary that have an importance score surpassing a threshold; and
    applying a set of post-processing rules to sentences in the initial email summary after ranking each sentence in the text to yield a final email summary; and
outputting the final email summary of the email.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11
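The ranking and thresholding steps recited in claim 1 could be sketched as follows. This is only an illustration under stated assumptions: the claim does not say how the importance score is computed, so the word-frequency score, the `STOPWORDS` set, the function names, and the duplicate-dropping post-processing rule are all hypothetical.

```python
import re
from collections import Counter

# Hypothetical stopword list, used only to keep the example small.
STOPWORDS = {"the", "a", "an", "is", "to", "of", "and", "for", "in", "on", "by"}


def content_words(sentence):
    """Lowercase the sentence and keep alphabetic tokens that are not stopwords."""
    return [w for w in re.findall(r"[a-z']+", sentence.lower()) if w not in STOPWORDS]


def importance_scores(sentences):
    """Assumed scoring: mean email-wide frequency of a sentence's content
    words, scaled so the top sentence scores 1.0."""
    freq = Counter(w for s in sentences for w in content_words(s))
    raw = []
    for s in sentences:
        words = content_words(s)
        raw.append(sum(freq[w] for w in words) / len(words) if words else 0.0)
    top = max(raw, default=0.0)
    return [r / top if top else 0.0 for r in raw]


def initial_summary(sentences, threshold):
    """Keep sentences whose importance score surpasses the threshold."""
    scores = importance_scores(sentences)
    return [s for s, score in zip(sentences, scores) if score > threshold]


def post_process(summary):
    """Assumed post-processing rule: drop exact duplicate sentences."""
    seen, out = set(), []
    for s in summary:
        if s not in seen:
            seen.add(s)
            out.append(s)
    return out


sentences = [
    "The build failed on the server.",
    "Please restart the build server.",
    "Thanks!",
]
final_summary = post_process(initial_summary(sentences, threshold=0.7))
```

With a threshold of 0.7 the first two sentences are retained and the sign-off is dropped; a real scorer would presumably also draw on the speech act and question classifications produced earlier in the method.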
12. A system for summarizing emails, the system comprising:
one or more processors, memory having instructions stored thereon, which when executed by the one or more processors, cause the processors to perform the following actions:
parsing different zones in which an email is organized;
tokenizing text within the email;
lemmatizing the text;
tagging the text with parts of speech;
analyzing syntactic dependencies within the text;
extracting named entities from the text;
classifying each sentence of the text into one of a plurality of speech acts;
classifying sentences of the text that contain a question;
summarizing the email into a compact and coherent summary, the summarizing comprising:
    ranking each sentence in the text with an importance score, and adding sentences to an initial email summary that have an importance score surpassing a threshold; and
    applying a set of post-processing rules to sentences in the initial email summary after ranking each sentence in the text to yield a final email summary; and
outputting the final email summary of the email.
Dependent claims: 13, 14, 15, 16, 17, 18, 19, 20, 21, 22
Specification