Methods and apparatus for extraction of content from an email or email threads for use in providing implicit profile attributes and content for recommendation engines
First Claim
1. An automated, computerized method for extracting content from an email, comprising:
- removing any replied content from the email based on at least one of known delimiters and known email thread patterns used for separating original and reply emails;
analyzing the email to locate any generated signature patterns for the sender and any known signature patterns, and removing any signature content from the email based on any of the located generated signature patterns for the sender and any of the located known signature patterns;
analyzing the email to locate any generated greeting patterns for the recipient and any known greeting patterns, and removing any greeting content from the email based on any of the located generated greeting patterns for the recipient and any of the located known greeting patterns;
removing any sent from content identifying a device or an email client from which the email was sent based on known sent from content; and
outputting remaining email text for further processing;
wherein said further processing comprises;
analyzing the remaining email text for use in augmenting a sender'"'"'s implicit profile by deriving key words from the remaining email text and storing the key words as a part of the sender'"'"'s implicit profile, the sender'"'"'s implicit profile being located in a recommendation engine database; and
analyzing the remaining email text for question and answer content on specific topics and storing the question and answer content in the recommendation engine database.
3 Assignments
0 Petitions
Accused Products
Abstract
Methods and apparatus for extracting content from an email or email thread are provided. Any replied content is removed from the email based on at least one of known delimiters and known email thread patterns used for separating original and reply emails. Any signature content is removed based on at least one of generated signature patterns for the sender and known signature patterns. Any greeting content is removed based on at least one of generated greeting patterns for the recipient and known greeting patterns. Any sent from content identifying a device or an email client from which the email was sent is removed based on known sent from content. The remaining email text can then be output for further processing, such as analyzing the text for use in augmenting a sender'"'"'s implicit profile, and analyzing the text for question or answer content on specific topics.
-
Citations
32 Claims
-
1. An automated, computerized method for extracting content from an email, comprising:
-
removing any replied content from the email based on at least one of known delimiters and known email thread patterns used for separating original and reply emails; analyzing the email to locate any generated signature patterns for the sender and any known signature patterns, and removing any signature content from the email based on any of the located generated signature patterns for the sender and any of the located known signature patterns; analyzing the email to locate any generated greeting patterns for the recipient and any known greeting patterns, and removing any greeting content from the email based on any of the located generated greeting patterns for the recipient and any of the located known greeting patterns; removing any sent from content identifying a device or an email client from which the email was sent based on known sent from content; and outputting remaining email text for further processing; wherein said further processing comprises; analyzing the remaining email text for use in augmenting a sender'"'"'s implicit profile by deriving key words from the remaining email text and storing the key words as a part of the sender'"'"'s implicit profile, the sender'"'"'s implicit profile being located in a recommendation engine database; and analyzing the remaining email text for question and answer content on specific topics and storing the question and answer content in the recommendation engine database. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30)
-
-
31. A computerized system for extracting content from an email, comprising:
-
a computer processor running an email accumulation software application which is enabled to access stored user emails; an email account management module adapted to receive the stored user emails from the email accumulation software application; a self-learning module in communication with the email account management module for analyzing the user emails to determine at least one of user specific email thread patterns, user specific greetings, and user specific signatures; a storage bank for storing the user specific email thread patterns, the user specific greetings, and the user specific signatures together with a user identifier or as part of a user profile associated with a recommendation engine; and an email extraction module adapted for processing each of the emails, said processing comprising; removing any replied content from the email based on at least one of known delimiters for separating original and reply emails, known email thread patterns, and user specific email thread patterns; analyzing the email to locate any generated signature patterns for the sender, any known signature patterns, and any user specific signature patterns, and removing any signature content from the email based on any of the located generated signature patterns for the sender, any of the located known signature patterns, and any of the located user specific signatures; analyzing the email to locate any generated greeting patterns for the recipient, any known greeting patterns, and any user specific greeting patterns, and removing any greeting content from the email based on any of the located generated greeting patterns for the recipient, any of the located known greeting patterns, and any of the located user specific greetings; removing any sent from content identifying a device or an email client from which the email was sent based on known sent from content; and outputting remaining email text for further processing; wherein said further processing comprises; analyzing the remaining email text for use in augmenting a sender'"'"'s implicit profile by deriving key words from the remaining email text and storing the key words as a part of the sender'"'"'s implicit profile, the sender'"'"'s implicit profile being located in a recommendation engine database; and analyzing the remaining email text for question and answer content on specific topics and storing the question and answer content in the recommendation engine database. - View Dependent Claims (32)
-
Specification