Methods and systems for e-mail topic classification
Abstract
A method for processing e-mails includes receiving a plurality of e-mails. For each e-mail in the plurality of e-mails, a feature representation is generated for an e-mail based on a set of noun phrases associated with the e-mail. A set of topics associated with the plurality of e-mails is generated based on the feature representation for each e-mail. Sentence structure associated with the e-mail and parts of speech associated with the e-mail may be determined. The parts of speech, including a set of noun phrases associated with the e-mail, may be used to generate the feature representation for the e-mail.
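The feature-representation step can be illustrated with a minimal sketch. It is not the patented implementation: the abstract calls for noun phrases derived from sentence structure and parts of speech, whereas this sketch substitutes a stop-word heuristic (runs of consecutive non-stop-words) so that it runs with the standard library alone. The TF-IDF scoring, the `STOPWORDS` list, and the function names `candidate_phrases` and `feature_representations` are all assumptions made for illustration.

```python
import math
import re
from collections import Counter

# Hypothetical stop-word list. A real system would use a part-of-speech
# tagger and a noun-phrase chunker instead of this heuristic.
STOPWORDS = frozenset(
    "a an and are as at be by for from has have i in is it of on or our "
    "please so that the this to was we were will with you your".split()
)


def candidate_phrases(text):
    """Return runs of consecutive non-stop-words as stand-in noun phrases."""
    phrases = []
    # Split on punctuation first so a phrase never crosses a clause boundary.
    for clause in re.split(r"[^\w\s'-]+", text.lower()):
        run = []
        for word in clause.split():
            if word in STOPWORDS:
                if run:
                    phrases.append(" ".join(run))
                run = []
            else:
                run.append(word)
        if run:
            phrases.append(" ".join(run))
    return phrases


def feature_representations(emails):
    """Map each e-mail body to a {phrase: score} dict using smoothed TF-IDF."""
    counts_per_email = [Counter(candidate_phrases(body)) for body in emails]

    # Document frequency: in how many e-mails does each phrase occur?
    doc_freq = Counter()
    for counts in counts_per_email:
        doc_freq.update(counts.keys())

    n = len(emails)
    features = []
    for counts in counts_per_email:
        total = sum(counts.values()) or 1
        features.append({
            phrase: (count / total)
            * (math.log((1 + n) / (1 + doc_freq[phrase])) + 1.0)
            for phrase, count in counts.items()
        })
    return features


if __name__ == "__main__":
    sample = [
        "The quarterly budget is due Friday.",
        "Please review the quarterly budget.",
        "Lunch menu for the team offsite.",
    ]
    for body, scores in zip(sample, feature_representations(sample)):
        print(body, scores)
```

Under this scoring, a phrase that recurs across many e-mails receives a lower weight within any single e-mail than a phrase unique to it, which is one common way to "score selected noun phrases"; the patent text itself does not name a scoring formula in this section.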
21 Claims
1. A method for processing e-mails, the method comprising:
- receiving, at one or more computer systems, a plurality of e-mails obtained from an e-mail repository;
- storing information representing each e-mail in the plurality of e-mails in a storage device associated with the one or more computer systems, stored information representing a given e-mail including at least one of content of the given e-mail and message attribute data of the given e-mail;
- for each e-mail in the plurality of e-mails,
  - determining, with one or more processors associated with the one or more computer systems, a set of noun phrases included in the e-mail based on stored information representing the e-mail, and
  - generating, with the one or more processors associated with the one or more computer systems, a feature representation for the e-mail based on the determined set of noun phrases included in the e-mail, a feature representation for a given e-mail scoring selected noun phrases in a set of noun phrases determined to be included in the given e-mail;
- generating, with the one or more processors associated with the one or more computer systems, a set of topics based on the feature representation generated for each e-mail in the plurality of e-mails; and
- storing each topic in the set of topics in a storage device associated with the one or more computer systems in association with at least some of the plurality of e-mails.
Dependent claims: 2, 3, 4, 5, 6, 7
8. A non-transitory computer-readable medium storing a set of code modules which when executed by a processor of a computer system cause the processor to process e-mails, the computer-readable medium comprising:
- code for receiving a plurality of e-mails obtained from an e-mail repository;
- code for storing information representing each e-mail in the plurality of e-mails, stored information representing a given e-mail including at least one of content of the given e-mail and message attribute data of the given e-mail;
- code for determining, for each e-mail in the plurality of e-mails, a set of noun phrases included in an e-mail based on the stored information representing the e-mail;
- code for generating, for each e-mail in the plurality of e-mails, a feature representation for an e-mail based on a determined set of noun phrases included in the e-mail, a feature representation for a given e-mail scoring selected noun phrases in a determined set of noun phrases included in the given e-mail;
- code for generating a set of topics based on a feature representation generated for each e-mail in the plurality of e-mails; and
- code for storing each topic in the set of topics in association with at least some of the plurality of e-mails.
Dependent claims: 9, 10, 11, 12, 13, 14
15. A system for processing e-mails, the system comprising:
- a processor; and
- a memory coupled to the processor, the memory configured to store a set of instructions which when executed by the processor configure the processor to:
  - receive a plurality of e-mails obtained from an e-mail repository;
  - store information representing each e-mail in the plurality of e-mails, stored information representing a given e-mail including at least one of content of the given e-mail and message attribute data of the given e-mail;
  - for each e-mail in the plurality of e-mails, determine a set of noun phrases included in the e-mail based on stored information representing the e-mail, and generate a feature representation for the e-mail based on the determined set of noun phrases included in the e-mail, a feature representation for a given e-mail scoring selected noun phrases in a set of noun phrases determined to be included in the given e-mail;
  - generate a set of topics based on the feature representation generated for each e-mail in the plurality of e-mails; and
  - store each topic in the set of topics in association with at least some of the plurality of e-mails.
Dependent claims: 16, 17, 18, 19, 20, 21
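The independent claims each end with generating a set of topics from the per-e-mail feature representations and storing every topic in association with at least some of the e-mails. The claims do not say how the topics are derived, so the following is only one possible reading, sketched under stated assumptions: each e-mail nominates its highest-scoring phrases, and a phrase becomes a topic when enough e-mails nominate it. The function name `generate_topics`, the parameters `top_k` and `min_emails`, the e-mail identifiers, and the in-memory dict standing in for the storage device are all hypothetical.

```python
from collections import defaultdict


def generate_topics(features, top_k=2, min_emails=2):
    """Derive topics from per-e-mail phrase scores.

    features: {email_id: {phrase: score}}
    Returns {topic_phrase: [email_id, ...]}, a stand-in for storing each
    topic in association with the e-mails it covers.
    """
    nominations = defaultdict(list)
    for email_id, scores in features.items():
        # Each e-mail nominates its top_k phrases; ties break alphabetically
        # so the result is deterministic.
        best = sorted(scores, key=lambda p: (-scores[p], p))[:top_k]
        for phrase in best:
            nominations[phrase].append(email_id)

    # Keep only phrases nominated by at least min_emails e-mails.
    return {
        phrase: sorted(ids)
        for phrase, ids in nominations.items()
        if len(ids) >= min_emails
    }


if __name__ == "__main__":
    # Hypothetical scores, e.g. produced by a noun-phrase scoring step.
    example = {
        "m1": {"quarterly budget": 0.6, "due friday": 0.8},
        "m2": {"quarterly budget": 0.7, "review": 0.2},
        "m3": {"lunch menu": 0.9},
    }
    print(generate_topics(example))
```

In the example, the e-mail `m3` ends up with no topic, which is consistent with the claim wording that topics are stored in association with "at least some" of the e-mails rather than all of them.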
Specification