Using message sampling to determine the most frequent words in a user mailbox
First Claim
1. A server computer comprising processing and communication resources and configured to:
- receive a request to provide a list of frequently used words in email messages stored in a user mailbox;
if a word frequency list does not exist, start an asynchronous process in part to provide the word frequency list, the asynchronous process operates to;
randomly sample server memory blocks that include email messages and parse a predetermined number of the email messages and generate one or more unique words for each of the predetermined number of email messages; and
generate a word frequency list for the one or more unique words, the word frequency list providing a count associated with the predetermined number of email messages in which each unique word is found;
use the asynchronous process to regenerate the word frequency list if existing and when an age of the word frequency list is greater than an aging limit; and
store the word frequency list as part of a user mailbox including a timestamp to indicate a date and time when the word frequency list was created or updated for the user mailbox.
1 Assignment
0 Petitions
Accused Products
Abstract
A method is presented for generating a list of frequently used words for an email application on a server computer. When a request is received for a word frequency list for emails stored in a user'"'"'s mailbox, a word frequency list is returned if one exists. If the word frequency list does not exist, an asynchronous process is started on the server computer to generate a word frequency list. If the word frequency list exists but it is older than an aging limit, an asynchronous process is started on the server computer to regenerate the word frequency list. The word frequency list is stored in the user'"'"'s mailbox along with a timestamp indicating the date and time that the list was created or updated.
-
Citations
19 Claims
-
1. A server computer comprising processing and communication resources and configured to:
-
receive a request to provide a list of frequently used words in email messages stored in a user mailbox; if a word frequency list does not exist, start an asynchronous process in part to provide the word frequency list, the asynchronous process operates to; randomly sample server memory blocks that include email messages and parse a predetermined number of the email messages and generate one or more unique words for each of the predetermined number of email messages; and generate a word frequency list for the one or more unique words, the word frequency list providing a count associated with the predetermined number of email messages in which each unique word is found; use the asynchronous process to regenerate the word frequency list if existing and when an age of the word frequency list is greater than an aging limit; and store the word frequency list as part of a user mailbox including a timestamp to indicate a date and time when the word frequency list was created or updated for the user mailbox. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A method comprising:
-
requesting a list of frequently used words in email messages stored in a user mailbox; using an asynchronous process as part of randomly selecting a first server memory block to identify one or more unique words associated with a number of parsed email messages of the first server memory block to generate a word frequency list to include the one or more unique words associated with the number of parsed email messages of the first server memory block; randomly selecting a second server memory block to identify one or more unique words associated with a number of parsed email messages of the second server memory block and include the one or more unique words associated with the number of parsed email messages of the second server memory block in the word frequency list; updating a count in the word frequency list for each unique word identified in the number of parsed email messages of the second server memory block already included in the word frequency list; using the count to identify a number of email messages that includes each unique word of the word frequency list and using the asynchronous process to regenerate the word frequency list if an age of the word frequency list is greater than an aging limit. - View Dependent Claims (12, 13, 14, 15)
-
-
16. A system including server computer memory, the system configured to:
-
use a list of frequently used words in stored email messages that includes the use of an asynchronous process to generate a word frequency list and regenerate an existing word frequency list based in part on an aging limit, wherein the system uses the asynchronous process to regenerate the existing word frequency list when an age of the existing word frequency list is greater than an aging limit; use a list of one or more server memory blocks that contain email messages for a user mailbox, each of the one or more server memory blocks to store user email messages; randomly use server memory blocks from the list of one or more server memory blocks; parse one or more of the email messages of randomly used server memory blocks to identify unique words of the one or more email messages; create the word frequency list for parsed email messages to include each unique word that is assigned a count in the word frequency list, wherein the count is updated for each parsed email in which each unique word is found; and store a frequency count of each unique word for the user mailbox. - View Dependent Claims (17, 18, 19)
-
Specification