×

Generation and use of an email frequent word list

  • US 9,165,056 B2
  • Filed: 06/19/2008
  • Issued: 10/20/2015
  • Est. Priority Date: 06/19/2008
  • Status: Active Grant
First Claim
Patent Images

1. A method for generating a mailbox specific frequent word list associated with a mailbox, comprising:

  • performing an index scan on catalogs to retrieve search data mapping words to emails containing the words, the search data provided across multiple mailboxes, the search data comprising an inverted index mapping each of the words to one or more email identifiers identifying each of the emails that contain each of the words;

    generating a universal frequent word list of the emails based on the search data, the universal frequent word list comprising the words contained in the search data and a word frequency associated with each of the words across the multiple mailboxes; and

    generating a plurality of mailbox specific frequent word lists based on the universal frequent word list, each of the plurality of mailbox specific frequent word lists corresponding to one of the multiple mailboxes, each of the plurality of mailbox specific frequent word lists comprising words contained in emails of the corresponding one of the multiple mailboxes and a frequency that the words appear in the emails of the corresponding one of the multiple mailboxes, wherein performing the index scan comprisesreceiving, from an external application, a request for at least one mailbox specific frequent word list of the plurality of mailbox specific frequent word lists,upon receiving the request, determining whether the universal frequent word list has been created,upon determining that the universal frequent word list has not been created, performing the index scan on catalogs to retrieve search data mapping words to emails containing the words,upon determining that the universal frequent word list has been created, determining whether the universal frequent word list is current,upon determining that the universal frequent word list is not current, performing the index scan on catalogs to retrieve search data mapping words to emails containing the words, andupon determining that the universal frequent word list is current, proceeding directly to generating the plurality of mailbox specific frequent word lists based on the universal frequent word list by filtering the words and the corresponding word frequencies associated with the mailbox.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×