×

Lightweight subject indexing for e-mail collections

  • US 6,873,985 B2
  • Filed: 12/06/2001
  • Issued: 03/29/2005
  • Est. Priority Date: 12/06/2001
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method for creating a light weight subject index, comprising:

  • identifying, as candidate headwords, words in the subject lines of a collection of documents which are not listed in a user modified common word list;

    creating lexical contexts for identified candidate headwords;

    ranking the set of identified candidate headwords for a collection of documents and selecting among them for inclusion in an index; and

    listing selected candidate headwords based on the results of ranking and selection, wherein the lexical context for a candidate headword within a subject line is identified as the words to the left and the right of the candidate headword up to, but not including, a barrier word.

View all claims
  • 7 Assignments
Timeline View
Assignment View
    ×
    ×