Method for categorizing documents into subjects using relevance normalization for documents retrieved from an information retrieval system in response to a query

  • US 5,717,914 A
  • Filed: 09/15/1995
  • Issued: 02/10/1998
  • Est. Priority Date: 09/15/1995
  • Status: Expired due to Term
First Claim

1. A subjector for selectively storing input information in an information retrieval system database, said input information being formed of a collection of English language words, comprising:

    at least one information subject category within said information retrieval system database;

    a first plurality of subjected documents selected from said information retrieval system database and relating to said first information subject category;

    a preliminary lexicon determined in accordance with said first plurality of subjected documents wherein an information comparing unit first compares selected documents with said preliminary lexicon;

    a second plurality of documents selected in accordance with said first comparing, wherein a determination is made whether documents of said second plurality of documents belong in said first subject category and documents are removed from said second plurality of documents in accordance with said determining whether said documents belong in said second plurality of documents to provide a remaining third plurality of documents, and said information comparing unit second compares said third plurality of documents to determine said first subject lexicon in accordance with said third plurality of documents;

    said first subject lexicon corresponding to said information subject category and containing information representative of said information subject category, said first subject lexicon containing a plurality of classifier words, wherein generally all of said classifier words in said first subject lexicon contain at least one English language word;

    an information comparing unit for third comparing said collection of English language words from said input information with said classifier words from said first subject lexicon, wherein said collection of English language words from said input information form at least one document; and

    memory for storing said input information in said information subject category in accordance with said third comparing.
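
The flow recited in claim 1 can be illustrated with a short Python sketch. This is only one possible reading, not the patented implementation: the claim does not say how a lexicon is computed, how documents are compared with it, or how membership in the category is determined. The document-frequency lexicon, the overlap score, the `threshold` and `min_df` parameters, the small stopword set, and the `belongs` callback (standing in for the membership determination) are all assumptions made for illustration.

```python
import re
from collections import Counter

# Hypothetical stopword set; the claim does not define one.
STOPWORDS = {"a", "an", "and", "as", "in", "of", "on", "the"}


def tokenize(text):
    """Split text into lowercase alphabetic words."""
    return re.findall(r"[a-z]+", text.lower())


def build_lexicon(documents, min_df=2):
    """Assumed lexicon rule: keep non-stopwords found in at least min_df documents."""
    doc_freq = Counter()
    for doc in documents:
        doc_freq.update(set(tokenize(doc)))
    return {w for w, n in doc_freq.items() if n >= min_df and w not in STOPWORDS}


def score(text, lexicon):
    """Assumed comparison: fraction of the document's words that are classifier words."""
    words = tokenize(text)
    if not words:
        return 0.0
    return sum(w in lexicon for w in words) / len(words)


def derive_subject_lexicon(seed_docs, candidates, belongs, threshold=0.2, min_df=2):
    """Preliminary lexicon -> first comparing -> removal -> second comparing."""
    preliminary = build_lexicon(seed_docs, min_df)
    # First comparing: select the second plurality of documents.
    second = [d for d in candidates if score(d, preliminary) >= threshold]
    # Determination step: drop documents judged not to belong (third plurality remains).
    third = [d for d in second if belongs(d)]
    # Second comparing: derive the subject lexicon from the third plurality.
    return build_lexicon(third, min_df)


def store(document, lexicon, database, category, threshold=0.2):
    """Third comparing: store the input document under the category if it matches."""
    if score(document, lexicon) >= threshold:
        database.setdefault(category, []).append(document)
        return True
    return False
```

In this sketch `database` is a plain dictionary mapping a category name to a list of stored documents, and `belongs` could be a manual review step or any other test. Both are placeholders for whatever the information retrieval system actually uses.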
