×

Method and apparatus for automatically generating hierarchical categories from large document collections

  • US 5,819,258 A
  • Filed: 03/07/1997
  • Issued: 10/06/1998
  • Est. Priority Date: 03/07/1997
  • Status: Expired due to Term
First Claim
Patent Images

1. A method for automatically generating a cluster hierarchy from a large number of documents, the method comprising the steps of:

  • A. generating a set of unique tokens from the documents;

    B. modeling each document in a cluster with one or more of the tokens;

    C. extracting features from the modeled documents in the cluster;

    D. clustering the documents using the extracted features so that the documents in the cluster are subdivided into further clusters; and

    E. repeating steps B, C and D for each cluster generated in step D until a predetermined limit is reached.

View all claims
  • 3 Assignments
Timeline View
Assignment View
    ×
    ×