
System and method for generating a taxonomy from a plurality of documents

  • US 6,665,681 B1
  • Filed: 04/09/1999
  • Issued: 12/16/2003
  • Est. Priority Date: 04/09/1999
  • Status: Expired due to Term
First Claim

1. A system for generating a taxonomy for a database, the system comprising:

    a database including a plurality of pieces of text and a plurality of phrases extracted from the pieces of text;

    means for clustering the phrases extracted from the plurality of pieces of text in order to determine associations between the extracted phrases in the same piece of text and across the plurality of pieces of text;

    means for identifying a leader phrase from the clustered phrases in the database, the leader phrase being associated with a predetermined number of other phrases in the database;

    means for generating a first level of taxonomy based on the identified leader phrases, the leader phrases forming a first level of headings in a hierarchical topical outline; and

    means for generating a second level of the taxonomy based on phrases in the database associated with the leader phrases, the phrases being sub-headings underneath the leader phrases with which they are associated, the taxonomy reflecting the phrases extracted from the pieces of text in the database so that a user searches through the database using the final taxonomy.
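
The steps recited in claim 1 (cluster phrases, identify leader phrases, build a two-level outline) can be illustrated with a minimal Python sketch. It is not the patented implementation. The claim does not say how associations are computed, so the sketch assumes that two phrases are associated when they co-occur in at least one piece of text, and it reads "a predetermined number" as a minimum threshold. The names `build_taxonomy`, `phrases_by_text`, and `min_associations` are hypothetical and do not appear in the patent.

```python
from collections import defaultdict
from itertools import combinations


def build_taxonomy(phrases_by_text, min_associations=2):
    """Return {leader phrase: sorted associated phrases} as a two-level outline.

    phrases_by_text: one list of extracted phrases per piece of text.
    min_associations: hypothetical stand-in for the claim's
        "predetermined number" of associated phrases.
    """
    # Assumption: phrases are associated if they co-occur in the same piece
    # of text. Merging the per-text pairs into one map links phrases across
    # the whole collection.
    associations = defaultdict(set)
    for phrases in phrases_by_text:
        for a, b in combinations(sorted(set(phrases)), 2):
            associations[a].add(b)
            associations[b].add(a)

    # Leader phrases: associated with at least min_associations other phrases.
    leaders = [
        phrase
        for phrase, others in associations.items()
        if len(others) >= min_associations
    ]

    # First level: the leaders. Second level: phrases associated with each.
    return {leader: sorted(associations[leader]) for leader in sorted(leaders)}


if __name__ == "__main__":
    docs = [
        ["neural network", "backpropagation", "gradient descent"],
        ["neural network", "activation function"],
        ["gradient descent", "learning rate"],
    ]
    for heading, subheadings in build_taxonomy(docs, min_associations=3).items():
        print(heading)
        for sub in subheadings:
            print("    " + sub)
```

With the sample input and a threshold of 3, only "gradient descent" and "neural network" qualify as headings, and each lists the three phrases it co-occurs with as sub-headings. A real system would likely use a stronger association measure than raw co-occurrence; the threshold alone decides how many first-level headings appear.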
