×

Automated category discovery for a terminological knowledge base

  • US 6,513,027 B1
  • Filed: 03/16/1999
  • Issued: 01/28/2003
  • Est. Priority Date: 03/16/1999
  • Status: Expired due to Term
First Claim
Patent Images

1. A method for automated generation of sub-categories from categories of a terminological knowledge base, said method comprising the steps of:

  • storing a corpus of documents, wherein a document comprises a plurality of themes, and a theme identifies thematic content contained within said document;

    storing a knowledge base comprising a plurality of hierarchically arranged categories, wherein a subset of said categories of said knowledge base comprise dimensional categories;

    selecting a target category in said knowledge base to generate at least one new sub-category, said target category comprising a plurality of terms classified within said target category, such that one or more groups of terms associated with said target category are divided for association with said new sub-category;

    selecting, for each term classified within said target category, a plurality of themes from said corpus of documents;

    generating a plurality of dimensional category vectors, one for each term, by associating said themes selected for a term to a dimensional category;

    determining if one or more terminological groups of terms exist in said knowledge base by clustering said dimensional category vectors for each term; and

    selecting, as said new sub-category for said target category, one or more terminological groups discovered.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×