×

Document categorizing method, document categorizing apparatus, and storage medium on which a document categorization program is stored

  • US 7,213,205 B1
  • Filed: 06/02/2000
  • Issued: 05/01/2007
  • Est. Priority Date: 06/04/1999
  • Status: Expired due to Fees
First Claim
Patent Images

1. A document categorizing method for categorizing a plurality of documents in an electronic system according to semantic similarity, said method comprising:

  • obtaining a plurality of clusters of documents, each cluster having a distinctive name;

    evaluating a degree of relation between at least two clusters by evaluating the similarity between the evaluated clusters based on the documents included in the respective evaluated clusters;

    merging the evaluated clusters into a new combined cluster when their degree of relation is determined to be not less than a predetermined first value; and

    assigning a new name to said new combined cluster based on the degree of relation between its constituent evaluated clusters;

    wherein;

    if the degree of relation of said constituent evaluated clusters is less then a second predetermined value, which is greater than said first predetermined value, the new name assigned to said new combined cluster conforms to a first naming convention indicative of a degree of relation between said first and second predetermined values; and

    if the degree of relation of said constituent evaluated clusters is not less then said second predetermined value, the new name assigned to said new combined cluster conforms to a second naming convention indicative of a degree of relation not less than said second predetermined value; and

    wherein;

    said first naming convention includes a concatenation of at least a name segment of each of said constituent evaluated clusters with a first delimiter inserted between the concatenated name segments; and

    said second naming convention includes a concatenation of at least a name segment of each of said constituent evaluated clusters with a second delimiter, different from said first delimiter, inserted between the concatenated name segments.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×