×

Method and System for Seed Based Clustering of Categorical Data

  • US 20070271292A1
  • Filed: 07/12/2006
  • Published: 11/22/2007
  • Est. Priority Date: 05/16/2006
  • Status: Active Grant
First Claim
Patent Images

1. A computerized method of representing a dataset with a taxonomy, comprising:

  • augmenting a dataset containing a plurality of records with a plurality of predetermined exemplars;

    representing the plurality of records and predetermined exemplars within the augmented dataset as a plurality of clusters in an initial taxonomy layer;

    generating a truncated hierarchy of cluster sets based on clusters within the initial taxonomy layer, wherein clusters within the truncated hierarchy contain no more than a predetermined number of exemplars; and

    labeling clusters within the truncated hierarchy.

View all claims
  • 3 Assignments
Timeline View
Assignment View
    ×
    ×