×

Varying cluster number in a scalable clustering system for use with large databases

  • US 6,449,612 B1
  • Filed: 06/30/2000
  • Issued: 09/10/2002
  • Est. Priority Date: 03/17/1998
  • Status: Expired due to Term
First Claim
Patent Images

1. In a computer system, a method for characterizing data into clusters comprising the steps of:

  • a) providing a candidate cluster set for characterizing a database of data stored on a storage medium, wherein the candidate cluster set includes two or more clustering models having a different number of cluster in their clustering model;

    b) reading a data portion from the database and determining how the data portion fits clustering model within the candidate cluster set;

    c) choosing a best fit of the data portion to determine a selected clustering model from the candidate cluster set and then using the cluster number of said selected clustering model to update the selected clustering model using data portions from the database; and

    d) updating the clustering model using newly sampled data from the database until a specified clustering criteria has been satisfied.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×