×

Computer implemented scalable, incremental and parallel clustering based on weighted divide and conquer

  • US 6,684,177 B2
  • Filed: 05/10/2001
  • Issued: 01/27/2004
  • Est. Priority Date: 05/10/2001
  • Status: Expired due to Term
First Claim
Patent Images

1. A method for clustering a set S of n data points to find k final centers, comprising:

  • partitioning said set S into P disjoint pieces S1, . . . ,SP;

    for each said piece Si, determining a set Di of k intermediate centers;

    assigning each data point in each piece Si to the nearest one of said k intermediate centers;

    weighting each of said k intermediate centers in each set Di by the number of points in the corresponding piece Si assigned to that center; and

    clustering said weighted intermediate centers together to find said k final centers, said clustering performed using a specific error metric and a clustering method A; and

    wherein if P is not sufficiently large enough such that each piece Si obeys the constraint |Si|<

    M, where M is the size of a physical memory or a portion thereof to be used in processing said each piece, then iteratively performing partitioning, determining, assigning, and weighting until the sets D′

    of weighted intermediate centers generated thereby obeys the constraint |D′

    |<

    M.

View all claims
  • 9 Assignments
Timeline View
Assignment View
    ×
    ×