Distributed clustering method and system
First Claim
Patent Images
1. A method for clustering data in a system having an integrator and at least two computing units comprising:
- (a) loading each computing unit with common global parameter values and a particular local data set;
(b) each computing unit generating local sufficient statistics based on the local data set and global parameter values; and
(c) employing the local sufficient statistics of all the computing units to update the global parameter values.
9 Assignments
0 Petitions
Accused Products
Abstract
A distributed data clustering system having an integrator and at least two computing units. Each computing unit is loaded with common global parameter values and a particular local data set. Each computing unit then generates local sufficient statistics based on the local data set and global parameter values. The integrator employs the local sufficient statistics of all the computing units to update the global parameter values.
-
Citations
20 Claims
-
1. A method for clustering data in a system having an integrator and at least two computing units comprising:
-
(a) loading each computing unit with common global parameter values and a particular local data set;
(b) each computing unit generating local sufficient statistics based on the local data set and global parameter values; and
(c) employing the local sufficient statistics of all the computing units to update the global parameter values. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A distributed data clustering system comprising:
-
(a) a first computing unit for performing data clustering based on a first local data set that is a subset of data points to be clustered and global parameter values to generate first local sufficient statistics;
(b) a second computing unit for performing data clustering based on a second local data set that is a subset of the data points to be clustered and global parameter values to generate second local sufficient statistics; and
(c) a integrator unit for receiving the first and second local sufficient statistics from the first and second computing units, respectively, and for employing the first and second local sufficient statistics to update the global parameter values. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
Specification