Information processing apparatus, clustering method, and recording medium storing clustering program
First Claim
Patent Images
1. An information processing apparatus that clusters an input data set, comprising:
- a memory having stored thereon (i) a plurality of reference data sets that were previously clustered, the plurality of reference data sets correspond to a plurality of reference images, and (ii) a first reference parameter for clustering a first reference data set of the plurality of reference data sets, the first reference data set of the plurality of reference data sets including a plurality of clusters corresponding to a region of a respective one of the plurality of reference images, the first reference parameter being a model parameter of mixture distribution, the first reference parameter corresponding to at least one of a number of clusters in the first reference data set, and a centroid of data points in each of the plurality of clusters; and
at least one processor configured to execute computer readable instructions to,search the memory to obtain the first reference parameter of at least one of the plurality of reference data sets that is similar to the input data set,determine an initial value of the model parameter of mixture distribution of the input data set by combining (a) the first reference parameter of the first reference data set with (b) a second reference parameter of a second reference data set based on similarity between the input data set and the combined first and second reference data sets, at least one of the first reference parameter and the second reference parameter are obtained from the plurality of reference images,modify the initial value of the model parameter of mixture distribution of the input data set to match a probability density distribution of the input data set to generate an updated initial value, andcluster the probability density distribution of the input data set on a feature space of the input data set using the updated initial value of the model parameter of mixture distribution.
1 Assignment
0 Petitions
Accused Products
Abstract
An information processing apparatus, a clustering method, and a clustering program stored on a recording medium, each of which determines an initial value of model parameter of an input data set based on a model parameter of a reference data set that is similar to the input data set and is previously clustered, modifies the initial value so as to match the input data set, and to obtain a clustering result of the input data set using the updated initial value of model parameter.
18 Citations
20 Claims
-
1. An information processing apparatus that clusters an input data set, comprising:
-
a memory having stored thereon (i) a plurality of reference data sets that were previously clustered, the plurality of reference data sets correspond to a plurality of reference images, and (ii) a first reference parameter for clustering a first reference data set of the plurality of reference data sets, the first reference data set of the plurality of reference data sets including a plurality of clusters corresponding to a region of a respective one of the plurality of reference images, the first reference parameter being a model parameter of mixture distribution, the first reference parameter corresponding to at least one of a number of clusters in the first reference data set, and a centroid of data points in each of the plurality of clusters; and at least one processor configured to execute computer readable instructions to, search the memory to obtain the first reference parameter of at least one of the plurality of reference data sets that is similar to the input data set, determine an initial value of the model parameter of mixture distribution of the input data set by combining (a) the first reference parameter of the first reference data set with (b) a second reference parameter of a second reference data set based on similarity between the input data set and the combined first and second reference data sets, at least one of the first reference parameter and the second reference parameter are obtained from the plurality of reference images, modify the initial value of the model parameter of mixture distribution of the input data set to match a probability density distribution of the input data set to generate an updated initial value, and cluster the probability density distribution of the input data set on a feature space of the input data set using the updated initial value of the model parameter of mixture distribution. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 18)
-
-
9. A method of clustering an input data set, comprising:
-
storing, in a memory, for each of a plurality of reference data sets corresponding to a respective one of the plurality of reference images that were previously clustered, a first reference parameter for clustering a first reference data set of the plurality of reference data sets, the reference data set of the plurality of reference data sets including a plurality of clusters corresponding to a region of the respective one of the plurality of reference images, the first reference parameter being a model parameter of mixture distribution, the first reference parameter corresponding to at least one of a number of clusters in the first reference data set, and a centroid of data points in each of the plurality of clusters; searching, using at least one processor, the memory to obtain the first reference parameter of at least one of the plurality of reference data sets that is similar to the input data set; determining, using the at least one processor, an initial value of the model parameter of mixture distribution of the input data set by combining the first reference parameter of the first reference data set with a second reference parameter of a second reference data set based on similarity between the input data set and the combined first and second reference data sets, the first reference parameter and the second reference parameter included in a plurality of reference parameters, at least one of the first reference parameter and the second reference parameter are obtained from the plurality of reference images; modifying, using the at least one processor, the initial value of the model parameter of mixture distribution of the input data set to match a probability density distribution of the input data set to generate an updated initial value; and clustering, using the at least one processor, the probability density distribution of the input data set on a feature space of the input data set using the updated initial value of the model parameter of mixture distribution. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16, 19)
-
-
17. A non-transitory computer readable recording medium having computer readable instructions stored thereon, that when executed by at least one processor, configure the at least one processor to:
-
store, in a memory, for each of a plurality of reference data sets corresponding to a respective one of the plurality of reference images that were previously clustered, a first reference parameter for clustering a first reference data set of the plurality of reference data sets, the first reference data set of the plurality of reference data sets including a plurality of clusters corresponding to a region of the respective one of the plurality of reference images, the first reference parameter being a model parameter of mixture distribution, the first reference parameter corresponding to at least one of a number of clusters in the first reference data set, and a centroid of data points in each of the plurality of clusters; search the memory to obtain the first reference parameter of at least one of the plurality of reference data sets that is similar to an input data set; determine an initial value of the model parameter of mixture distribution of the input data set by combining the first reference parameter of the first reference data set with a second reference parameter of a second reference data set based on similarity between the input data set and the combined first and second reference data sets, at least one of the first reference parameter and the second reference parameter are obtained from the plurality of reference images; modify the initial value of the model parameter of mixture distribution of the input data set to match a probability density distribution of the input data set to generate an updated initial value; and cluster the probability density distribution of the input data set on a feature space of the input data set using the updated initial value of the model parameter of mixture distribution. - View Dependent Claims (20)
-
Specification