INFORMATION PROCESSING APPARATUS, CLUSTERING METHOD, AND RECORDING MEDIUM STORING CLUSTERING PROGRAM
First Claim
Patent Images
1. An information processing apparatus that clusters an input data set, comprising:
- a memory to store, for each of a plurality of reference data sets that is previously clustered, a reference parameter used for clustering the reference data set, the reference parameter being a model parameter of mixture distribution; and
a processor tosearch the memory to obtain the reference parameter of at least one reference data set that is similar to the input data set;
determine an initial value of a model parameter of mixture distribution of the input data set, based on the reference parameter of the at least one reference data set;
modify the initial value of the model parameter of mixture distribution of the input data set, so as to match a probability density distribution of the input data set to generate an updated initial value; and
cluster the probability density distribution of the input data set on a feature space of the input data set, using the updated initial value of the model parameter of mixture distribution.
1 Assignment
0 Petitions
Accused Products
Abstract
An information processing apparatus, a clustering method, and a clustering program stored on a recording medium, each of which determines an initial value of model parameter of an input data set based on a model parameter of a reference data set that is similar to the input data set and is previously clustered, modifies the initial value so as to match the input data set, and to obtain a clustering result of the input data set using the updated initial value of model parameter.
20 Citations
17 Claims
-
1. An information processing apparatus that clusters an input data set, comprising:
-
a memory to store, for each of a plurality of reference data sets that is previously clustered, a reference parameter used for clustering the reference data set, the reference parameter being a model parameter of mixture distribution; and a processor to search the memory to obtain the reference parameter of at least one reference data set that is similar to the input data set; determine an initial value of a model parameter of mixture distribution of the input data set, based on the reference parameter of the at least one reference data set; modify the initial value of the model parameter of mixture distribution of the input data set, so as to match a probability density distribution of the input data set to generate an updated initial value; and cluster the probability density distribution of the input data set on a feature space of the input data set, using the updated initial value of the model parameter of mixture distribution. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A method of clustering an input data set, comprising:
-
storing, in a memory, for each of a plurality of reference data sets that is previously clustered, a reference parameter used for clustering the reference data set, the reference parameter being a model parameter of mixture distribution; searching the memory to obtain the reference parameter of at least one reference data set that is similar to the input data set; determining an initial value of a model parameter of mixture distribution of the input data set, based on the reference parameter of the at least one reference data set; modifying the initial value of the model parameter of mixture distribution of the input data set, so as to match a probability density distribution of the input data set to generate an updated initial value; and clustering the probability density distribution of the input data set on a feature space of the input data set, using the updated initial value of the model parameter of mixture distribution. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. A non-transitory recording medium storing a plurality of instructions which, when executed by a processor, cause the processor to perform a method of clustering an input data set, the method comprising:
-
storing, in a memory, for each of a plurality of reference data sets that is previously clustered, a reference parameter used for clustering the reference data set, the reference parameter being a model parameter of mixture distribution; searching the memory to obtain the reference parameter of at least one reference data set that is similar to the input data set; determining an initial value of a model parameter of mixture distribution of the input data set, based on the reference parameter of the at least one reference data set; modifying the initial value of the model parameter of mixture distribution of the input data set, so as to match a probability density distribution of the input data set to generate an updated initial value; and clustering the probability density distribution of the input data set on a feature space of the input data set, using the updated initial value of the model parameter of mixture distribution.
-
Specification