Parameter set determination for clustering of datasets
First Claim
1. A clustering cycle optimization system comprising:
a data storage to store parameters for clustering a data set; and
one or more processors to perform operations to:
generate an initial parameter set of a plurality of parameter sets via random selection of parameter values, the initial parameter set including different types of parameters;
cause random changes in values of the different types of parameters in the initial parameter set, wherein the random changes include changes of a predetermined percentage caused in the values of numerical parameters, toggling Boolean parameters and resampling categorical parameters;
generate the plurality of parameter sets based on the random changes;
generate a plurality of cluster solutions from each of the plurality of parameter sets, the plurality of cluster solutions generated for each parameter set comprising an optimized cluster solution having a highest total score of the plurality of cluster solutions generated for the parameter set;
obtain a respective fitness score for each parameter set of the plurality of parameter sets, the respective fitness score of the parameter set being based on total scores of the plurality of cluster solutions generated for the parameter set;
select a hyper-optimized parameter set from the plurality of parameter sets based on a comparison of the respective fitness scores of the plurality of parameter sets;
select a hyper-optimized cluster solution from the plurality of cluster solutions generated from the hyper-optimized parameter set, the selection of the hyper-optimized cluster solution being based on comparisons of the total scores of the plurality of cluster solutions generated for the hyper-optimized parameter set; and
perform an actionable item that includes transmission of a communication by an external system to a group of users based on the hyper-optimized cluster solution.
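The parameter-variation steps of claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. The schema, the parameter names (`num_clusters`, `normalize`, `distance`), the 10% step, the per-parameter mutation rate, and the clamping of numerical values to a range are all assumptions made for illustration; the claim only states that numerical values change by a predetermined percentage, Boolean values are toggled, and categorical values are resampled.

```python
import random

# Hypothetical parameter schema; names, ranges, and choices are illustrative only.
SCHEMA = {
    "num_clusters": {"type": "numerical", "low": 2, "high": 20},
    "normalize": {"type": "boolean"},
    "distance": {"type": "categorical",
                 "choices": ["euclidean", "manhattan", "cosine"]},
}

def random_parameter_set(schema, rng):
    """Build an initial parameter set by random selection of each value."""
    params = {}
    for name, spec in schema.items():
        if spec["type"] == "numerical":
            params[name] = rng.uniform(spec["low"], spec["high"])
        elif spec["type"] == "boolean":
            params[name] = rng.random() < 0.5
        else:
            params[name] = rng.choice(spec["choices"])
    return params

def mutate(params, schema, rng, percentage=0.10, rate=0.5):
    """Return a copy of params with random, type-specific changes.

    percentage: fixed fractional step for numerical values (assumed 10%).
    rate: probability that any one parameter is changed (an assumption).
    """
    child = dict(params)
    for name, spec in schema.items():
        if rng.random() >= rate:
            continue  # leave this parameter unchanged
        if spec["type"] == "numerical":
            # Shift up or down by the predetermined percentage.
            sign = rng.choice((-1.0, 1.0))
            value = child[name] * (1.0 + sign * percentage)
            # Clamping to the schema range is an assumption, not in the claim.
            child[name] = min(max(value, spec["low"]), spec["high"])
        elif spec["type"] == "boolean":
            child[name] = not child[name]  # toggle
        else:
            child[name] = rng.choice(spec["choices"])  # resample
    return child

def generate_parameter_sets(schema, count, seed=0):
    """Generate an initial set plus count - 1 randomly changed variants."""
    rng = random.Random(seed)
    initial = random_parameter_set(schema, rng)
    return [initial] + [mutate(initial, schema, rng) for _ in range(count - 1)]
```

Calling `generate_parameter_sets(SCHEMA, 5)` yields five parameter sets, the first being the randomly selected initial set and the rest being variants of it. Resampling a categorical parameter may return the value it already had.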
Abstract
A clustering system selects a parameter set from a plurality of parameter sets associated with a dataset to generate a hyper-optimized cluster of the dataset. Different parameter sets are generated by varying the parameter values based on a genetic algorithm or a particle swarm algorithm. A parameter set having a high fitness score is selected from the different parameter sets, and, of the cluster solutions produced using the selected parameter set, the one having the maximum total score is used to produce actionable items.
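The two-level selection described in the abstract (first a parameter set by fitness score, then a cluster solution by total score) can be sketched as follows. This is an illustration under stated assumptions, not the patented method. The source says only that the fitness score is based on the total scores of the cluster solutions generated for a parameter set; using their mean is an assumption, and the function names and sample scores are hypothetical.

```python
from statistics import mean

def fitness_score(total_scores):
    """Fitness of one parameter set.

    Assumption: the mean of the total scores of its cluster solutions.
    The source does not specify the aggregation.
    """
    return mean(total_scores)

def select_hyper_optimized(solutions_by_set):
    """Pick the best parameter set, then its best cluster solution.

    solutions_by_set maps a parameter-set id to a list of
    (solution, total_score) pairs.
    """
    fitness = {
        set_id: fitness_score([score for _, score in solutions])
        for set_id, solutions in solutions_by_set.items()
    }
    # Compare fitness scores across parameter sets.
    best_set = max(fitness, key=fitness.get)
    # Within the chosen set, compare total scores of its cluster solutions.
    best_solution, best_score = max(
        solutions_by_set[best_set], key=lambda pair: pair[1]
    )
    return best_set, best_solution, best_score, fitness

# Made-up scores for illustration only.
candidates = {
    "set_a": [("a1", 0.62), ("a2", 0.70)],
    "set_b": [("b1", 0.81), ("b2", 0.75), ("b3", 0.78)],
}
best_set, best_solution, best_score, _ = select_hyper_optimized(candidates)
print(best_set, best_solution, best_score)  # set_b b1 0.81
```

With a mean-based fitness, the selected parameter set is the one that performs well across its solutions, so it need not contain the single highest-scoring solution overall. A different aggregation, such as the maximum, would change that behavior.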
12 Claims
1. A clustering cycle optimization system comprising:
a data storage to store parameters for clustering a data set; and
one or more processors to perform operations to:
generate an initial parameter set of a plurality of parameter sets via random selection of parameter values, the initial parameter set including different types of parameters;
cause random changes in values of the different types of parameters in the initial parameter set, wherein the random changes include changes of a predetermined percentage caused in the values of numerical parameters, toggling Boolean parameters and resampling categorical parameters;
generate the plurality of parameter sets based on the random changes;
generate a plurality of cluster solutions from each of the plurality of parameter sets, the plurality of cluster solutions generated for each parameter set comprising an optimized cluster solution having a highest total score of the plurality of cluster solutions generated for the parameter set;
obtain a respective fitness score for each parameter set of the plurality of parameter sets, the respective fitness score of the parameter set being based on total scores of the plurality of cluster solutions generated for the parameter set;
select a hyper-optimized parameter set from the plurality of parameter sets based on a comparison of the respective fitness scores of the plurality of parameter sets;
select a hyper-optimized cluster solution from the plurality of cluster solutions generated from the hyper-optimized parameter set, the selection of the hyper-optimized cluster solution being based on comparisons of the total scores of the plurality of cluster solutions generated for the hyper-optimized parameter set; and
perform an actionable item that includes transmission of a communication by an external system to a group of users based on the hyper-optimized cluster solution.
Dependent claims: 2, 3, 4, 5, 6, 7
8. A computer implemented method for hyper-optimizing a clustering engine, comprising:
generating an initial parameter set of a plurality of parameter sets via random selection of parameter values, the initial parameter set including different types of parameters;
causing random changes in values of the different types of parameters in the initial parameter set, wherein the random changes include changes of a predetermined percentage caused in the values of numerical parameters, toggling Boolean parameters and resampling categorical parameters;
generating the plurality of parameter sets based on the random changes;
generating a plurality of cluster solutions from each of the plurality of parameter sets, the plurality of cluster solutions generated for each parameter set comprising an optimized cluster solution having a highest total score of the plurality of cluster solutions generated for the parameter set;
obtaining a respective fitness score for each parameter set of the plurality of parameter sets, the respective fitness score of the parameter set being based on total scores of the plurality of cluster solutions generated for the parameter set;
selecting a hyper-optimized parameter set from the plurality of parameter sets based on a comparison of the respective fitness scores of the plurality of parameter sets;
selecting a hyper-optimized cluster solution from the plurality of cluster solutions generated from the hyper-optimized parameter set, the selection of the hyper-optimized cluster solution being based on comparisons of the total scores of the plurality of cluster solutions generated for the hyper-optimized parameter set; and
performing an actionable item that includes transmission of a communication by an external system to a group of users based on the hyper-optimized cluster solution.
Dependent claims: 9, 10, 11
12. A non-transitory computer readable medium including machine readable instructions that are executable by at least one processor to:
generate an initial parameter set of a plurality of parameter sets via random selection of parameter values, the initial parameter set including different types of parameters;
cause random changes in values of the different types of parameters in the initial parameter set, wherein the random changes include changes of a predetermined percentage caused in the values of numerical parameters, toggling Boolean parameters and resampling categorical parameters;
generate the plurality of parameter sets based on the random changes;
generate a plurality of cluster solutions from each of the plurality of parameter sets, the plurality of cluster solutions generated for each parameter set comprising an optimized cluster solution having a highest total score of the plurality of cluster solutions generated for the parameter set;
obtain a respective fitness score for each parameter set of the plurality of parameter sets, the respective fitness score of the parameter set being based on total scores of the plurality of cluster solutions generated for the parameter set;
select a hyper-optimized parameter set from the plurality of parameter sets based on a comparison of the respective fitness scores of the plurality of parameter sets;
select a hyper-optimized cluster solution from the plurality of cluster solutions generated from the hyper-optimized parameter set, the selection of the hyper-optimized cluster solution being based on comparisons of the total scores of the plurality of cluster solutions generated for the hyper-optimized parameter set; and
perform an actionable item that includes transmission of a communication by an external system to a group of users based on the hyper-optimized cluster solution.
Specification