Computerized cluster analysis framework for decorrelated cluster identification in datasets
First Claim
1. A non-transitory computer-readable medium having stored thereon computer-readable instructions that when executed by a computing device cause the computing device to:
- receive data that includes a plurality of observations with a plurality of data points defined for each observation, wherein each data point of the plurality of data points is associated with a variable to define a plurality of variables;
repeatedly select a number of clusters into which to segment the received data by repeatedly executing a clustering algorithm with the received data;
define a plurality of sets of clusters based on the repeated execution of the clustering algorithm that resulted in the selected number of clusters;
define a plurality of composite clusters based on the defined plurality of sets of clusters; and
assign the plurality of observations to the defined plurality of composite clusters using the plurality of data points defined for each observation.
1 Assignment
0 Petitions
Accused Products
Abstract
A computing device to automatically cluster a dataset is provided. Data that includes a plurality of observations with a plurality of data points defined for each observation is received. Each data point of the plurality of data points is associated with a variable to define a plurality of variables. A number of clusters into which to segment the received data is repeatedly selected by repeatedly executing a clustering algorithm with the received data. A plurality of sets of clusters is defined based on the repeated execution of the clustering algorithm that resulted in the selected number of clusters. A plurality of composite clusters is defined based on the defined plurality of sets of clusters. The plurality of observations is assigned to the defined plurality of composite clusters using the plurality of data points defined for each observation.
47 Citations
30 Claims
-
1. A non-transitory computer-readable medium having stored thereon computer-readable instructions that when executed by a computing device cause the computing device to:
-
receive data that includes a plurality of observations with a plurality of data points defined for each observation, wherein each data point of the plurality of data points is associated with a variable to define a plurality of variables; repeatedly select a number of clusters into which to segment the received data by repeatedly executing a clustering algorithm with the received data; define a plurality of sets of clusters based on the repeated execution of the clustering algorithm that resulted in the selected number of clusters; define a plurality of composite clusters based on the defined plurality of sets of clusters; and assign the plurality of observations to the defined plurality of composite clusters using the plurality of data points defined for each observation. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28)
-
-
29. A computing device comprising:
-
a processor; and a non-transitory computer-readable medium operably coupled to the processor, the computer-readable medium having computer-readable instructions stored thereon that, when executed by the processor, cause the computing device to receive data that includes a plurality of observations with a plurality of data points defined for each observation, wherein each data point of the plurality of data points is associated with a variable to define a plurality of variables; repeatedly select a number of clusters into which to segment the received data by repeatedly executing a clustering algorithm with the received data; define a plurality of sets of clusters based on the repeated execution of the clustering algorithm that resulted in the selected number of clusters; define a plurality of composite clusters based on the defined plurality of sets of clusters; and assign the plurality of observations to the defined plurality of composite clusters using the plurality of data points defined for each observation.
-
-
30. A method of automatically clustering a dataset, the method comprising:
-
receiving data that includes a plurality of observations with a plurality of data points defined for each observation, wherein each data point of the plurality of data points is associated with a variable to define a plurality of variables; repeatedly selecting, by a computing device, a number of clusters into which to segment the received data by repeatedly executing a clustering algorithm with the received data; defining, by the computing device, a plurality of sets of clusters based on the repeated execution of the clustering algorithm that resulted in the selected number of clusters; defining, by the computing device, a plurality of composite clusters based on the defined plurality of sets of clusters; and assigning, by the computing device, the plurality of observations to the defined plurality of composite clusters using the plurality of data points defined for each observation.
-
Specification