UNSUPERVISED PRIORITIZATION AND VISUALIZATION OF CLUSTERS
First Claim
1. A network device, comprising:
- a transceiver to send and receive data over a network; and
a processor that is operative to perform actions, comprising;
receiving a dataset of a plurality of attributes for a plurality of entities, each entity being described by a set of attribute values from within the plurality of attributes;
receiving a clustering of the plurality of entities, the clustering describing a plurality of clusters;
for each of a first cluster and a reference cluster, for each attribute of the first cluster, computing an aggregate attribute value;
for each of the first cluster, for each aggregate attribute value, computing an attribute dissimilarity between the aggregate attribute value of the first cluster from the aggregate attribute value of the reference cluster;
combining the attribute dissimilarities for each cluster, to obtain a single cluster dissimilarity for each cluster; and
displaying on a display device, an ordering of each cluster based on their respective cluster dissimilarities to the reference cluster.
7 Assignments
0 Petitions
Accused Products
Abstract
Techniques are disclosed that automatically identify and order the most differentiated clusters from a given collection of clusters within a dataset. A measure of dissimilarity is computed for each cluster from a defined reference cluster, and the clusters are ordered according to the chosen dissimilarity. At least N clusters are selected as the most differentiated clusters relative to the defined reference. Within each cluster, the top-M most distinguishing cluster attributes can be automatically identified by an analogous process that computes the dissimilarity of each cluster attribute to its corresponding attribute in the reference cluster, and orders the attributes by dissimilarity. This then allows for automatic surfacing of what it is about a cluster that differentiates its members relative to the population as a whole, and to provide insight on what action or treatment might be made to address that specific segment of the underlying population.
-
Citations
20 Claims
-
1. A network device, comprising:
-
a transceiver to send and receive data over a network; and a processor that is operative to perform actions, comprising; receiving a dataset of a plurality of attributes for a plurality of entities, each entity being described by a set of attribute values from within the plurality of attributes; receiving a clustering of the plurality of entities, the clustering describing a plurality of clusters; for each of a first cluster and a reference cluster, for each attribute of the first cluster, computing an aggregate attribute value; for each of the first cluster, for each aggregate attribute value, computing an attribute dissimilarity between the aggregate attribute value of the first cluster from the aggregate attribute value of the reference cluster; combining the attribute dissimilarities for each cluster, to obtain a single cluster dissimilarity for each cluster; and displaying on a display device, an ordering of each cluster based on their respective cluster dissimilarities to the reference cluster. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A system, comprising:
-
one or more non-transitory storage devices usable to store customer data; and one or more processors operative to perform actions, comprising; receiving a dataset of a plurality of attributes for a plurality of entities, each entity being described by a set of attribute values from within the plurality of attributes; receiving a clustering of the plurality of entities, the clustering describing a plurality of clusters; for each of a first cluster and a reference cluster, for each attribute of the first cluster, computing an aggregate attribute value; for each of the first cluster, for each aggregate attribute value, computing an attribute dissimilarity between the aggregate attribute value of the first cluster from the aggregate attribute value of the reference cluster; combining the attribute dissimilarities for each cluster, to obtain a single cluster dissimilarity for each cluster; and displaying on a display device, an ordering of each cluster based on their respective cluster dissimilarities to the reference cluster. - View Dependent Claims (8, 9, 10, 11, 12, 13)
-
-
14. An apparatus comprising a non-transitory computer readable medium, having computer-executable instructions stored thereon, that in response to execution by a computing device, cause the computing device to perform operations, comprising:
-
receiving a dataset of a plurality of attributes for a plurality of entities, each entity being described by a set of attribute values from within the plurality of attributes; receiving a clustering of the plurality of entities, the clustering describing a plurality of clusters; for each of a first cluster and a reference cluster, for each attribute of the first cluster, computing an aggregate attribute value; for each of the first cluster, for each aggregate attribute value, computing an attribute dissimilarity between the aggregate attribute value of the first cluster from the aggregate attribute value of the reference cluster; combining the attribute dissimilarities for each cluster, to obtain a single cluster dissimilarity for each cluster; and displaying an ordering of each cluster based on their respective cluster dissimilarities to the reference cluster. - View Dependent Claims (15, 16, 17, 18, 19, 20)
-
Specification