Method and system for visualization of clusters and classifications
Abstract
A system provides for the graphic visualization of the categories of a collection of records. The graphic visualization is referred to as a “category graph.” The system optionally displays the category graph as a “similarity graph” or a “hierarchical map.” In either form, the system displays a graphic representation of each category in a way that visually illustrates the similarity between categories, which allows a data analyst to better understand the similarity and dissimilarity between categories. A similarity graph includes a node for each category and an arc connecting nodes representing categories whose similarity is above a threshold. A hierarchical map is a tree structure that includes a node for each base category along with nodes representing combinations of similar categories.
42 Claims
-
1. A method in a computer system for displaying a representation of categories of data, the method comprising:
-
for each category, determining a similarity of that category to every other category so that a similarity is determined for each pair of categories;
displaying an indication of each category; and
for each pair of categories, displaying an indication of the determined similarity of the pair of categories, wherein the displayed indication is an arc connecting the displayed indication of each category in the pair of categories.
-
-
7. A method in a computer system for displaying a representation of categories of data, the method comprising:
-
for each category, determining a similarity of that category to every other category so that a similarity is determined for each pair of categories;
displaying an indication of each category;
for each pair of categories, displaying an indication of the determined similarity of the pair of categories;
establishing a similarity threshold; and
displaying the indication of the determined similarity for only those pairs of categories whose determined similarity is above the established similarity threshold.
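The thresholding step of claim 7 can be illustrated with a minimal sketch. The claims do not prescribe a similarity measure, so the Jaccard overlap used here, the `profiles` data, and the function names are all assumptions made for illustration only.

```python
from itertools import combinations


def similarity_arcs(categories, similarity, threshold):
    """Return (a, b, score) for each pair whose similarity exceeds the threshold."""
    arcs = []
    for a, b in combinations(categories, 2):
        score = similarity(a, b)  # one score per unordered pair of categories
        if score > threshold:
            arcs.append((a, b, score))
    return arcs


# Hypothetical categories, each described by a set of attribute values.
profiles = {
    "A": {"news", "sports", "weather"},
    "B": {"news", "sports", "finance"},
    "C": {"games", "music"},
}


def jaccard(a, b):
    # Stand-in similarity measure (an assumption, not taken from the claims).
    return len(profiles[a] & profiles[b]) / len(profiles[a] | profiles[b])


arcs = similarity_arcs(sorted(profiles), jaccard, threshold=0.3)
print(arcs)
```

Only the pair A and B shares enough attribute values to clear the 0.3 threshold in this toy data, so a display built from `arcs` would draw a single arc between those two nodes.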
-
-
8. A method in a computer system for displaying a representation of categories of data, the method comprising:
-
for each category, determining a similarity of that category to every other category so that a similarity is determined for each pair of categories;
displaying an indication of each category; and
for each pair of categories, displaying an indication of the determined similarity of the pair of categories, wherein the displayed indication is an arc and thickness of the arc indicates the determined similarity between the pair of categories.
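One way arc thickness could encode similarity, as recited in claim 8, is a linear mapping from the similarity score to a line width. The pixel range, the clamping behavior, and the sample scores below are assumptions; the claim itself only requires that thickness indicate similarity.

```python
def arc_width(similarity, min_sim, max_sim, min_px=1.0, max_px=8.0):
    """Map a similarity score linearly onto a line width in pixels."""
    if max_sim == min_sim:
        # All pairs are equally similar; fall back to a mid-range width.
        return (min_px + max_px) / 2
    t = (similarity - min_sim) / (max_sim - min_sim)
    t = max(0.0, min(1.0, t))  # clamp scores that fall outside the range
    return min_px + t * (max_px - min_px)


# Hypothetical pairwise similarity scores.
scores = {("A", "B"): 0.5, ("A", "C"): 0.1, ("B", "C"): 0.9}
lo, hi = min(scores.values()), max(scores.values())
widths = {pair: arc_width(s, lo, hi) for pair, s in scores.items()}
print(widths)
```

With this mapping the least similar pair gets the thinnest arc and the most similar pair the thickest, with intermediate scores spaced proportionally between them.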
-
-
9. A method in a computer system for displaying a representation of categories of data, the method comprising:
-
for each category, determining a similarity of that category to every other category so that a similarity is determined for each pair of categories;
displaying an indication of each category;
for each pair of categories, displaying an indication of the determined similarity of the pair of categories;
receiving a selection of a displayed indication of a category; and
in response to the selection, displaying information relating to the category;
wherein the data includes attributes and the information relating to the selected category identifies attributes that discriminate the selected category from another category.
where xi represents a value of attribute i and where p(xi|G) represents the conditional probability that a record has attribute value xi given that the record is in category G.
-
-
12. The method of claim 11 wherein
-
$$p(\,\ldots \mid G) = \frac{\sum_{h_j \in G} p(h_j) \prod_i p(x_i \mid h_j)}{\sum_{h_j \in G} p(h_j)}$$
where hj represents category j, where p(hj) represents the probability that a record is in hj, and where p(xi|hj) is the conditional probability that a record has the value xi for attribute i given that the record is in hj.
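The right-hand side of the expression in claim 12 can be read as a prior-weighted average, over the base categories hj in a combined category G, of the product of per-attribute conditional probabilities. The sketch below computes that quantity under this reading. The data layout (`prior`, `cond`), the function name, and the numbers are hypothetical.

```python
from math import prod


def likelihood_given_group(record, group, prior, cond):
    """Weighted average over h_j in G of prod_i p(x_i | h_j), weighted by p(h_j)."""
    weighted = sum(
        prior[h] * prod(cond[h][i][x] for i, x in enumerate(record))
        for h in group
    )
    return weighted / sum(prior[h] for h in group)


# Hypothetical model: three base categories, two binary attributes.
prior = {"h1": 0.2, "h2": 0.3, "h3": 0.5}
cond = {
    "h1": [{"yes": 0.9, "no": 0.1}, {"yes": 0.5, "no": 0.5}],
    "h2": [{"yes": 0.4, "no": 0.6}, {"yes": 0.5, "no": 0.5}],
    "h3": [{"yes": 0.1, "no": 0.9}, {"yes": 0.2, "no": 0.8}],
}

# Combined category G = {h1, h2}; record has x_0 = "yes", x_1 = "no".
value = likelihood_given_group(("yes", "no"), ["h1", "h2"], prior, cond)
print(round(value, 6))
```

For this toy data the numerator is 0.2 × 0.45 + 0.3 × 0.20 = 0.15 and the denominator is 0.5, giving 0.3.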
-
-
13. A method in a computer system for displaying a representation of categories of data, the method comprising:
-
for each category, determining a similarity of that category to every other category so that a similarity is determined for each pair of categories;
displaying an indication of each category;
for each pair of categories, displaying an indication of the determined similarity of the pair of categories;
receiving an indication to de-emphasize a category; and
in response to the indication to de-emphasize a category, de-emphasizing the displayed indication of the category.
in response to receiving the indication to de-emphasize a category, removing the displayed indication of the de-emphasized category.
-
-
15. The method of claim 13 wherein the de-emphasizing is dimming of the displayed indication of the de-emphasized category.
-
16. The method of claim 13 wherein the de-emphasizing is hiding of the displayed indication of the category.
-
17. A method in a computer system for displaying a representation of categories of data, the method comprising:
-
for each category, determining a similarity of that category to every other category so that a similarity is determined for each pair of categories;
displaying an indication of each category;
for each pair of categories, displaying an indication of the determined similarity of the pair of categories;
receiving an indication to split a combined category; and
in response to the indication to split a combined category, displaying an indication of a pair of categories for the combined category.
displaying a slider and wherein movement of the displayed slider indicates to split a combined category.
-
-
19. The method of claim 17 wherein the displaying an indication of a pair of categories includes displaying an animation of splitting the indication of the combined category into the pair of indications of categories.
-
20. The method of claim 17 wherein the category to be split is the combined category that was last combined.
-
21. The method of claim 17 including:
displaying a control and wherein selection of the control indicates to split categories.
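Claims 17 through 21 describe splitting a combined category, with claim 20 specifying that the category split is the one most recently combined. A simple way to support that behavior is to keep a stack of merges and pop it on each split request. The class and method names below are assumptions for illustration, not an implementation taken from the specification.

```python
class CategoryView:
    """Tracks which categories are displayed and the order in which they were combined."""

    def __init__(self, base_categories):
        self.visible = [frozenset([c]) for c in base_categories]
        self.history = []  # stack of (combined, left, right), most recent last

    def combine(self, left, right):
        combined = left | right
        self.visible.remove(left)
        self.visible.remove(right)
        self.visible.append(combined)
        self.history.append((combined, left, right))
        return combined

    def split_last(self):
        """Undo the most recent combination, e.g. in response to a slider or control."""
        if not self.history:
            return None  # nothing left to split
        combined, left, right = self.history.pop()
        self.visible.remove(combined)
        self.visible.extend([left, right])
        return left, right


view = CategoryView(["a", "b", "c"])
ab = view.combine(frozenset("a"), frozenset("b"))
abc = view.combine(ab, frozenset("c"))
pair = view.split_last()  # splits the category combined last: {a, b, c}
print(sorted(sorted(s) for s in view.visible))
```

Because merges are undone in last-in, first-out order, the category at the top of the stack is always currently displayed, so each split replaces one visible node with its two constituents.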
-
22. A method in a computer system for displaying a representation of categories of data, the method comprising:
-
receiving a hierarchical organization of the categories, the categories including a root category and leaf categories, each category except the leaf categories being a combined category;
displaying an indication of each category in the hierarchical organization;
receiving an indication to de-emphasize a specific category; and
in response to the indication to de-emphasize a category, de-emphasizing the displayed indications of categories in a sub-tree of which the specific category is a root.
-
-
26. A method in a computer system for displaying a representation of categories of data, the method comprising:
-
receiving a hierarchical organization of the categories, the categories including a root category and leaf categories, each category except the leaf categories being a combined category;
displaying an indication of each category in the hierarchical organization;
receiving an indication to de-emphasize a specific category; and
in response to the indication to de-emphasize a category, removing all displayed indications of categories in a sub-tree of which the specific category is a root, excluding the displayed indication of the category corresponding to the root of the sub-tree.
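The two sub-tree behaviors in claims 22 and 26 differ in what happens to the root of the sub-tree: claim 22 de-emphasizes every node in the sub-tree, while claim 26 removes the descendants but keeps the root displayed. A minimal sketch, assuming a hypothetical `Node` type with `dimmed` and `shown` flags:

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    name: str
    children: list = field(default_factory=list)
    dimmed: bool = False
    shown: bool = True


def walk(node):
    """Yield node and all of its descendants, pre-order."""
    yield node
    for child in node.children:
        yield from walk(child)


def dim_subtree(root):
    # Claim 22 style: de-emphasize every category in the sub-tree, root included.
    for n in walk(root):
        n.dimmed = True


def collapse_subtree(root):
    # Claim 26 style: remove descendants but keep the sub-tree's root displayed.
    for n in walk(root):
        if n is not root:
            n.shown = False


a, b, c = Node("a"), Node("b"), Node("c")
ab = Node("a+b", [a, b])
top = Node("a+b+c", [ab, c])

collapse_subtree(ab)
print([n.name for n in walk(top) if n.shown])
```

After `collapse_subtree(ab)` the leaves `a` and `b` are hidden while the combined node `a+b` and the unrelated leaf `c` remain visible.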
-
-
27. A method in a computer system for displaying a representation of categories of data, the method comprising:
-
receiving a hierarchical organization of the categories, the categories including a root category and leaf categories, each category except the leaf categories being a combined category;
displaying an indication of each category in the hierarchical organization;
receiving a selection of a displayed indication of a category; and
in response to the selection, displaying information relating to the selected category.
where xi represents a value of attribute i and where p(xi|G) represents the conditional probability that a record has attribute value xi given that the record is in category G.
-
-
36. The method of claim 35 wherein
-
$$p(\,\ldots \mid G) = \frac{\sum_{h_j \in G} p(h_j) \prod_i p(x_i \mid h_j)}{\sum_{h_j \in G} p(h_j)}$$
where hj represents category j, where p(hj) represents the probability that a record is in hj, and where p(xi|hj) is the conditional probability that a record has the value xi for attribute i given that the record is in hj.
-
-
37. The method of claim 27 wherein the data includes attributes and wherein the information relating to the selected category identifies attributes that are characteristic of the selected category.
-
38. The method of claim 27 wherein the information relating to the selected category indicates the homogeneity of the category.
-
39. The method of claim 27 wherein a homogeneity is given by the following equation:
-
where G represents a category or combined category, where p(G|x1, . . . ,xm) represents the probability that category G contains the record with attribute values x1, . . . ,xm, and where p(x1, . . . ,xm|G) represents the conditional probability that a record has attribute values x1, . . . ,xm given that it is in category G.
-
-
40. The method of claim 27 wherein the displayed information relates to the similarity of sub-categories of a combined category.
-
41. A method in a computer system for displaying a representation of categories of data, the method comprising:
-
receiving a hierarchical organization of the categories, the categories including a root category and leaf categories, each category except the leaf categories being a combined category; and
displaying an indication of each category in the hierarchical organization;
wherein the displayed indication of each category includes an indication of the number of records in said each category.
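For the record counts recited in claim 41, the count for a combined category can be derived by summing the counts of the leaf categories beneath it. The sketch below assumes a hypothetical representation in which a leaf is a string and a combined category is a tuple of child nodes; the label format is likewise an assumption.

```python
def leaves(node):
    """Return the leaf category names under node, left to right."""
    if isinstance(node, str):
        return [node]
    return [leaf for child in node for leaf in leaves(child)]


def labels(node, leaf_counts):
    """Yield a 'name (count)' label for node and each descendant, pre-order."""
    names = leaves(node)
    total = sum(leaf_counts[n] for n in names)
    yield f"{'+'.join(names)} ({total})"
    if not isinstance(node, str):
        for child in node:
            yield from labels(child, leaf_counts)


# Hypothetical hierarchy: root combines (a, b) with c.
hierarchy = (("a", "b"), "c")
counts = {"a": 120, "b": 80, "c": 300}
result = list(labels(hierarchy, counts))
print(result)
```

Each node's label carries the number of records it covers, so the root shows the size of the whole collection and each leaf shows its own count.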
-
Specification