Method and System for Subspace Bounded Recursive Clustering of Categorical Data
First Claim
1. A computerized method of representing a dataset, comprising:
- obtaining a dataset, the dataset defining an attribute space;
decomposing the attribute space into a plurality of attribute subspaces;
generating a parent taxonomy of the obtained dataset with respect to one of the plurality of attribute subspaces, the parent taxonomy organizing the obtained dataset into a plurality of data subsets;
generating a child taxonomy with respect to another one of the plurality of attribute subspaces, the child taxonomy organizing each of the plurality of data subsets within the parent taxonomy into at least one data subset;
iteratively repeating generating the child taxonomy until a predetermined termination condition is satisfied, wherein the child taxonomy of a preceding iteration is the parent taxonomy of the current iteration; and
assigning category labels to the data subsets.
1 Assignment
0 Petitions
Accused Products
Abstract
A computerized method of representing a dataset includes obtaining a dataset, the dataset defining an attribute space; decomposing the attribute space into a plurality of attribute subspaces; generating a parent taxonomy of the obtained dataset with respect to one of the plurality of attribute subspaces, the parent taxonomy organizing the obtained dataset into a plurality of data subsets; generating a child taxonomy with respect to another one of the plurality of attribute subspaces, the child taxonomy organizing each of the plurality of data subsets within the parent taxonomy into at least one data subset; iteratively repeating generating the child taxonomy until a predetermined termination condition is satisfied, wherein the child taxonomy of a preceding iteration is the parent taxonomy of the current iteration; and assigning category labels to the data subsets.
38 Citations
20 Claims
-
1. A computerized method of representing a dataset, comprising:
-
obtaining a dataset, the dataset defining an attribute space; decomposing the attribute space into a plurality of attribute subspaces; generating a parent taxonomy of the obtained dataset with respect to one of the plurality of attribute subspaces, the parent taxonomy organizing the obtained dataset into a plurality of data subsets; generating a child taxonomy with respect to another one of the plurality of attribute subspaces, the child taxonomy organizing each of the plurality of data subsets within the parent taxonomy into at least one data subset; iteratively repeating generating the child taxonomy until a predetermined termination condition is satisfied, wherein the child taxonomy of a preceding iteration is the parent taxonomy of the current iteration; and assigning category labels to the data subsets. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A computer program product comprising a computer usable medium having computer readable code embodied therein for causing a computer to effect:
-
obtaining a dataset, the dataset defining an attribute space; decomposing the attribute space into a plurality of attribute subspaces; generating a parent taxonomy of the obtained dataset with respect to one of the plurality of attribute subspaces, the parent taxonomy organizing the obtained dataset into a plurality of data subsets; generating a child taxonomy with respect to another one of the plurality of attribute subspaces, the child taxonomy organizing each of the plurality of data subsets within the parent taxonomy into at least one data subset; iteratively repeating generating the child taxonomy until a predetermined termination condition is satisfied, wherein the child taxonomy of a preceding iteration is the parent taxonomy of the current iteration; and assigning category labels to the data subsets. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
Specification