System for folder classification based on folder content similarity and dissimilarity
First Claim
Patent Images
1. A computerized method of representing a dataset with a taxonomy, comprising:
- obtaining a dataset containing a plurality of records;
initializing a folder-set containing a plurality of folders;
assigning labels to folders within the folder set;
classifying the plurality of records into the plurality of folders according to a predetermined entropic similarity condition; and
merging a plurality of folders when it is determined that a similarity between the plurality of folders is greater than a dissimilarity between the folders;
wherein the computerized method of representing a dataset with a taxonomy occurs within a physical computer.
1 Assignment
0 Petitions
Accused Products
Abstract
A computerized method of representing a dataset with a taxonomy includes obtaining a dataset containing a plurality of records; initializing a folder-set containing a plurality of folders; assigning labels to folders within the folder set; and classifying the plurality of records into the plurality of folders according to a predetermined entropic similarity condition.
-
Citations
18 Claims
-
1. A computerized method of representing a dataset with a taxonomy, comprising:
-
obtaining a dataset containing a plurality of records; initializing a folder-set containing a plurality of folders; assigning labels to folders within the folder set; classifying the plurality of records into the plurality of folders according to a predetermined entropic similarity condition; and merging a plurality of folders when it is determined that a similarity between the plurality of folders is greater than a dissimilarity between the folders; wherein the computerized method of representing a dataset with a taxonomy occurs within a physical computer. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A computer program product comprising a physical computer usable medium having computer readable code embodied therein for causing a physical computer to effect:
-
obtaining a dataset containing a plurality of records; initializing a folder-set containing a plurality of folders; assigning labels to folders within the folder set; classifying the plurality of records into the plurality of folders according to a predetermined entropic similarity condition; and a computer usable medium having computer readable code embodied therein for causing a computer to effect merging a plurality of folders when it is determined that a similarity between the plurality of folders is greater than a dissimilarity between the folders. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
Specification