Production and preprocessing system for data mining
First Claim
Patent Images
1. A method of preprocessing for data mining, comprising the steps of:
- creating, from XML data, a hierarchical unit tree as a tree structure in which attributes of the XML data are set as a leaf node and a non-leaf node, a relationship between the attributes without including an attribute value is expressed, and a redundant parent-child relationship between the nodes is optimized by merging;
adding a change to the hierarchical unit tree; and
converting the XML data so as to reflect the change added to the hierarchical unit tree.
1 Assignment
0 Petitions
Accused Products
Abstract
Disclosed is means capable of solving trouble in managing data formats and procedures and capable of carrying out advanced preprocessing more intuitively. A data aggregate to be inputted to a mining engine is converted into hierarchical unit trees, and node conditions of the hierarchical unit trees are changed, whereby the data aggregate and a data structure are subjected to dynamic conversion/edition processing. Thus, a system is constructed, in which preprocessing for data mining is unitarily managed/semi-automated.
-
Citations
9 Claims
-
1. A method of preprocessing for data mining, comprising the steps of:
-
creating, from XML data, a hierarchical unit tree as a tree structure in which attributes of the XML data are set as a leaf node and a non-leaf node, a relationship between the attributes without including an attribute value is expressed, and a redundant parent-child relationship between the nodes is optimized by merging;
adding a change to the hierarchical unit tree; and
converting the XML data so as to reflect the change added to the hierarchical unit tree.
-
-
2. A method of preprocessing for data mining, comprising the steps of:
-
displaying, on a screen, a hierarchical unit tree as a tree structure in which a leaf node and a non-leaf node, and a branch expressing a parent-child relationship between the nodes are included, both of the nodes corresponding to attributes of XML data, and a redundant parent-child relationship between the nodes is optimized by merging, the hierarchical unit tree being created from the XML data;
adding a change to the hierarchical unit tree; and
converting the XML data so as to reflect the change added to the hierarchical unit tree. - View Dependent Claims (3, 4, 5, 6, 7)
-
-
8. A preprocessing system for data mining, comprising:
-
a display unit for displaying a hierarchical unit tree as a tree structure in which a leaf node and a non-leaf node, and a branch expressing a parent-child relationship between the nodes are included, both of the nodes corresponding to attributes of XML data, and a redundant parent-child relationship between the nodes is optimized by merging, the hierarchical unit tree being created from the XML data; and
a filter selection unit for selecting a filter for adding a change to the hierarchical unit tree. - View Dependent Claims (9)
-
Specification