Unified relational database model for data mining
First Claim
1. A method for managing data mining activities in a data mining environment, the data mining environment having data sets, a data mining tool, a data mining model, model training results stored in model training results files, and scoring output data stored in records in model scoring results tables, the method comprising:
- selecting a model scoring results table, wherein the selecting is carried out in dependence upon metadata included in a model scoring results control table, the model scoring results control table being related to a data set control table including data set metadata;
reading a scoring output data record from the selected model scoring results table;
storing the scoring output data record in a managed representation table for scoring results;
selecting a model training results file, wherein the selecting is carried out in dependence upon metadata included in a mining model control table, the mining model control table being related to the data set control table and the model scoring results control table;
reading training results data from the selected model training results file; and
storing the training results data in at least one model training results table, the at least one model training results table comprising a relational representation of the training results data from the selected model training results file, one of the at least one model training results table referencing the mining model control table, the managed representation table for scoring results being related to the referencing model training results table, the data set control table, the mining model control table, and the model scoring results control table.
1 Assignment
0 Petitions
Accused Products
Abstract
Managing data mining activities in a data mining environment, including selecting a model scoring results table, wherein the selecting is carried out in dependence upon metadata included in a model scoring results control table, the model scoring results control table being related to a data set control table including data set metadata; reading a scoring output data record from the selected model scoring results table; storing the scoring output data record in a managed representation table for scoring results; selecting a model training results file, wherein the selecting is carried out in dependence upon metadata included in a mining model control table; reading training results data from the selected model training results file; and storing the training results data in a model training results table, the model training results table comprising a relational representation of the training results data from the selected model training results file.
67 Citations
78 Claims
-
1. A method for managing data mining activities in a data mining environment, the data mining environment having data sets, a data mining tool, a data mining model, model training results stored in model training results files, and scoring output data stored in records in model scoring results tables, the method comprising:
-
selecting a model scoring results table, wherein the selecting is carried out in dependence upon metadata included in a model scoring results control table, the model scoring results control table being related to a data set control table including data set metadata;
reading a scoring output data record from the selected model scoring results table;
storing the scoring output data record in a managed representation table for scoring results;
selecting a model training results file, wherein the selecting is carried out in dependence upon metadata included in a mining model control table, the mining model control table being related to the data set control table and the model scoring results control table;
reading training results data from the selected model training results file; and
storing the training results data in at least one model training results table, the at least one model training results table comprising a relational representation of the training results data from the selected model training results file, one of the at least one model training results table referencing the mining model control table, the managed representation table for scoring results being related to the referencing model training results table, the data set control table, the mining model control table, and the model scoring results control table. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26)
-
-
27. A system for managing data mining activities in a data mining environment, the data mining environment having data sets, a data mining tool, a data mining model, model training results stored in model training results files, and scoring output data stored in records in model scoring results tables, the system comprising:
-
means for selecting a model scoring results table, wherein the means for selecting includes metadata in a model scoring results control table, the model scoring results control table being related to a data set control table including data set metadata;
means for reading a scoring output data record from the selected model scoring results table;
means for storing the scoring output data record in a managed representation table for scoring results;
means for selecting a model training results file, wherein the means for selecting includes metadata in a mining model control table, the mining model control table being related to the data set control table and the model scoring results control table;
means for reading training results data from the selected model training results file; and
means for storing the training results data in at least one model training results table, the at least one model training results table comprising a relational representation of the training results data from the selected model training results file, one of the at least one model training results table referencing the mining model control table, the managed representation table for scoring results being related to the referencing model training results table, the data set control table, the mining model control table, and the model scoring results control table. - View Dependent Claims (28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52)
-
-
53. A computer program product for managing data mining activities in a data mining environment, the data mining environment having data sets, a data mining tool, a data mining model, model training results stored in model training results files, and scoring output data stored in records in model scoring results tables, the computer program product comprising:
-
a recording medium;
means, recorded on the recording medium, for selecting a model scoring results table, wherein the means for selecting includes metadata in a model scoring results control table, the model scoring results control table being related to a data set control table including data set metadata;
means, recorded on the recording medium, for reading a scoring output data record from the selected model scoring results table;
means, recorded on the recording medium, for storing the scoring output data record in a managed representation table for scoring results;
means, recorded on the recording medium, for selecting a model training results file, wherein the means for selecting includes metadata in a mining model control table, the mining model control table being related to the data set control table and the model scoring results control table;
means, recorded on the recording medium, for reading training results data from the selected model training results file; and
means, recorded on the recording medium, for storing the training results data in at least one model training results table, the at least one model training results table comprising a relational representation of the training results data from the selected model training results file, one of the at least one model training results table referencing the mining model control table, the managed representation table for scoring results being related to the referencing model training results table, the data set control table, the mining model control table, and the model scoring results control table. - View Dependent Claims (54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78)
-
Specification