System and method for selecting a data mining modeling algorithm for data mining applications
First Claim
1. A data mining method, comprising:
- providing a computing system comprising a computer readable medium and computing devices electrically coupled through an interface apparatus, wherein a plurality of different data mining modeling algorithms and test data are stored on said computer readable medium, wherein each of said computing devices comprises at least one central processing unit (CPU) and an associated memory device, wherein data has been divided by the computing system into a plurality of data subsets, and wherein each of said associated memory devices comprises a data subset from said plurality of data subsets;
selecting a technique for generating a data mining model applied to each of said data subsets;
running simultaneously, each of said different data mining modeling algorithms using said selected technique to generate an associated data mining model on each of said computing devices;
comparing each of said data mining models on each of said computing devices to said test data to determine a best data model of said data mining models; and
determining, a best data mining modeling algorithm from said different data mining modeling algorithms in accordance with said selected technique, wherein said best data mining modeling algorithm is the data mining modeling algorithm that is associated with said best data mining model.
1 Assignment
0 Petitions
Accused Products
Abstract
A computing system and method for selecting a data mining modeling algorithm. The computing system comprises a computer readable medium and computing devices electrically coupled through an interface apparatus. A plurality of different data mining modeling algorithms and test data are stored on the computer readable medium. Each of the computing devices comprises a data subset from a plurality of data subsets. A technique is selected for generating a data mining model applied to each of the data subsets. Each of the different data mining modeling algorithms is run simultaneously to generate an associated data mining model on each of the computing devices. Each of the data mining models is compared to the test data to determine a best data model. A best data mining modeling algorithm from the different data mining modeling algorithms is selected in accordance with the best data mining model.
53 Citations
36 Claims
-
1. A data mining method, comprising:
-
providing a computing system comprising a computer readable medium and computing devices electrically coupled through an interface apparatus, wherein a plurality of different data mining modeling algorithms and test data are stored on said computer readable medium, wherein each of said computing devices comprises at least one central processing unit (CPU) and an associated memory device, wherein data has been divided by the computing system into a plurality of data subsets, and wherein each of said associated memory devices comprises a data subset from said plurality of data subsets;
selecting a technique for generating a data mining model applied to each of said data subsets;
running simultaneously, each of said different data mining modeling algorithms using said selected technique to generate an associated data mining model on each of said computing devices;
comparing each of said data mining models on each of said computing devices to said test data to determine a best data model of said data mining models; and
determining, a best data mining modeling algorithm from said different data mining modeling algorithms in accordance with said selected technique, wherein said best data mining modeling algorithm is the data mining modeling algorithm that is associated with said best data mining model. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A computing system comprising a processor coupled to a computer readable medium and computing devices electrically coupled through an interface apparatus, wherein said computer readable medium comprises a plurality of different data mining modeling algorithms, test data, and instructions that when executed by the processor implement a data mining method, wherein each of said computing devices comprises at least one central processing unit (CPU) and an associated memory device, wherein data has been divided by the computing system into a plurality of data subsets, and wherein each of said associated memory devices comprises a data subset from said plurality of data subsets, said method comprising the computer implemented steps of:
-
selecting a technique for generating a data mining model applied to each of said data subsets;
running simultaneously, each of said different data mining modeling algorithms using said selected technique to generate an associated data mining model on each of said computing devices;
comparing each of said data mining models on each of said computing devices to said test data to determine a best data model of said data mining models; and
determining, a best data mining modeling algorithm from said different data mining modeling algorithms in accordance with said selected technique, wherein said best data mining modeling algorithm is the data mining modeling algorithm that is associated with said best data mining model. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A process for integrating computing infrastructure, comprising integrating computer-readable code into a computing system, wherein the code in combination with the computing system comprises a computer readable medium and computing devices electrically coupled through an interface apparatus, wherein a plurality of different data mining modeling algorithms and test data are stored on said computer readable medium, wherein each of said computing devices comprises at least one central processing unit (CPU) and an associated memory device, wherein data has been divided by the computing system into a plurality of data subsets, and wherein each of said associated memory devices comprises a data subset from said plurality of data subsets, and wherein the code in combination with the computing system is adapted to implement a method for performing the steps of:
-
selecting a technique for generating a data mining model applied to each of said data subsets;
running simultaneously, each of said different data mining modeling algorithms using said selected technique to generate an associated data mining model on each of said computing devices;
comparing each of said data mining models on each of said computing devices to said test data to determine a best data model of said data mining models; and
determining, a best data mining modeling algorithm from said different data mining modeling algorithms in accordance with said selected technique, wherein said best data mining modeling algorithm is the data mining modeling algorithm that is associated with said best data mining model. - View Dependent Claims (20, 21, 22, 23, 24, 25, 26, 27)
-
-
28. A computer program product, comprising a computer usable medium having a computer readable program code embodied therein, said computer readable program code comprising an algorithm adapted to implement a data mining method within a computing system, said computing system comprising a computer readable medium and computing devices electrically coupled through an interface apparatus, wherein a plurality of different data mining modeling algorithms and test data are stored on said computer readable medium, wherein each of said computing devices comprises at least one central processing unit (CPU) and an associated memory device, wherein data has been divided by the computing system into a plurality of data subsets, and wherein each of said associated memory devices comprises a data subset from said plurality of data subsets, said method comprising the steps of:
-
selecting a technique for generating a data mining model applied to each of said data subsets;
running simultaneously, each of said different data mining modeling algorithms using said selected technique to generate an associated data mining model on each of said computing devices;
comparing each of said data mining models on each of said computing devices to said test data to determine a best data model of said data mining models; and
determining, a best data mining modeling algorithm from said different data mining modeling algorithms in accordance with said selected technique, wherein said best data mining modeling algorithm is the data mining modeling algorithm that is associated with said best data mining model. - View Dependent Claims (29, 30, 31, 32, 33, 34, 35, 36)
-
Specification