Resource Optimization for Parallel Data Integration
First Claim
1. A method of optimizing resources for a parallel data integration job, the method comprising:
- receiving a job request specifying a parallel data integration job to deploy in a grid;
predicting grid resource utilizations for hypothetical runs of the specified job on respective hypothetical grid resource configurations, including automatically predicting grid resource utilizations by a resource optimizer module responsive to a model based on a plurality of actual runs of previous jobs; and
selecting a grid resource configuration for running the parallel data integration job, including the optimizer module automatically selecting a grid resource configuration responsive to the predicted grid resource utilizations and an optimization criterion.
1 Assignment
0 Petitions
Accused Products
Abstract
For optimizing resources for a parallel data integration job, a job request is received, which specifies a parallel data integration job to deploy in a grid. Grid resource utilizations are predicted for hypothetical runs of the specified job on respective hypothetical grid resource configurations. This includes automatically predicting grid resource utilizations by a resource optimizer module responsive to a model based on a plurality of actual runs of previous jobs. A grid resource configuration is selected for running the parallel data integration job, which includes the optimizer module automatically selecting a grid resource configuration responsive to the predicted grid resource utilizations and an optimization criterion.
-
Citations
25 Claims
-
1. A method of optimizing resources for a parallel data integration job, the method comprising:
-
receiving a job request specifying a parallel data integration job to deploy in a grid; predicting grid resource utilizations for hypothetical runs of the specified job on respective hypothetical grid resource configurations, including automatically predicting grid resource utilizations by a resource optimizer module responsive to a model based on a plurality of actual runs of previous jobs; and selecting a grid resource configuration for running the parallel data integration job, including the optimizer module automatically selecting a grid resource configuration responsive to the predicted grid resource utilizations and an optimization criterion. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A program product for parallel data integration job, the program product comprising:
-
a computer readable storage medium having computer readable program code embodied therewith, the computer readable program code comprising; computer readable code configured to receive a job request specifying a parallel data integration job to deploy in a grid; computer readable code configured to predict grid resource utilizations for hypothetical runs of the specified job on respective hypothetical grid resource configurations, including resource optimizer module computer readable code configured to automatically predict grid resource utilizations responsive to a model based on a plurality of actual runs of previous jobs; and computer readable code configured to select a grid resource configuration for running the parallel data integration job, including resource optimizer module computer readable code configured to automatically select a grid resource configuration responsive to the predicted grid resource utilizations and an optimization criterion. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18)
-
-
19. A computer system comprising:
-
at least one storage system for storing a parallel data integration job resource optimization program; and at least one processor for processing the parallel data integration job resource optimization program, the system being configured with the program and the processor to receive a job request specifying a parallel data integration job to deploy in a grid, predict grid resource utilizations for hypothetical runs of the specified job on respective hypothetical grid resource configurations, including automatically predicting grid resource utilizations by a resource optimizer module responsive to a model based on a plurality of actual runs of previous jobs, and select a grid resource configuration for running the parallel data integration job, including the optimizer module automatically selecting a grid resource configuration responsive to the predicted grid resource utilizations and an optimization criterion. - View Dependent Claims (20, 21, 22, 23, 24, 25)
-
Specification