Method and system for processing data for database modification
First Claim
1. A method for processing data for database modification, the method comprising:
- receiving a first set of data;
performing a processing chain comprising a plurality of consecutive jobs to transform the first set of data into transformed data;
modifying a database with respect to the transformed data;
setting a target processing time for performance of the consecutive jobs;
before the consecutive jobs are launched, applying a first configuration defining a first number of parallel instances for each of the consecutive jobs; and
if an actual remaining processing time for the consecutive jobs uncompleted in the processing chain is outside of a range of acceptable remaining processing times relative to the target processing time, applying a second configuration defining a second number of parallel instances for each of the consecutive jobs uncompleted in the processing chain and including the first number of parallel instances for each of the consecutive jobs completed in the processing chain, the second number of parallel instances in the second configuration for at least one of the consecutive jobs uncompleted in the processing chain differing from the first number of parallel instances in the first configuration,wherein the first configuration and the second configuration are determined based upon information comprising historical data and constraint data, and the historical data comprises information on previous executions of processing chains for sets of data of a same type as the first set of data.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and system for processing data for database modification, include receiving a set of data, performing a processing chain including a plurality of consecutive jobs to transform the set of data into transformed data, modifying a production database with respect to the transformed data and further include the steps of setting a target processing time for the performance of the consecutive jobs, before a launch of a first job, applying an original configuration as current configuration defining a parallelization level for each of the consecutive jobs, before a launch of at least one further job after the first job, upon an actual remaining processing time being out of a range of acceptable remaining processing times, applying an adapted configuration as new current configuration defining an adapted parallelization level for each of the jobs remaining in the processing chain, the adapted configuration differing from the current configuration.
-
Citations
30 Claims
-
1. A method for processing data for database modification, the method comprising:
-
receiving a first set of data; performing a processing chain comprising a plurality of consecutive jobs to transform the first set of data into transformed data; modifying a database with respect to the transformed data; setting a target processing time for performance of the consecutive jobs; before the consecutive jobs are launched, applying a first configuration defining a first number of parallel instances for each of the consecutive jobs; and if an actual remaining processing time for the consecutive jobs uncompleted in the processing chain is outside of a range of acceptable remaining processing times relative to the target processing time, applying a second configuration defining a second number of parallel instances for each of the consecutive jobs uncompleted in the processing chain and including the first number of parallel instances for each of the consecutive jobs completed in the processing chain, the second number of parallel instances in the second configuration for at least one of the consecutive jobs uncompleted in the processing chain differing from the first number of parallel instances in the first configuration, wherein the first configuration and the second configuration are determined based upon information comprising historical data and constraint data, and the historical data comprises information on previous executions of processing chains for sets of data of a same type as the first set of data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A system comprising:
-
at least one data processor; and
program code configured upon execution by the at least one processor to process data for database modification by;receiving a first set of data; performing a processing chain comprising a plurality of consecutive jobs to transform the first set of data into transformed data; modifying a database with respect to the transformed data; setting a target processing time for performance of the consecutive jobs; before the consecutive jobs are launched, applying a first configuration defining a first number of parallel instances for each of the consecutive jobs; and if an actual remaining processing time for the consecutive jobs uncompleted in the processing chain is outside of a range of acceptable remaining processing times relative to the target processing time, applying a second configuration defining a second number of parallel instances for each of the consecutive jobs uncompleted in the processing chain and including the first number of parallel instances for each of the consecutive jobs completed in the processing chain, the second number of parallel instances in the second configuration for at least one of the consecutive jobs uncompleted in the processing chain differing from the first number of parallel instances in the first configuration, wherein the first configuration and the second configuration are determined based upon information comprising historical data and constraint data, and the historical data comprises information on previous executions of processing chains for sets of data of a same type as the first set of data. - View Dependent Claims (20, 21, 22, 23, 24)
-
-
25. A computer program product comprising:
-
a non-transitory computer-readable storage medium; and a computer program stored on the storage medium, the computer program comprising instructions that, when executed on a computer, cause the computer to process data for database modification by; receiving a first set of data; performing a processing chain comprising a plurality of consecutive jobs to transform the first set of data into transformed data; modifying a database with respect to the transformed data; setting a target processing time for performance of the consecutive jobs; before the consecutive jobs are launched, applying a first configuration defining a first number of parallel instances for each of the consecutive jobs; and if an actual remaining processing time for the consecutive jobs uncompleted in the processing chain is outside of a range of acceptable remaining processing times relative to the target processing time, applying a second configuration defining a second number of parallel instances for each of the consecutive jobs uncompleted in the processing chain and including the first number of parallel instances for each of the consecutive jobs completed in the processing chain, the second number of parallel instances in the second configuration for at least one of the consecutive jobs uncompleted in the processing chain differing from the first number of parallel instances in the first configuration, wherein the first configuration and the second configuration are determined based upon information comprising historical data and constraint data, and the historical data comprises information on previous executions of processing chains for sets of data of a same type as the first set of data. - View Dependent Claims (26, 27, 28, 29, 30)
-
Specification