SYSTEM AND METHOD FOR AUTOMATING ETL APPLICATION
First Claim
Patent Images
1. A system for processing a plurality of ETL jobs comprising:
- a job table including a plurality of job identifiers, each job identifier identifying a job to run, wherein the job table further includes information regarding which jobs to complete before running other jobs;
a priority table including priority information for at least one job identifier in the job table;
wherein if a job completes and a plurality of jobs may now run, the system utilizes priority information in the priority table to determine one of said plurality of jobs to run.
2 Assignments
0 Petitions
Accused Products
Abstract
Novel tools for development and operation of ETL (Extract Transform Load) systems for populating databases. An embodiment uses metadata tables to describe relationships between jobs to run for processing data. These relationships can include parent-child job relation, and priority. These tools create a DataStage Service Architecture (DSA) that helps automate and control the ETL process. Other tools allow developers to easily view and update the ETL process.
74 Citations
24 Claims
-
1. A system for processing a plurality of ETL jobs comprising:
-
a job table including a plurality of job identifiers, each job identifier identifying a job to run, wherein the job table further includes information regarding which jobs to complete before running other jobs; a priority table including priority information for at least one job identifier in the job table; wherein if a job completes and a plurality of jobs may now run, the system utilizes priority information in the priority table to determine one of said plurality of jobs to run. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A method for processing a plurality of ETL jobs comprising:
-
determining from a job table a plurality of jobs to run, wherein the job table further includes information regarding which jobs to complete before running other jobs; determining from a status table a status for the plurality of jobs to run, wherein the status table includes an indication of which jobs have completed successfully and which jobs need to be run again; determining a subset of jobs to run based on the information from the job table and the status table commencing processing the subset of jobs; if a job in the subset of jobs completes, determining another job to run, wherein the determination includes utilizing job priority information if a plurality of jobs may now run; and updating the status table based on processing of the plurality of jobs. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. A computer readable storage medium, including instructions that when supplied to a processor, cause the processor to perform:
-
determining from a job table a plurality of jobs to run, wherein the job table further includes information regarding which jobs to complete before running other jobs; determining from a status table a status for the plurality of jobs to run, wherein the status table includes an indication of which jobs have completed successfully and which jobs need to be run again; determining a subset of jobs to run based on the information from the job table and the status table commencing processing the subset of jobs; if a job in the subset of jobs completes, determining another job to run, wherein the determination includes utilizing job priority information if a plurality of jobs may now run; and updating the status table based on processing of the plurality of jobs. - View Dependent Claims (22, 23, 24)
-
Specification