QUALITY-DRIVEN ETL DESIGN OPTIMIZATION
First Claim
1. A method for Extract Transform Load (ETL) process optimization on a computing system, comprising:
- obtaining an improvement objective from user input into the computing system, the improvement objectives representing a priority optimization desired by a user for improved ETL flows for an application configured to run in memory of the computing system;
creating an ETL flow in memory of the computing system;
restructuring the ETL flow for flow optimization using a processor of the computing system by applying an algebraic rewriting process, wherein the flow restructuring is based on the improvement objective; and
storing the optimized ETL flow as executable code on a computer readable storage medium.
2 Assignments
0 Petitions
Accused Products
Abstract
A method for quality objective-based ETL pipeline optimization is provided. An improvement objective is obtained from user input into a computing system. The improvement objective represents a priority optimization desired by a user for improved ETL flows for an application designed to run in memory of the computing system. An ETL flow is created in the memory of the computing system. The ETL flow is restructured for flow optimization with a processor of the computing system. The flow restructuring is based on the improvement objective. Flow restructuring can include application of flow rewriting optimization or application of an algebraic rewriting optimization. The optimized ETL flow is stored as executable code on a computer readable storage medium.
99 Citations
20 Claims
-
1. A method for Extract Transform Load (ETL) process optimization on a computing system, comprising:
-
obtaining an improvement objective from user input into the computing system, the improvement objectives representing a priority optimization desired by a user for improved ETL flows for an application configured to run in memory of the computing system; creating an ETL flow in memory of the computing system; restructuring the ETL flow for flow optimization using a processor of the computing system by applying an algebraic rewriting process, wherein the flow restructuring is based on the improvement objective; and storing the optimized ETL flow as executable code on a computer readable storage medium. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A method for Extract Transform Load (ETL) process optimization on a computing system, comprising:
-
obtaining an improvement objective from user input into the computing system, the improvement objective representing a priority optimization desired by a user for improved ETL flows for an application configured to run in memory of the computing system; determining available computing components for processing ETL flows; creating an ETL flow in memory of the computing system, the ETL flow being implemented on the computing components; restructuring the ETL flow for flow optimization with a processor of the computing system by applying flow rewriting optimization, wherein the flow rewriting is based on the improvement objective and the available computing components; and storing the optimized ETL flow as executable code on a computer readable storage medium. - View Dependent Claims (10, 11, 12, 13)
-
-
14. A method for integration of quality objectives into an Extract Transform Load (ETL) flow on a computing system comprising:
-
gathering quality objectives for ETL data flows configured to run in memory of the computing system; entering the quality objectives into the computing system; creating a conceptual model of ETL data flows including dependencies and constraints among the flows using the memory and a processor; restructuring ETL data flows using the conceptual model and the quality objectives to form a first optimized ETL data flow stored in the memory; creating a logical model of the first optimized ETL data flow using the memory and the processor by expressing the conceptual model as a graph of algebraic operators with data flow between the operators; restructuring ETL data flows with the processor using the logical model and the quality objectives to form a second optimized ETL data flow stored in the memory; and generating a code model of the second optimized ETL data flow including executable code stored on the computer readable storage medium. - View Dependent Claims (15, 16, 17, 18, 19, 20)
-
Specification