Extract, transform, and load application complexity management framework
First Claim
1. A computer-implemented method of providing extract, transform, and load (ETL) application complexity management, the method comprising:
receiving, from a source system, a source data asset at a data processing system, the source data asset having a first data format;
transforming the source data asset to a target data asset using the data processing system, the target data asset having a second data format different than the first data format, wherein transforming the source data asset is performed using information from a dependency resource that identifies an extract transform and load job and at least one of the source data asset and the target data asset; and
transmitting the target data asset to a data consuming system;
subsequent to transmitting the target data asset to the data consuming system, performing a quality check with respect to the source data asset or the target data asset; and
modifying the target data asset on the data consuming system based on the quality check.
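The ordering of the claimed steps (transform using a dependency resource, transmit, then check quality and correct the already-delivered asset) can be illustrated with a minimal Python sketch. Everything here is hypothetical and chosen only for illustration: the class names `DataAsset`, `Dependency`, and `ConsumingSystem`, the CSV-like tuple input, and the "negative amount" quality rule are assumptions, not details taken from the patent.

```python
from dataclasses import dataclass


@dataclass
class DataAsset:
    name: str
    data_format: str
    rows: list


@dataclass
class Dependency:
    # Hypothetical "dependency resource": ties an ETL job to its assets.
    job_name: str
    source_asset: str
    target_asset: str
    target_format: str


class ConsumingSystem:
    """Stand-in for the data consuming system; keeps its own copy of each asset."""

    def __init__(self):
        self.assets = {}

    def receive(self, asset):
        self.assets[asset.name] = DataAsset(
            asset.name, asset.data_format, list(asset.rows)
        )


def transform(source, dep):
    # The dependency resource decides which target asset and format to produce.
    if dep.source_asset != source.name:
        raise ValueError(f"job {dep.job_name} does not consume {source.name}")
    rows = [{"id": int(r[0]), "amount": float(r[1])} for r in source.rows]
    return DataAsset(dep.target_asset, dep.target_format, rows)


def quality_check(asset):
    # Illustrative rule only: flag rows whose amount is negative.
    return [r["id"] for r in asset.rows if r["amount"] < 0]


def run_job(source, dep, consumer):
    target = transform(source, dep)
    consumer.receive(target)          # transmit first
    bad_ids = quality_check(target)   # quality check happens after transmission
    if bad_ids:
        stored = consumer.assets[target.name]
        # Modify the asset in place on the consuming system.
        stored.rows = [r for r in stored.rows if r["id"] not in bad_ids]
    return bad_ids
```

In this sketch the correction is applied to the copy held by the consuming system rather than by re-running the job, which mirrors the claim's requirement that the check and the modification both occur after transmission.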
Abstract
Extract, transform, and load application (ETL) complexity management framework systems and methods are described herein. The present disclosure describes systems and methods that reduce the complexity in managing ETL flow and correcting errant data that is subsequently identified. One or more methods include defining an ETL job definition, defining a data asset definition, defining a data asset dependency definition, receiving an ETL flow to provide execution of one or more ETL flow steps, providing retrieval of data from a source data asset, applying a data control to the source asset data, and producing an ETL job registration, a data asset status, a latest asset available date, a data asset consumer identifier, and a target data asset based on at least one of the ETL job definition, the data asset definition, the data dependency definition, and the source asset data.
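As a rough illustration of how the definitions named in the abstract might relate to the outputs it lists, the following Python sketch models a job definition, a data asset definition, and a dependency definition, then applies a data control to source rows to produce registration metadata. The field names, the status values `"available"` and `"held"`, and the `register_run` helper are all assumptions made for this example; the abstract does not specify them.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class JobDefinition:
    job_name: str


@dataclass(frozen=True)
class AssetDefinition:
    asset_name: str


@dataclass(frozen=True)
class DependencyDefinition:
    job_name: str
    source: AssetDefinition
    target: AssetDefinition
    consumer_id: str


def register_run(job, dep, source_rows, run_date, control):
    """Apply a data control to the source rows and return run metadata."""
    if dep.job_name != job.job_name:
        raise ValueError("dependency definition belongs to a different job")
    passed = all(control(row) for row in source_rows)
    return {
        "job_registration": job.job_name,
        "target_asset": dep.target.asset_name,
        # Hypothetical status values; the source does not name them.
        "asset_status": "available" if passed else "held",
        "latest_available_date": run_date if passed else None,
        "consumer_id": dep.consumer_id,
    }
```

A caller would pass the control as a predicate, for example `lambda row: row["amount"] >= 0`, so that a failed control leaves the asset without a new availability date.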
14 Citations
33 Claims
1. A computer-implemented method of providing extract, transform, and load (ETL) application complexity management, the method comprising:
receiving, from a source system, a source data asset at a data processing system, the source data asset having a first data format;
transforming the source data asset to a target data asset using the data processing system, the target data asset having a second data format different than the first data format, wherein transforming the source data asset is performed using information from a dependency resource that identifies an extract transform and load job and at least one of the source data asset and the target data asset; and
transmitting the target data asset to a data consuming system;
subsequent to transmitting the target data asset to the data consuming system, performing a quality check with respect to the source data asset or the target data asset; and
modifying the target data asset on the data consuming system based on the quality check.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11
12. A non-transitory computer-readable medium including one or more sequences of instructions which, when executed by one or more processors, causes:
receiving, from a source system, a source data asset at a data processing system, the source data asset having a first data format;
transforming the source data asset to a target data asset using the data processing system, the target data asset having a second data format different than the first data format, wherein transforming the source data asset is performed using information from a dependency resource that identifies an extract transform and load job and at least one of the source data asset and the target data asset; and
transmitting the target data asset to a data consuming system;
subsequent to transmitting the target data asset to the data consuming system, performing a quality check with respect to the source data asset or the target data asset; and
modifying the target data asset on the data consuming system based on the quality check.
Dependent claims: 13, 14, 15, 16, 17, 18, 19, 20, 21, 22
23. A system comprising:
one or more processors;
a non-transitory computer-readable medium including one or more sequences of instructions which, when executed by the one or more processors, causes:
receiving, from a source system, a source data asset at a data processing system, the source data asset having a first data format;
transforming the source data asset to a target data asset using the data processing system, the target data asset having a second data format different than the first data format, wherein transforming the source data asset is performed using information from a dependency resource that identifies an extract transform and load job and at least one of the source data asset and the target data asset; and
transmitting the target data asset to a data consuming system;
subsequent to transmitting the target data asset to the data consuming system, performing a quality check with respect to the source data asset or the target data asset; and
modifying the target data asset on the data consuming system based on the quality check.
Dependent claims: 24, 25, 26, 27, 28, 29, 30, 31, 32, 33
Specification