Extract, transform, and load application complexity management framework
First Claim
1. A computer-implemented method of providing extract, transform, and load (ETL) application complexity management, the method comprising:
- receiving a source data asset at a data processing system, the source data set having a first data format;
transforming the source data asset to a target data asset using the data processing system, the target data set having a second data format different than the first data format;
transmitting the target data set to a data consuming system;
receiving incident information regarding the source data asset, the incident information identifying errant data associated with the source data asset;
producing a roll back plan to remove an effect of the errant data from the data consuming system;
executing the roll back plan; and
replacing at least a portion of data of the target data asset with corrected data.
1 Assignment
0 Petitions
Accused Products
Abstract
Extract, transform, and load application (ETL) complexity management framework systems and methods are described herein. The present disclosure describes systems and methods that reduce the complexity in managing ETL flow and correcting errant data that is subsequently identified. One or more methods include defining an ETL job definition, defining a data asset definition, defining a data asset dependency definition, receiving an ETL flow to provide execution of one or more ETL flow steps, providing retrieval of data from a source data asset, applying a data control to the source asset data, and producing an ETL job registration, a data asset status, a latest asset available date, a data asset consumer identifier, and a target data asset based on at least one of the ETL job definition, the data asset definition, the data dependency definition, and the source asset data.
-
Citations
10 Claims
-
1. A computer-implemented method of providing extract, transform, and load (ETL) application complexity management, the method comprising:
-
receiving a source data asset at a data processing system, the source data set having a first data format; transforming the source data asset to a target data asset using the data processing system, the target data set having a second data format different than the first data format; transmitting the target data set to a data consuming system; receiving incident information regarding the source data asset, the incident information identifying errant data associated with the source data asset; producing a roll back plan to remove an effect of the errant data from the data consuming system; executing the roll back plan; and replacing at least a portion of data of the target data asset with corrected data. - View Dependent Claims (2, 3)
-
-
4. A computer-implemented method of providing extract, transform, and load (ETL) application complexity management, the method comprising:
-
receiving information about a data asset incident relating to a data asset error, a data asset consumer associated with a particular data asset, and a data asset status associated with the particular data asset to determine a data asset dependency; producing a roll back plan, based on the data asset dependency, to remove the effect of the data asset error from one or more target data asset consumers; executing the roll back plan; and replacing one or more subsets of target data assets, each containing at least one data item, with corrected data to remove the effects of the data asset error, wherein replacing the one or more subsets of target data assets with corrected data to remove the effects of the data asset error comprises recalculating totals and counts within one or more data sets.
-
-
5. A computer-implemented method of providing extract, transform, and load (ETL) application complexity management, the method comprising:
-
receiving information about a data asset incident relating to a data asset error, a data asset consumer associated with a particular data asset, and a data asset status associated with the particular data asset to determine a data asset dependency; producing a roll back plan, based on the data asset dependency, to remove the effect of the data asset error from one or more target data asset consumers; executing the roll back plan; and replacing one or more subsets of target data assets, each containing at least one data item, with corrected data to remove the effects of the data asset error, wherein replacing the one or more subsets of target data assets with corrected data to remove the effects of the data asset error comprises removing a subset of one or more target data assets.
-
-
6. A computer-implemented method of providing extract, transform, and load (ETL) application complexity management, the method comprising:
-
receiving information about a data asset incident relating to a data asset error, a data asset consumer associated with a particular data asset, and a data asset status associated with the particular data asset to determine a data asset dependency; producing a roll back plan, based on the data asset dependency, to remove the effect of the data asset error from one or more target data asset consumers; executing the roll back plan; and replacing one or more subsets of target data assets, each containing at least one data item, with corrected data to remove the effects of the data asset error, wherein replacing the one or more subsets of target data assets with corrected data comprises providing corrected data to the target data asset to replace a subset of data in one or more target data asset consumers. - View Dependent Claims (7, 8, 9, 10)
-
Specification