Extract, transform, and load application complexity management framework
First Claim
1. A computer-implemented method of managing one or more processes providing extract, transform, and load (ETL) application complexity management comprising:
- defining an ETL job definition;
defining a data asset definition;
defining a data asset dependency definition;
receiving an ETL flow to provide execution of one or more ETL flow steps;
retrieving data from a source data asset;
applying a data control to the source asset data;
producing an ETL job registration and a data asset consumer identifier, based on at least one of the ETL job definition, the data asset definition, the data asset dependency definition, and the source asset data, wherein the data asset consumer identifier includes a roll back plan to remove the effect of one or more errant source data assets; and
updating a data asset status, a latest asset available date, and a target data asset; and
recognizing when it is a necessary to restart and re-initialize target data assets based on the execution of the ETL flow steps.
1 Assignment
0 Petitions
Accused Products
Abstract
Extract, transform, and load application (ETL) complexity management framework systems and methods are described herein. The present disclosure describes systems and methods that reduce the complexity in managing ETL flow and correcting errant data that is subsequently identified. One or more methods include defining an ETL job definition, defining a data asset definition, defining a data asset dependency definition, receiving an ETL flow to provide execution of one or more ETL flow steps, providing retrieval of data from a source data asset, applying a data control to the source asset data, and producing an ETL job registration, a data asset status, a latest asset available date, a data asset consumer identifier, and a target data asset based on at least one of the ETL job definition, the data asset definition, the data dependency definition, and the source asset data.
37 Citations
13 Claims
-
1. A computer-implemented method of managing one or more processes providing extract, transform, and load (ETL) application complexity management comprising:
-
defining an ETL job definition; defining a data asset definition; defining a data asset dependency definition; receiving an ETL flow to provide execution of one or more ETL flow steps; retrieving data from a source data asset; applying a data control to the source asset data; producing an ETL job registration and a data asset consumer identifier, based on at least one of the ETL job definition, the data asset definition, the data asset dependency definition, and the source asset data, wherein the data asset consumer identifier includes a roll back plan to remove the effect of one or more errant source data assets; and updating a data asset status, a latest asset available date, and a target data asset; and recognizing when it is a necessary to restart and re-initialize target data assets based on the execution of the ETL flow steps. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A non-transitory machine readable medium having a set of machine readable instructions stored thereon for execution by a processor to cause a machine to perform a method of managing one or more processes providing extract, transform, and load (ETL) application complexity management comprising:
-
receiving one or more data asset incidents associated with a particular data asset, one or more data asset consumers associated with the particular data asset, and one or more data asset statuses associated with the particular data asset; producing a roll back plan to remove an effect of one or more errant source data assets from one or more target data asset consumers, wherein the removal of the effects of errant data recalculates totals and counts within one or more data sets; executing the roll back plan and a roll forward plan; and replacing one or more subsets of errant target data assets, each containing at least one data item, with corrected data. - View Dependent Claims (7, 8, 9, 10, 11, 12, 13)
-
Specification