×

Data lineage system

  • US 9,075,860 B2
  • Filed: 03/14/2013
  • Issued: 07/07/2015
  • Est. Priority Date: 10/18/2012
  • Status: Active Grant
First Claim
Patent Images

1. A non-transitory computer-readable medium having instructions stored thereon that, when executed by a processor, cause the processor to trace a data lineage of a data warehouse comprising one or more data tables, wherein each data table comprises one or more data records, the tracing comprising:

  • mapping a target data element to one or more source data elements, wherein the target data element comprises a column of a target table definition, and wherein each source data element comprises a column of a source table definition;

    extending the target table definition to comprise one or more auxiliary columns to store one or more source surrogate keys;

    storing, for each source data element, a data lineage mapping system record within a data lineage mapping system table that represents the mapping of the target data element and a corresponding source data element together with a column identity of an auxiliary column in a target table that stores a source surrogate key;

    storing one or more source surrogate key values within one or more auxiliary columns of a target data record at the time the target data record is created or modified;

    storing, for each target data record, one or more shadow system records within a shadow system table that represents a mapping of a source data record and a corresponding target data record source surrogate key value;

    wherein the data lineage comprises the one or more data lineage mapping system records, the one or more shadow system records, and the one or more source surrogate keys;

    storing a filter column identity within a first auxiliary column of the target data record, wherein the filter column identity identifies a filter column of the source data record; and

    storing a filter value within a second auxiliary column of the target data record, wherein the filter value is a filter value of the source data record.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×