Data asset reconstruction
First Claim
Patent Images
1. A system for data asset reconstruction, comprising:
- a memory having computer readable instructions; and
a processor for executing the computer readable instructions, the computer readable instructions including;
receiving a data lineage for a data asset, wherein the data lineage identifies a data source connected to the data asset by an intermediate process;
determining whether the data source is downstream or upstream from a lost data asset;
based on determining that the data source is downstream from the lost data asset;
determining whether the intermediate process can be used to reconstruct the lost data asset; and
based on determining that the intermediate process can be used to reconstruct the lost data asset, including determining that the intermediate process is reversible, reconstructing the lost data asset by reversing the intermediate process from the data source to yield the lost data asset,wherein the intermediate process is reversible if the intermediate process is capable of being run in reverse to create the lost data asset as output using the data source as input; and
based on determining that the data source is upstream from the lost data asset;
determining whether the intermediate process can be used to reconstruct the lost data asset; and
based on determining that the intermediate process can be used to reconstruct the lost data asset, including determining that the intermediate process is re-runnable, reconstructing the lost data asset by re-running the intermediate process from the data source to yield the lost data asset,wherein the intermediate process is re-runnable if the intermediate process did not change the data source or if the intermediate process is a known one-to-one function.
1 Assignment
0 Petitions
Accused Products
Abstract
According to an aspect, data asset reconstruction includes receiving a data lineage for a data asset, where the data lineage identifies a data source connected to the data asset by an intermediate process. It is determined whether the intermediate process can be used to reconstruct a lost data asset. Based on determining that the intermediate process can be used to reconstruct the lost data asset, reconstructing the lost data asset with the intermediate process and the data source.
-
Citations
14 Claims
-
1. A system for data asset reconstruction, comprising:
-
a memory having computer readable instructions; and a processor for executing the computer readable instructions, the computer readable instructions including; receiving a data lineage for a data asset, wherein the data lineage identifies a data source connected to the data asset by an intermediate process; determining whether the data source is downstream or upstream from a lost data asset; based on determining that the data source is downstream from the lost data asset; determining whether the intermediate process can be used to reconstruct the lost data asset; and based on determining that the intermediate process can be used to reconstruct the lost data asset, including determining that the intermediate process is reversible, reconstructing the lost data asset by reversing the intermediate process from the data source to yield the lost data asset, wherein the intermediate process is reversible if the intermediate process is capable of being run in reverse to create the lost data asset as output using the data source as input; and based on determining that the data source is upstream from the lost data asset; determining whether the intermediate process can be used to reconstruct the lost data asset; and based on determining that the intermediate process can be used to reconstruct the lost data asset, including determining that the intermediate process is re-runnable, reconstructing the lost data asset by re-running the intermediate process from the data source to yield the lost data asset, wherein the intermediate process is re-runnable if the intermediate process did not change the data source or if the intermediate process is a known one-to-one function. - View Dependent Claims (2, 3, 4, 5, 14)
-
-
6. A method for data asset reconstruction comprising:
-
obtaining, by a processor, a data lineage for a data asset at a memory location, wherein the data lineage identifies a data source related to the data asset by an intermediate process; determining whether the data source is downstream or upstream from a lost data asset; based on determining that the data source is downstream from the lost data asset; analyzing, by the processor, the intermediate process to determine whether it can be used to reconstruct the lost data asset; and based on determining that the intermediate process can be used to reconstruct the lost data asset, including determining that the intermediate process is reversible, reconstructing, the processor, the lost data asset by reversing the intermediate process from the data source to yield the lost data asset, wherein the intermediate process is reversible if the intermediate process is capable of being run in reverse to create the lost data asset as output using the data source as input; and based on determining that the data source is upstream from the lost data asset; determining whether the intermediate process can be used to reconstruct the lost data asset; and based on determining that the intermediate process can be used to reconstruct the lost data asset, including determining that the intermediate process is re-runnable, reconstructing the lost data asset by re-running the intermediate process from the data source to yield the lost data asset, wherein the intermediate process is re-runnable if the intermediate process did not change the data source or if the intermediate process is a known one-to-one function. - View Dependent Claims (7, 8, 9, 10)
-
-
11. A computer program product comprising:
a non-transitory computer readable storage medium having program instructions embodied therewith, the program instructions executable by a processor to perform a method comprising; receiving a data lineage for a data asset, wherein the data lineage identifies a data source connected to the data asset by an intermediate process; determining whether the data source is downstream or upstream from a lost data asset; based on determining that the data source is downstream from the lost data asset; determining whether the intermediate process can be used to reconstruct the lost data asset; and based on determining that the intermediate process can be used to reconstruct the lost data asset, including determining that the intermediate process is reversible, reconstructing the lost data asset by reversing the intermediate process from the data source to yield the lost data asset, wherein the intermediate process is reversible if the intermediate process is capable of being run in reverse to create the lost data asset as output using the data source as input; and based on determining that the data source is upstream from the lost data asset; determining whether the intermediate process can be used to reconstruct the lost data asset; and based on determining that the intermediate process can be used to reconstruct the lost data asset, including determining that the intermediate process is re-runnable, reconstructing the lost data asset by re-running the intermediate process from the data source to yield the lost data asset, wherein the intermediate process is re-runnable if the intermediate process did not change the data source or if the intermediate process is a known one-to-one function. - View Dependent Claims (12, 13)
Specification