Data validation using signatures and sampling
First Claim
1. A method that is at least partially executed upon a processor for facilitating validation of a mapping of data between a raw data source and a raw data target, comprising:
determining at least one of a type of data or amount of the data of the raw data source to utilize for validation;
determining at least one of a sampling rate or an amount of compression of the data for the raw data source and the raw data target;
generating a source data signature of the raw data source and a target data signature of the raw data target utilizing the sampling rate, the amount of compression, or a combination thereof;
obtaining respective data samples of the raw data source and the raw data target;
processing the data samples and data signatures to determine status of a validation process through comparison of the sample the raw source data against corresponding raw target data as well as comparison of the source data signature against the target data signature; and
transforming the signature compare result and the sample compare result into a construct in a logical manner.
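The steps recited in the claim can be illustrated with a minimal Python sketch. It is an illustration under stated assumptions, not the patented implementation: the source does not specify a hash function, a sampling scheme, or the shape of the resulting construct. SHA-256, a seeded pseudo-random sampler, and all names here (`signature`, `sample_indices`, `ValidationResult`, `validate_mapping`) are hypothetical choices made for the example, and the "construct" is assumed to be a simple record combining both comparison results with a logical AND.

```python
import hashlib
import random
from dataclasses import dataclass, field


def signature(rows, sampling_rate=1.0, seed=0):
    """Hash the rows picked by a seeded sampler into a single digest.

    Assumption: SHA-256 over repr(row); the source names no hash function.
    The same seed selects the same row positions for source and target.
    """
    rng = random.Random(seed)
    digest = hashlib.sha256()
    for row in rows:
        if rng.random() < sampling_rate:  # rate 1.0 keeps every row
            digest.update(repr(row).encode("utf-8"))
    return digest.hexdigest()


def sample_indices(n, sample_size, seed=0):
    """Choose the row positions to compare directly, in ascending order."""
    rng = random.Random(seed)
    return sorted(rng.sample(range(n), min(sample_size, n)))


@dataclass
class ValidationResult:
    """Hypothetical construct holding both comparison results."""
    signatures_match: bool
    samples_match: bool
    mismatched_rows: list = field(default_factory=list)

    @property
    def valid(self):
        # Assumed combination rule: both comparisons must succeed.
        return self.signatures_match and self.samples_match


def validate_mapping(source, target, sampling_rate=1.0, sample_size=3, seed=0):
    """Compare signatures and row samples of a source and a target."""
    sig_ok = (signature(source, sampling_rate, seed)
              == signature(target, sampling_rate, seed))
    n = min(len(source), len(target))
    bad = [i for i in sample_indices(n, sample_size, seed)
           if source[i] != target[i]]
    return ValidationResult(sig_ok, not bad, bad)
```

Calling `validate_mapping(source, target)` on two lists of rows returns a `ValidationResult` whose `valid` property reflects both checks. A lower `sampling_rate` trades signature coverage for speed, so a matching signature at a rate below 1.0 is evidence of a correct mapping rather than proof.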
Abstract
Architecture that facilitates validation of a data mapping of data from a data source to a data target. There is included a signature generation component that generates a source signature of all or a portion of the data source and a target signature of all or a corresponding portion of the data target, and a sampling component that obtains a sample of the source data and a corresponding sample of the target data. The data signatures and data samples are compared respectively and processed with a processing component to determine the status of the validation process.
45 Citations
5 Claims
Specification