Data validation using signatures and sampling
First Claim
Patent Images
1. A computer system for validating a mapping of data between a raw data source and a raw data target, the computer system comprising:
- a processor coupled to a computer storage medium;
the computer storage medium that stores thereon a plurality of computer software components executable by the processor;
a signature generation component that generates respective data signatures of the raw data source and the raw data target;
a sampling component that samples the raw data source and the raw data target to obtain respective data samples; and
a processing component that processes the data samples and data signatures to determine the status of the validation process,the processing component combining a result of the sample comparison with a result of signature comparison in a logical manner based upon the determined status.
2 Assignments
0 Petitions
Accused Products
Abstract
Architecture that facilitates validation of a data mapping of data from a data source to a data target. There is included a signature generation component that generates a source signature of all or a portion of the data source and a target signature of all or a corresponding portion of the data target, and a sampling component that obtains a sample of the source data a corresponding sample of the target data. The data signatures and data samples are compared respectively and processed with a processing component to determine the status of the validation process.
51 Citations
20 Claims
-
1. A computer system for validating a mapping of data between a raw data source and a raw data target, the computer system comprising:
-
a processor coupled to a computer storage medium; the computer storage medium that stores thereon a plurality of computer software components executable by the processor; a signature generation component that generates respective data signatures of the raw data source and the raw data target; a sampling component that samples the raw data source and the raw data target to obtain respective data samples; and a processing component that processes the data samples and data signatures to determine the status of the validation process, the processing component combining a result of the sample comparison with a result of signature comparison in a logical manner based upon the determined status. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 11, 12, 13, 14, 15, 16, 17, 18, 19)
-
-
10. One or more computer-readable media storing computer-useable instructions that, when used by one or more computing devices, cause the one or more computer devices to perform a process for validating a mapping of data between a raw data source and a raw data target, the process comprising:
-
generating respective data signatures of the raw data source and the raw data target; and processing the data signatures and data samples of each of the raw data source and the raw data target to determine the status of the validation process, the validation process having associated therewith a level of confidence, wherein the level of confidence increases by performing at least one of increasing a number of the data signatures or decreasing a level of compression, and wherein the level of compression is associated with at least one of the raw data source and the raw data target.
-
-
20. A computer system for validating a mapping of data between a data source and a data target, the computer system comprising:
-
a processor coupled to a computer storage medium; the computer storage medium that stores thereon a plurality of computer software components executable by the processor; a signature generation component that generates respective data signatures of the data source and the data target; a sampling component that samples the data source and the data target to obtain respective data samples, wherein the data source and data target are raw; a processing component that processes the data samples and data signatures to determine the status of the validation process; a first comparison component that compares the sample of the source data against the corresponding sample target data; and a second comparison component that compares the signature of the source data against the corresponding sample target data, the processing component combining a result of the sample comparison with a result of the signature comparison in a logical manner.
-
Specification