Systems and methods for data quality management
First Claim
1. A system for performing data processing on a structured data set, comprisinga query mechanism for providing logical operations for selectively processing said structured data set to generate a query result signal,a memory device having storage for an error model, said error model comprising error model data representative of error in said structured data set and probability data representative of a probability distribution of said error in said structured data set, anda propagation monitor for detecting propagation of said error from said structured data set to said query result signal and for generating in response thereto an error measure signal representative of error in said query result signal.
2 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods that model and measure the propagation of error within information systems. The invention provides data management systems that determine an error measure that represent the accuracy, or inaccuracy of a query result achieved for processing a structured data set. In one embodiment, the invention provides systems that have a model of error which exists within a structured data set. The system can further include an error propagation monitor that processes the error model and the structured data set to determine errors within the structured data set that will propagate to a query result generated by performing a query process on the structured data set. The propagated error represents the error that exists within the query result signal.
93 Citations
29 Claims
-
1. A system for performing data processing on a structured data set, comprising
a query mechanism for providing logical operations for selectively processing said structured data set to generate a query result signal, a memory device having storage for an error model, said error model comprising error model data representative of error in said structured data set and probability data representative of a probability distribution of said error in said structured data set, and a propagation monitor for detecting propagation of said error from said structured data set to said query result signal and for generating in response thereto an error measure signal representative of error in said query result signal.
-
14. A method for measuring error in a query result signal generated from a structured data set, comprising the steps of
providing an error model representative of error in said structured data set and probability data representative of a probability distribution for said error in said structured data set, identifying an instruction signal representative of logical operations for processing said structured data set to generate said query result signal, and processing said structured data set and error model as a function of said instruction signal to generate an error measure representative of error in said query result signal.
Specification