Detecting Inconsistent Data Records
First Claim
1. A computer-implemented method for detecting a set of inconsistent data records in a database including multiple records, comprising:
- selecting a data quality rule representing a functional dependency for the database;
transforming the data quality rule into at least one rule vector with hashed components;
selecting a set of attributes of the database;
transforming at least one record of the database selected on the basis of the selected attributes into a record vector with hashed components;
computing a dot product of the rule and record vectors to generate a measure representing violation of the data quality rule by the record.
1 Assignment
0 Petitions
Accused Products
Abstract
A computer-implemented method for detecting a set of inconsistent data records in a database including multiple records, comprises selecting a data quality rule representing a functional dependency for the database, transforming the data quality rule into at least one rule vector with hashed components, selecting a set of attributes of the database, transforming at least one record of the database selected on the basis of the selected attributes into a record vector with hashed components, computing a dot product of the rule and record vectors to generate a measure representing violation of the data quality rule by the record.
-
Citations
16 Claims
-
1. A computer-implemented method for detecting a set of inconsistent data records in a database including multiple records, comprising:
-
selecting a data quality rule representing a functional dependency for the database; transforming the data quality rule into at least one rule vector with hashed components; selecting a set of attributes of the database; transforming at least one record of the database selected on the basis of the selected attributes into a record vector with hashed components; computing a dot product of the rule and record vectors to generate a measure representing violation of the data quality rule by the record. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 15, 16)
-
-
14. A computer program embedded on a non-transitory tangible computer readable storage medium, the computer program including machine readable instructions that, when executed by a processor, implement a method for detecting a set of inconsistent data records in a database including multiple records, comprising:
-
selecting a set of attributes and generating a set of data vectors from hashcodes of the corresponding attribute values of a set of records of the database; selecting a conditional functional dependency rule and generating a rule vector from hashcodes of the rule; computing a secure dot product of the data and rule vectors to determine an inconsistent record in the database.
-
Specification