METHODS AND SYSTEMS FOR DATA CLEANING
First Claim
1. A method for cleaning data stored in a database, the method comprising:
- providing a set of fixing rules, each fixing rule incorporating;
a set of attribute values that capture an error in a plurality of semantically related attribute values, anda deterministic correction which is operable to replace one of the set of attribute values with a correct attribute value to correct the error,wherein the method further comprises;
comparing at least two of the fixing rules with one another to check that the error correction carried out by one fixing rule is consistent with the error correction carried out by another fixing rule.
1 Assignment
0 Petitions
Accused Products
Abstract
A method for cleaning data stored in a database, the method comprising providing a set of fixing rules. Each fixing rule incorporates a set of attribute values that capture an error in a plurality of semantically related attribute values, and a deterministic correction which is operable to replace one of the set of attribute values with a correct attribute value to correct the error. The method further comprises comparing at least two of the fixing rules with one another to check that the error correction carried out by one fixing rule is consistent with the error correction carried out by another fixing rule.
-
Citations
21 Claims
-
1. A method for cleaning data stored in a database, the method comprising:
-
providing a set of fixing rules, each fixing rule incorporating; a set of attribute values that capture an error in a plurality of semantically related attribute values, and a deterministic correction which is operable to replace one of the set of attribute values with a correct attribute value to correct the error, wherein the method further comprises; comparing at least two of the fixing rules with one another to check that the error correction carried out by one fixing rule is consistent with the error correction carried out by another fixing rule. - View Dependent Claims (2, 3, 4, 5, 7, 8, 21)
-
-
6. (canceled)
-
9. A method for cleaning data stored in a database, the method comprising:
-
providing a set of fixing rules, each fixing rule incorporating; a set of attribute values that capture an error in a plurality of semantically related attribute values, and a deterministic correction which is operable to replace one of the set of attribute values with a correct attribute value to correct the error, wherein the method comprises; applying at least one of the fixing rules to a plurality of tuples stored in a database to detect if at least one of the tuples comprises the respective set of attribute values that captures the error and, if the respective set of attribute values is detected, applying the deterministic correction to correct the error in the at least one tuple. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20)
-
Specification