Data quality management for profiling, linking, cleansing, and migrating data
First Claim
Patent Images
1. A data quality management system comprising:
- a rules repository configured to store profiling data quality rules, cleansing data quality rules, and linking data that links profiling data quality rules to cleansing data quality rules, the linking data defining a particular cleansing data quality rule to be automatically executed on data after identification of at least one potential error in the data uncovered during execution of a corresponding particular profiling data quality rule on the data;
a rules management module configured to manage the rules repository by managing the profiling data quality rules stored in the rules repository, managing the cleansing data quality rules stored in the rules repository, and managing the linking data stored in the rules repository; and
a data quality job management module configured to migrate data quality rules from the rules repository to a data quality processing system and manage a data quality process performed by the data quality processing system using the migrated data quality rules, the migrated data quality rules including one or more linked pairs of data quality rules that each have a profiling data quality rule linked to a cleansing data quality rule by the linking data, the one or more linked pairs of data quality rules including a pair with the particular profiling data quality rule and the particular cleansing data quality rule, and the data quality job management module being configured to control the data quality processing system to execute the particular profiling data quality rule on particular data and, in response to determining that the particular data includes at least one potential error during the execution of the particular profiling data quality rule, execute the particular cleansing data quality rule on the particular data.
1 Assignment
0 Petitions
Accused Products
Abstract
A data quality management system includes a rules repository configured to store profiling data quality rules, cleansing data quality rules, and linking data that links profiling data quality rules to cleansing data quality rules. The data quality management system also includes a rules management module configured to manage the rules repository. The data quality management system further includes a data quality job management module configured to migrate data quality rules from the rules repository to a data quality processing system and manage a data quality process performed by the data quality processing system using the migrated data quality rules.
-
Citations
57 Claims
-
1. A data quality management system comprising:
-
a rules repository configured to store profiling data quality rules, cleansing data quality rules, and linking data that links profiling data quality rules to cleansing data quality rules, the linking data defining a particular cleansing data quality rule to be automatically executed on data after identification of at least one potential error in the data uncovered during execution of a corresponding particular profiling data quality rule on the data; a rules management module configured to manage the rules repository by managing the profiling data quality rules stored in the rules repository, managing the cleansing data quality rules stored in the rules repository, and managing the linking data stored in the rules repository; and a data quality job management module configured to migrate data quality rules from the rules repository to a data quality processing system and manage a data quality process performed by the data quality processing system using the migrated data quality rules, the migrated data quality rules including one or more linked pairs of data quality rules that each have a profiling data quality rule linked to a cleansing data quality rule by the linking data, the one or more linked pairs of data quality rules including a pair with the particular profiling data quality rule and the particular cleansing data quality rule, and the data quality job management module being configured to control the data quality processing system to execute the particular profiling data quality rule on particular data and, in response to determining that the particular data includes at least one potential error during the execution of the particular profiling data quality rule, execute the particular cleansing data quality rule on the particular data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A method comprising:
-
storing, in a rules repository, profiling data quality rules, cleansing data quality rules, and linking data that links profiling data quality rules to cleansing data quality rules, the linking data defining a particular cleansing data quality rule to be automatically executed on data after identification of at least one potential error in the data uncovered during execution of a corresponding particular profiling data quality rule on the data; managing the rules repository by managing the profiling data quality rules stored in the rules repository, managing the cleansing data quality rules stored in the rules repository, and managing the linking data stored in the rules repository; migrating data quality rules from the rules repository to a data quality processing system, the migrated data quality rules including one or more linked pairs of data quality rules that each have a profiling data quality rule linked to a cleansing data quality rule by the linking data, the one or more linked pairs of data quality rules including a pair with the particular profiling data quality rule and the particular cleansing data quality rule; and managing a data quality process performed by the data quality processing system using the migrated data quality rules, the managing including controlling the data quality processing system to execute the particular profiling data quality rule on particular data and, in response to determining that the particular data includes at least one potential error during the execution of the particular profiling data quality rule, execute the particular cleansing data quality rule on the particular data. - View Dependent Claims (21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38)
-
-
39. At least one non-transitory computer-readable storage medium encoded with executable instructions that, when executed by at least one processor, cause the at least one processor to perform operations comprising:
-
storing, in a rules repository, profiling data quality rules, cleansing data quality rules, and linking data that links profiling data quality rules to cleansing data quality rules, the linking data defining a particular cleansing data quality rule to be automatically executed on data after identification of at least one potential error in the data uncovered during execution of a corresponding particular profiling data quality rule on the data; managing the rules repository by managing the profiling data quality rules stored in the rules repository, managing the cleansing data quality rules stored in the rules repository, and managing the linking data stored in the rules repository; migrating data quality rules from the rules repository to a data quality processing system, the migrated data quality rules including one or more linked pairs of data quality rules that each have a profiling data quality rule linked to a cleansing data quality rule by the linking data, the one or more linked pairs of data quality rules including a pair with the particular profiling data quality rule and the particular cleansing data quality rule; and managing a data quality process performed by the data quality processing system using the migrated data quality rules, the managing including controlling the data quality processing system to execute the particular profiling data quality rule on particular data and, in response to determining that the particular data includes at least one potential error during the execution of the particular profiling data quality rule, execute the particular cleansing data quality rule on the particular data. - View Dependent Claims (40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57)
-
Specification