Data quality management for profiling, linking, cleansing and migrating data
First Claim
Patent Images
1. A data quality management system comprising:
- a rules repository configured to store profiling data quality rules, cleansing data quality rules, and linking data that links profiling data quality rules to cleansing data quality rules;
a rules management module configured to manage the rules repository by managing the profiling data quality rules stored in the rules repository, managing the cleansing data quality rules stored in the rules repository, and managing the linking data stored in the rules repository; and
a data quality job management module configured to migrate data quality rules from the rules repository to a data quality processing system and manage a data quality process performed by the data quality processing system using the migrated data quality rules, the migrated data quality rules including one or more linked pairs of data quality rules that have a profiling data quality rule linked to a cleansing data quality rule by the linking data and the data quality job management module being configured to control the data quality processing system to execute the one or more linked pairs of data quality rules,wherein the rules management module is configured to manage the profiling data quality rules stored in the rules repository by accessing profiling and cleansing data quality rules inferred from an analysis of data to be profiled or cleansed and storing the inferred profiling and cleansing data quality rules in the rules repository.
1 Assignment
0 Petitions
Accused Products
Abstract
A data quality management system includes a rules repository configured to store profiling data quality rules, cleansing data quality rules, and linking data that links profiling data quality rules to cleansing data quality rules. The data quality management system also includes a rules management module configured to manage the rules repository. The data quality management system further includes a data quality job management module configured to migrate data quality rules from the rules repository to a data quality processing system and manage a data quality process performed by the data quality processing system using the migrated data quality rules.
-
Citations
20 Claims
-
1. A data quality management system comprising:
-
a rules repository configured to store profiling data quality rules, cleansing data quality rules, and linking data that links profiling data quality rules to cleansing data quality rules; a rules management module configured to manage the rules repository by managing the profiling data quality rules stored in the rules repository, managing the cleansing data quality rules stored in the rules repository, and managing the linking data stored in the rules repository; and a data quality job management module configured to migrate data quality rules from the rules repository to a data quality processing system and manage a data quality process performed by the data quality processing system using the migrated data quality rules, the migrated data quality rules including one or more linked pairs of data quality rules that have a profiling data quality rule linked to a cleansing data quality rule by the linking data and the data quality job management module being configured to control the data quality processing system to execute the one or more linked pairs of data quality rules, wherein the rules management module is configured to manage the profiling data quality rules stored in the rules repository by accessing profiling and cleansing data quality rules inferred from an analysis of data to be profiled or cleansed and storing the inferred profiling and cleansing data quality rules in the rules repository. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 14, 15, 16, 17, 20)
-
-
11. A data quality management system comprising:
-
a rules repository configured to store profiling data quality rules, cleansing data quality rules, and linking data that links profiling data quality rules to cleansing data quality rules; a rules management module configured to manage the rules repository by managing the profiling data quality rules stored in the rules repository, managing the cleansing data quality rules stored in the rules repository, and managing the linking data stored in the rules repository; and a data quality job management module configured to migrate data quality rules from the rules repository to a data quality processing system and manage a data quality process performed by the data quality processing system using the migrated data quality rules, the migrated data quality rules including one or more linked pairs of data quality rules that have a profiling data quality rule linked to a cleansing data quality rule by the linking data and the data quality job management module being configured to control the data quality processing system to execute the one or more linked pairs of data quality rules, wherein the rules management module is configured to receive user input defining a search query for data quality rules in the rules repository, perform a search for data quality rules in the rules repository based on the search query, identify data quality rules in the rules repository that match the search query based on performance of the search, present the identified data quality rules with one or more controls for a user to select one of the identified data quality rules, receive user input selecting a data quality rule from among the identified data quality rules, lock the selected data quality rule based on the selection, allow a change to the selected data quality rule based on locking the selected data quality rule, and prevent changes to data quality rules that are not locked. - View Dependent Claims (12, 13)
-
-
18. A method comprising:
-
storing, in a rules repository, profiling data quality rules, cleansing data quality rules, and linking data that links profiling data quality rules to cleansing data quality rules; managing the rules repository by managing the profiling data quality rules stored in the rules repository, managing the cleansing data quality rules stored in the rules repository, and managing the linking data stored in the rules repository; migrating data quality rules from the rules repository to a data quality processing system, the migrated data quality rules including one or more linked pairs of data quality rules that have a profiling data quality rule linked to a cleansing data quality rule by the linking data; managing a data quality process performed by the data quality processing system using the migrated data quality rules, the managing including controlling the data quality processing system to execute the one or more linked pairs of data quality rules, receiving user input defining a search query for data quality rules in the rules repository; performing a search for data quality rules in the rules repository based on the search query; identifying data quality rules in the rules repository that match the search query based on performance of the search; presenting the identified data quality rules with one or more controls for a user to select one of the identified data quality rules; receiving user input selecting a data quality rule from among the identified data quality rules, locking the selected data quality rule based on the selection; allowing a change to the selected data quality rule based on locking the selected data quality rule; and preventing changes to data quality rules that are not locked.
-
-
19. At least one non-transitory computer-readable storage medium encoded with executable instructions that, when executed by at least one processor, cause the at least one processor to perform operations comprising:
-
storing, in a rules repository, profiling data quality rules, cleansing data quality rules, and linking data that links profiling data quality rules to cleansing data quality rules; managing the rules repository by managing the profiling data quality rules stored in the rules repository, managing the cleansing data quality rules stored in the rules repository, and managing the linking data stored in the rules repository; migrating data quality rules from the rules repository to a data quality processing system, the migrated data quality rules including one or more linked pairs of data quality rules that have a profiling data quality rule linked to a cleansing data quality rule by the linking data; managing a data quality process performed by the data quality processing system using the migrated data quality rules, the managing including controlling the data quality processing system to execute the one or more linked pairs of data quality rules, receiving user input defining a search query for data quality rules in the rules repository; performing a search for data quality rules in the rules repository based on the search query; identifying data quality rules in the rules repository that match the search query based on performance of the search; presenting the identified data quality rules with one or more controls for a user to select one of the identified data quality rules; receiving user input selecting a data quality rule from among the identified data quality rules, locking the selected data quality rule based on the selection; allowing a change to the selected data quality rule based on locking the selected data quality rule; receiving user input to delete the selected data quality rule when the selected data quality rule is locked; deleting the selected data quality rule based on the user input to delete the selected data quality rule received when the selected data quality rule is locked; and preventing changes to data quality rules that are not locked.
-
Specification