Data quality management for profiling, linking, cleansing, and migrating data
First Claim
Patent Images
1. A data quality management system comprising:
- a rules repository configured to store data quality rules that relate to at least one of profiling or cleansing data;
a rules management module configured to manage the rules repository by accessing data to be at least one of profiled or cleansed, setting parameters for rule discovery, analyzing the accessed data based on the set parameters for rule discovery, discovering, based on the analysis, at least one data quality rule determined to be appropriate for at least one of profiling or cleansing the accessed data, and storing the at least one discovered data quality rule in the rules repository with other data quality rules; and
a data quality job management module configured to migrate data quality rules from the rules repository to a data quality processing system and manage a data quality process performed by the data quality processing system using the migrated data quality rules, the migrated data quality rules including the at least one discovered data quality rule and the other data quality rules and the data quality job management module being configured to control the data quality processing system to execute the at least one discovered data quality rule and the other data quality rules.
1 Assignment
0 Petitions
Accused Products
Abstract
A data quality management system includes a rules repository configured to store profiling data quality rules, cleansing data quality rules, and linking data that links profiling data quality rules to cleansing data quality rules. The data quality management system also includes a rules management module configured to manage the rules repository. The data quality management system further includes a data quality job management module configured to migrate data quality rules from the rules repository to a data quality processing system and manage a data quality process performed by the data quality processing system using the migrated data quality rules.
32 Citations
20 Claims
-
1. A data quality management system comprising:
-
a rules repository configured to store data quality rules that relate to at least one of profiling or cleansing data; a rules management module configured to manage the rules repository by accessing data to be at least one of profiled or cleansed, setting parameters for rule discovery, analyzing the accessed data based on the set parameters for rule discovery, discovering, based on the analysis, at least one data quality rule determined to be appropriate for at least one of profiling or cleansing the accessed data, and storing the at least one discovered data quality rule in the rules repository with other data quality rules; and a data quality job management module configured to migrate data quality rules from the rules repository to a data quality processing system and manage a data quality process performed by the data quality processing system using the migrated data quality rules, the migrated data quality rules including the at least one discovered data quality rule and the other data quality rules and the data quality job management module being configured to control the data quality processing system to execute the at least one discovered data quality rule and the other data quality rules. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A data quality management system comprising:
-
a rules repository configured to store data quality rules that relate to at least one of profiling or cleansing data; a rules management module configured to manage the rules repository by managing the data quality rules that relate to at least one of profiling or cleansing data; and a data quality job management module configured to migrate data quality rules from the rules repository to a data quality processing system, manage a data quality process performed by the data quality processing system using the migrated data quality rules, and control the data quality processing system to execute the migrated data quality rules, wherein the rules management module is configured to enable searching for and modification of data quality rules stored in the rules repository by; receiving user input defining a search query for data quality rules in the rules repository, performing a search for data quality rules in the rules repository based on the search query, identifying data quality rules in the rules repository that match the search query based on performance of the search, presenting the identified data quality rules with one or more controls for a user to select one of the identified data quality rules, receiving user input selecting a data quality rule from among the identified data quality rules, locking the selected data quality rule based on the selection, allowing a modification to the selected data quality rule based on locking the selected data quality rule, and preventing changes to the identified data quality rules that are not locked. - View Dependent Claims (10, 11, 12, 13, 14, 15)
-
-
16. A data quality management system comprising:
-
a rules repository configured to store data quality rules that relate to at least one of profiling or cleansing data; a rules management module configured to manage the rules repository by organizing the data quality rules that relate to at least one of profiling or cleansing data by industry and storing, in the rules repository, the data quality rules organized by industry; and a data quality job management module configured to migrate data quality rules from the rules repository to a data quality processing system and manage a data quality process performed by the data quality processing system using the migrated data quality rules by; receiving a user selection of a particular industry among industries by which the data quality rules are organized; identifying a subset of the data quality rules that are relevant to the particular industry; presenting, to a user for selection, the identified subset of the data quality rules; receiving user input selecting, from among the identified subset of the data quality rules, data quality rules to migrate to the data quality processing system; transforming the selected data quality rules to a format suitable for the data quality processing system; sending the transformed data quality rules to the data quality processing system; and controlling the data quality processing system to execute the transformed data quality rules. - View Dependent Claims (17, 18, 19, 20)
-
Specification