Intelligent data quality
First Claim
1. A system comprising:
- a processor;
a data profiler coupled to the processor, the data profiler to;
receive a query from a user, the query to indicate a data quality requirement relevant for data management operations;
obtain target data from a plurality of data sources associated with the data quality requirement; and
implement an artificial intelligence component to;
sort the target data into a data cascade, the data cascade to include a plurality of attributes identified by the artificial intelligence component for the target data, each of the attributes from the plurality of attributes being associated with the data quality requirement, wherein the data cascade includes information about an attribute from the plurality of attributes that is linked to another attribute from the plurality of attributes in a sequential manner; and
identify a combination of attributes from the plurality of attributes for generating a data pattern model, the combination including at least one attribute usable for generating the data pattern model;
a data mapper coupled to the processor, the data mapper to;
implement a first cognitive learning operation to;
determine at least one mapping context associated with the data quality requirement from the data cascade and the data pattern model, the mapping context to include a pattern value from the data pattern model and at least one attribute from the data cascade; and
determine a conversion rule from the data pattern model for each of the mapping context associated with the data quality requirement; and
a data cleanser coupled to the processor, the data cleanser to;
obtain the data pattern model for each attribute associated with the data quality requirement;
obtain the conversion rule determined for each of the mapping context associated with the data quality requirement;
establish a data harmonization model corresponding to the data quality requirement by performing a second cognitive learning operation on the obtained data pattern model domain and the obtained conversion rule;
determine a data harmonization index indicative of a level of harmonization achieved in the target data, wherein the data harmonization index provides a quantitative measure of the quality of target data achieved through the selection of the data pattern model, the conversion rule, and the data harmonization model;
modify at least one of the data pattern model, the conversion rule, and the data harmonization model based on the data harmonization index; and
generate a data cleansing result corresponding to the data quality requirement, the data cleansing result comprising the data harmonization model relevant for resolution to the query.
1 Assignment
0 Petitions
Accused Products
Abstract
Examples of an intelligent data quality application are defined. In an example, the system receives a data quality requirement from a user. The system obtains target data from a plurality of data sources. The system implements an artificial intelligence component sort the target data into a data cascade. The data cascade may include a plurality of attributes associated with the data quality requirement. The system may evaluate the data cascade to identify a data pattern model for each of the attributes. The system may implement a first cognitive learning operation to determine a mapping context from the data cascade and a conversion rule from the data pattern model. The system may establish a data harmonization model corresponding to the data quality requirement by performing a second cognitive learning operation. The system may generate a data cleansing result corresponding to the data quality requirement.
16 Citations
20 Claims
-
1. A system comprising:
-
a processor; a data profiler coupled to the processor, the data profiler to; receive a query from a user, the query to indicate a data quality requirement relevant for data management operations; obtain target data from a plurality of data sources associated with the data quality requirement; and implement an artificial intelligence component to; sort the target data into a data cascade, the data cascade to include a plurality of attributes identified by the artificial intelligence component for the target data, each of the attributes from the plurality of attributes being associated with the data quality requirement, wherein the data cascade includes information about an attribute from the plurality of attributes that is linked to another attribute from the plurality of attributes in a sequential manner; and identify a combination of attributes from the plurality of attributes for generating a data pattern model, the combination including at least one attribute usable for generating the data pattern model; a data mapper coupled to the processor, the data mapper to; implement a first cognitive learning operation to; determine at least one mapping context associated with the data quality requirement from the data cascade and the data pattern model, the mapping context to include a pattern value from the data pattern model and at least one attribute from the data cascade; and determine a conversion rule from the data pattern model for each of the mapping context associated with the data quality requirement; and a data cleanser coupled to the processor, the data cleanser to; obtain the data pattern model for each attribute associated with the data quality requirement; obtain the conversion rule determined for each of the mapping context associated with the data quality requirement; establish a data harmonization model corresponding to the data quality requirement by performing a second cognitive learning operation on the obtained data pattern model domain and the obtained conversion rule; determine a data harmonization index indicative of a level of harmonization achieved in the target data, wherein the data harmonization index provides a quantitative measure of the quality of target data achieved through the selection of the data pattern model, the conversion rule, and the data harmonization model; modify at least one of the data pattern model, the conversion rule, and the data harmonization model based on the data harmonization index; and generate a data cleansing result corresponding to the data quality requirement, the data cleansing result comprising the data harmonization model relevant for resolution to the query. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method comprising:
-
receiving, by a processor, a query from a user, the query being indicative of a data quality requirement for data management operations; obtaining, by the processor, target data from a plurality of data sources associated with the data quality requirement; implementing, by the processor, an artificial intelligence component to sort the target data into a data cascade, the data cascade to include a plurality of attributes identified by the artificial intelligence component for the target data, each of the attributes from the plurality of attributes being associated with the data quality requirement, wherein the data cascade includes information about an attribute from the plurality of attributes that is linked to another attribute from the plurality of attributes in a sequential manner; identifying a combination of attributes from the plurality of attributes for generating a data pattern model, the combination including at least one attribute usable for generating the data pattern model; implementing, by the processor, a first cognitive learning operation to determine at least one mapping context associated with the data quality requirement from the data cascade and the data pattern model, the mapping context to include a pattern value from the data pattern model and at least one attribute from the data cascade; determining, by the processor, a conversion rule from the data pattern model for each of the mapping context associated with the data quality requirement; obtaining, by the processor, the data pattern model for each attribute associated with the data quality requirement; obtaining, by the processor, the conversion rule determined for each of the mapping context associated with the data quality requirement; establishing, by the processor, a data harmonization model corresponding to the data quality requirement by performing a second cognitive learning operation on the obtained data pattern model domain and the obtained conversion rule; determining a data harmonization index indicative of a level of harmonization achieved in the target data, wherein the data harmonization index provides a quantitative measure of the quality of target data achieved through the selection of the data pattern model, the conversion rule, and the data harmonization model; modifying at least one of the data pattern model, the conversion rule, and the data harmonization model based on the data harmonization index; and generating, by the processor, a data cleansing result corresponding to the data quality requirement, the data cleansing result comprising the data harmonization model relevant for resolution to the query. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A non-transitory computer readable medium including machine readable instructions that are executable by a processor to:
-
receive a query from a user, the query being indicative of a data quality requirement associated with data management operations; obtain target data from a plurality of data sources associated with the data quality requirement; implement an artificial intelligence component to sort the target data into a data cascade, the data cascade to include a plurality of attributes identified by the artificial intelligence component for the target data, each of the attributes from the plurality of attributes being associated with the data quality requirement, wherein the data cascade includes information about an attribute from the plurality of attributes that is linked to another attribute from the plurality of attributes in a sequential manner; identify a combination of attributes from the plurality of attributes for generating a data pattern model, the combination including at least one attribute usable for generating the data pattern model; implement a first cognitive learning operation to determine at least one mapping context associated with the data quality requirement from the data cascade and the data pattern model, the mapping context to include a pattern value from the data pattern model and at least one attribute from the data cascade; determine a conversion rule from the data pattern model for each of the mapping context associated with the data quality requirement; obtain the data pattern model for each attribute associated with the data quality requirement; obtain the conversion rule determined for each of the mapping context associated with the data quality requirement; establish a data harmonization model corresponding to the data quality requirement by performing a second cognitive learning operation on the obtained data pattern model domain and the obtained conversion rule; determine a data harmonization index indicative of a level of harmonization achieved in the target data, wherein the data harmonization index provides a quantitative measure of the quality of target data achieved through the selection of the data pattern model, the conversion rule, and the data harmonization model; modify at least one of the data pattern model, the conversion rule, and the data harmonization model based on the data harmonization index; and generate a data cleansing result corresponding to the data quality requirement, the data cleansing result comprising the data harmonization model relevant for resolution to the query. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification