Intelligent data quality

  • US 10,558,629 B2
  • Filed: 05/28/2019
  • Issued: 02/11/2020
  • Est. Priority Date: 05/29/2018
  • Status: Active Grant
First Claim

1. A system comprising:

  • a processor;

    a data profiler coupled to the processor, the data profiler to:

    receive a query from a user, the query to indicate a data quality requirement relevant for data management operations;

    obtain target data from a plurality of data sources associated with the data quality requirement; and

    implement an artificial intelligence component to:

    sort the target data into a data cascade, the data cascade to include a plurality of attributes identified by the artificial intelligence component for the target data, each of the attributes from the plurality of attributes being associated with the data quality requirement, wherein the data cascade includes information about an attribute from the plurality of attributes that is linked to another attribute from the plurality of attributes in a sequential manner; and

    identify a combination of attributes from the plurality of attributes for generating a data pattern model, the combination including at least one attribute usable for generating the data pattern model;

    a data mapper coupled to the processor, the data mapper to:

    implement a first cognitive learning operation to:

    determine at least one mapping context associated with the data quality requirement from the data cascade and the data pattern model, the mapping context to include a pattern value from the data pattern model and at least one attribute from the data cascade; and

    determine a conversion rule from the data pattern model for each of the mapping context associated with the data quality requirement; and

    a data cleanser coupled to the processor, the data cleanser to:

    obtain the data pattern model for each attribute associated with the data quality requirement;

    obtain the conversion rule determined for each of the mapping context associated with the data quality requirement;

    establish a data harmonization model corresponding to the data quality requirement by performing a second cognitive learning operation on the obtained data pattern model domain and the obtained conversion rule;

    determine a data harmonization index indicative of a level of harmonization achieved in the target data, wherein the data harmonization index provides a quantitative measure of the quality of target data achieved through the selection of the data pattern model, the conversion rule, and the data harmonization model;

    modify at least one of the data pattern model, the conversion rule, and the data harmonization model based on the data harmonization index; and

    generate a data cleansing result corresponding to the data quality requirement, the data cleansing result comprising the data harmonization model relevant for resolution to the query.
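
The claim does not say how the data pattern model, the conversion rule, or the data harmonization index are computed. The following is only a minimal sketch of one way the profile, map, and cleanse flow could look for a single attribute. It makes several assumptions that are not in the patent: a "pattern" is a coarse character shape (digits become 9, letters become A), the conversion rule maps every minority shape to the most common one, and the index is the fraction of values already in that shape. All function names are hypothetical.

```python
import re
from collections import Counter


def pattern_of(value: str) -> str:
    """Reduce a value to a coarse shape: digits -> 9, letters -> A."""
    shape = re.sub(r"[0-9]", "9", value)
    return re.sub(r"[A-Za-z]", "A", shape)


def profile(values):
    """Assumed stand-in for a 'data pattern model': a count of each shape."""
    return Counter(pattern_of(v) for v in values)


def conversion_rules(model):
    """Assumed stand-in for a 'conversion rule': map minority shapes to the dominant one."""
    target = model.most_common(1)[0][0]
    rules = {shape: target for shape in model if shape != target}
    return rules, target


def harmonization_index(values, target):
    """Assumed metric: share of values that already match the target shape."""
    if not values:
        return 1.0
    return sum(pattern_of(v) == target for v in values) / len(values)


def cleanse(values, target):
    """Toy harmonization for digit-only shapes: refill the target shape with each value's digits."""
    out = []
    for v in values:
        digits = re.sub(r"\D", "", v)
        convertible = "A" not in target and len(digits) == target.count("9")
        if pattern_of(v) != target and convertible:
            it = iter(digits)
            v = "".join(next(it) if ch == "9" else ch for ch in target)
        out.append(v)
    return out


phones = ["555-0100", "555-0101", "5550102", "555 0103"]
model = profile(phones)
rules, target = conversion_rules(model)
before = harmonization_index(phones, target)
cleaned = cleanse(phones, target)
after = harmonization_index(cleaned, target)
print(target, before, after)
```

In this toy setup the index plays the feedback role the claim describes: a low value after cleansing would be the signal to revisit the pattern model or the conversion rules. The claimed system derives these with cognitive learning operations, which this sketch does not attempt to reproduce.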
