Identifying and resolving data quality issues amongst information stored across multiple data sources

  • US 10,268,711 B1
  • Filed: 01/26/2016
  • Issued: 04/23/2019
  • Est. Priority Date: 01/26/2016
  • Status: Active Grant
First Claim

1. A system comprising:

  • one or more processors; and

    a memory that stores instructions that are executable by the one or more processors to cause the system to perform operations comprising:

    receiving a query that identifies an object;

    collecting, based at least in part on the query, first data associated with the object from a first data source, the first data including an attribute defined for the object and the first data source containing the first data associated with the object for first service calls received from a first group of devices;

    collecting, based at least in part on the query, second data associated with the object from a second data source, the second data including the attribute defined for the object and the second data source containing the second data associated with the object for second service calls received from a second group of devices, the second data different than the first data;

    comparing the first data to the second data to identify a data quality issue associated with the attribute, wherein the data quality issue includes one of:

    a first type of data quality issue wherein the attribute collected from the first data source contains a first attribute value and the attribute collected from the second data source contains a missing attribute value;

    a second type of data quality issue wherein the attribute collected from the first data source contains the first attribute value that is inconsistent with a second attribute value of the attribute collected from the second data source;

    or a third type of data quality issue wherein the attribute collected from the first data source contains the first attribute that is untranslated to a target language;

    causing a graphical user interface to be output, the graphical user interface visually distinguishing the attribute associated with the data quality issue from other attributes defined for the object that are not associated with a data quality issue and the graphical user interface providing an option to resolve the data quality issue, wherein the graphical user interface visually distinguishes the attribute from other attributes by presenting the attribute as a first graphical element having a first color and presenting at least one of the other attributes as a second graphical element having a second color that is different than the first color;

    receiving, based at least in part on a user selection of the option, an instruction to resolve the data quality issue; and

    taking an action to resolve the data quality issue based on the instruction, wherein the action includes one of:

    copying the first attribute value to the attribute collected from the second data source that contains the missing attribute value in an event the data quality issue is of the first type of data quality issue;

    or replacing the second attribute value with the first attribute value for the attribute collected from the second data source in an event the data quality issue is of the second type of data quality issue.
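
The comparison and resolution steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. The names `IssueType`, `Issue`, `find_issues`, `highlight`, and `resolve` are hypothetical, each data source is assumed to be a plain dictionary of attribute names to values, the set of untranslated attributes is assumed to be supplied by the caller because the claim does not say how translation status is determined, and the two colors are placeholders for the "first color" and "second color" of the claim.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class IssueType(Enum):
    MISSING = "missing"            # first type: present in source 1, missing in source 2
    INCONSISTENT = "inconsistent"  # second type: the two sources hold different values
    UNTRANSLATED = "untranslated"  # third type: value not translated to the target language


@dataclass
class Issue:
    attribute: str
    issue_type: IssueType
    first_value: str
    second_value: Optional[str]


def find_issues(first, second, untranslated=frozenset()):
    """Compare one object's attributes across two sources (hypothetical helper).

    `untranslated` is an assumed, caller-supplied set of attribute names
    whose value in the first source is not in the target language.
    """
    issues = []
    for name, first_value in first.items():
        if first_value in (None, ""):
            continue  # the claim's issue types all start from a value in source 1
        second_value = second.get(name)
        if second_value in (None, ""):
            issues.append(Issue(name, IssueType.MISSING, first_value, None))
        elif second_value != first_value:
            issues.append(Issue(name, IssueType.INCONSISTENT, first_value, second_value))
        elif name in untranslated:
            issues.append(Issue(name, IssueType.UNTRANSLATED, first_value, second_value))
    return issues


def highlight(first, issues, issue_color="red", ok_color="green"):
    """Map each attribute to a display color; the color names are assumptions."""
    flagged = {issue.attribute for issue in issues}
    return {name: issue_color if name in flagged else ok_color for name in first}


def resolve(issue, second):
    """Apply the claimed action to the second source; return True if it changed."""
    if issue.issue_type in (IssueType.MISSING, IssueType.INCONSISTENT):
        # Copy (missing) or replace (inconsistent) using the first source's value.
        second[issue.attribute] = issue.first_value
        return True
    return False  # the claim recites no automatic action for the third type
```

In this sketch a caller would run `find_issues` on the two dictionaries, pass the result to `highlight` to decide how each attribute is drawn, and call `resolve` only after the user selects the option for a given issue.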
