Method and system for managing data quality
First Claim
1. A computer-implemented system including computer-executable code recorded on a computer-readable medium comprising:
- a data stream manager including:
a data receiver configured via the computer-executable code to obtain a first data stream interval including a first group of data items and a first aggregated data quality value associated with a quality of obtaining the first group of data items and a second data stream interval including a second group of data items and a second aggregated data quality value associated with a quality of obtaining the second group of data items, each data item including one or more data attribute values, each data quality item including one or more data quality attribute values associated with one of the data items of the first group of data items or the second group of data items;
a data selection manager configured via the computer-executable code to select the first aggregated data quality value and a first indicator associating the first aggregated data quality value with the first group of data items, and to select the first group of data items and to select the second aggregated data quality value and a second indicator associating the second aggregated data quality value with the second group of data items and to select the second group of data items;
a data storage manager configured via the computer-executable code to store the first group of data items and the first indicator and the second group of data items and the second indicator in a user table associated with a relational database;
a system table manager configured via the computer-executable code to determine a data quality table associated with the user table based on an entry in a system table; and
a data quality storage manager configured via the computer-executable code to store the first aggregated data quality value and the first indicator and the second aggregated data quality value and the second indicator in the determined data quality table, wherein the data receiver is configured to:
obtain a first jumping window including the first data stream interval including the first group of data items and the first aggregated data quality value, and
obtain a second jumping window including the second data stream interval including the second group of data items and the second aggregated data quality value.
Abstract
A method and system are described for managing data quality. An example method may include obtaining a first data stream interval including a first group of data items and a first aggregated data quality value associated with a quality of obtaining the first group, each data item including data attribute values, each data quality item including data quality attribute values associated with one of the data items. The first aggregated data quality value, a first indicator associating the first aggregated data quality value with the first group, and the first group may be selected. The first group and the first indicator may be stored in a user table of a database. A data quality table associated with the user table may be determined based on an entry in a system table. The first aggregated data quality value and the first indicator may be stored in the data quality table.
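The storage flow in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: it assumes SQLite as the relational database, and the table names (`system_table`, `readings`, `readings_dq`), the column names, and the use of an integer `interval_id` as the indicator are all hypothetical choices made for the example.

```python
import sqlite3


def store_interval(conn, user_table, interval_id, items, aggregated_quality):
    """Store one data stream interval and its aggregated data quality value.

    interval_id plays the role of the "indicator" that ties the group of
    data items to its aggregated data quality value (an assumption).
    """
    # Determine the data quality table from an entry in the system table.
    row = conn.execute(
        "SELECT dq_table FROM system_table WHERE user_table = ?", (user_table,)
    ).fetchone()
    if row is None:
        raise LookupError(f"no data quality table registered for {user_table}")
    dq_table = row[0]

    # Table names cannot be bound as SQL parameters, so they are interpolated;
    # in this sketch they are assumed to come from trusted configuration.
    conn.executemany(
        f"INSERT INTO {user_table} (interval_id, value) VALUES (?, ?)",
        [(interval_id, value) for value in items],
    )
    conn.execute(
        f"INSERT INTO {dq_table} (interval_id, quality) VALUES (?, ?)",
        (interval_id, aggregated_quality),
    )
    conn.commit()


def demo():
    """Build an in-memory database and store two intervals (hypothetical data)."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE system_table (user_table TEXT PRIMARY KEY, dq_table TEXT);
        CREATE TABLE readings (interval_id INTEGER, value REAL);
        CREATE TABLE readings_dq (interval_id INTEGER, quality REAL);
        INSERT INTO system_table VALUES ('readings', 'readings_dq');
        """
    )
    store_interval(conn, "readings", 1, [20.5, 21.0, 20.8], 0.95)
    store_interval(conn, "readings", 2, [22.1, 22.4], 0.80)
    return conn
```

Keeping the quality values in a separate table, linked only by the indicator, means the user table holds one row per data item while the data quality table holds one row per interval.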
24 Claims
1. A computer-implemented system including computer-executable code recorded on a computer-readable medium comprising:
- a data stream manager including:
a data receiver configured via the computer-executable code to obtain a first data stream interval including a first group of data items and a first aggregated data quality value associated with a quality of obtaining the first group of data items and a second data stream interval including a second group of data items and a second aggregated data quality value associated with a quality of obtaining the second group of data items, each data item including one or more data attribute values, each data quality item including one or more data quality attribute values associated with one of the data items of the first group of data items or the second group of data items;
a data selection manager configured via the computer-executable code to select the first aggregated data quality value and a first indicator associating the first aggregated data quality value with the first group of data items, and to select the first group of data items and to select the second aggregated data quality value and a second indicator associating the second aggregated data quality value with the second group of data items and to select the second group of data items;
a data storage manager configured via the computer-executable code to store the first group of data items and the first indicator and the second group of data items and the second indicator in a user table associated with a relational database;
a system table manager configured via the computer-executable code to determine a data quality table associated with the user table based on an entry in a system table; and
a data quality storage manager configured via the computer-executable code to store the first aggregated data quality value and the first indicator and the second aggregated data quality value and the second indicator in the determined data quality table, wherein the data receiver is configured to:
obtain a first jumping window including the first data stream interval including the first group of data items and the first aggregated data quality value, and
obtain a second jumping window including the second data stream interval including the second group of data items and the second aggregated data quality value.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
10. A method comprising:
obtaining a first data stream interval including a first group of data items and a first aggregated data quality value associated with a quality of obtaining the first group of data items and a second data stream interval including a second group of data items and a second aggregated data quality value associated with a quality of obtaining the second group of data items, each data item including one or more data attribute values, each data quality item including one or more data quality attribute values associated with one of the data items of the first group of data items or the second group of data items;
selecting the first aggregated data quality value and a first indicator associating the first aggregated data quality value with the first group of data items and the second aggregated data quality value and a second indicator associating the second aggregated data quality value with the second group of data items;
selecting the first group of data items;
selecting the second group of data items;
storing the first group of data items and the first indicator and the second group of data items and the second indicator in a user table associated with a relational database;
determining a data quality table associated with the user table based on an entry in a system table; and
storing the first aggregated data quality value and the first indicator and the second aggregated data quality value and the second indicator in the determined data quality table, wherein the obtaining comprises:
obtaining a first jumping window including the first data stream interval including the first group of data items and the first aggregated data quality value, and
obtaining a second jumping window including the second data stream interval including the second group of data items and the second aggregated data quality value.
Dependent claims: 11, 12, 13, 14, 15, 16, 17, 18, 19, 20
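The jumping windows recited in the obtaining step can be illustrated with a short sketch. The claim text here does not define the term, so this example assumes a jumping window is a fixed-size, non-overlapping run of consecutive stream items, and that the aggregated data quality value is the mean of per-item quality values. Both are assumptions made for illustration, and the function name `jumping_windows` is hypothetical.

```python
from statistics import mean


def jumping_windows(stream, size):
    """Yield (window_index, items, aggregated_quality) per jumping window.

    stream is an iterable of (value, quality) pairs. Windows are consecutive
    and non-overlapping (assumed meaning of "jumping window"); the aggregate
    is the mean of the per-item quality values (also an assumption).
    """
    window = []
    index = 0
    for value, quality in stream:
        window.append((value, quality))
        if len(window) == size:
            yield index, [v for v, _ in window], mean(q for _, q in window)
            window = []
            index += 1
    if window:
        # Emit the final, possibly shorter, window.
        yield index, [v for v, _ in window], mean(q for _, q in window)
```

For example, `list(jumping_windows([(1, 1.0), (2, 0.5), (3, 1.0)], 2))` produces one full window of two items and one trailing window with the single remaining item.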
21. A method comprising:
requesting a first group of data items and a second group of data items from a relational database;
requesting a first aggregated data quality value associated with a quality of obtaining the first group of data items and a first indicator associating the first aggregated data quality value with the first group of data items and a second aggregated data quality value associated with a quality of obtaining the second group of data items and a second indicator associating the second aggregated data quality value with the second group of data items;
retrieving the first group of data items and the second group of data items from a user table associated with the relational database;
determining a data quality table associated with the user table based on an entry in a system table;
retrieving the first aggregated data quality value and the first indicator and the second aggregated data quality value and the second indicator from the determined data quality table;
obtaining a first jumping window including a first data stream interval including the first group of data items and the first aggregated data quality value; and
obtaining a second jumping window including a second data stream interval including the second group of data items and the second aggregated data quality value.
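The retrieval steps of claim 21 might look like the following sketch. It assumes SQLite, hypothetical table and column names (`system_table`, `readings`, `readings_dq`, `interval_id`), and it combines the two retrievals into a single join on the indicator. The claim recites them as separate steps, so the join is a simplification chosen for brevity.

```python
import sqlite3


def build_example_db():
    """Create an in-memory database with hypothetical tables and rows."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE system_table (user_table TEXT PRIMARY KEY, dq_table TEXT);
        CREATE TABLE readings (interval_id INTEGER, value REAL);
        CREATE TABLE readings_dq (interval_id INTEGER, quality REAL);
        INSERT INTO system_table VALUES ('readings', 'readings_dq');
        INSERT INTO readings VALUES (1, 20.5), (1, 21.0), (2, 22.1);
        INSERT INTO readings_dq VALUES (1, 0.95), (2, 0.8);
        """
    )
    return conn


def fetch_with_quality(conn, user_table):
    """Return (interval_id, value, quality) rows for a user table."""
    # Determine the data quality table from an entry in the system table.
    row = conn.execute(
        "SELECT dq_table FROM system_table WHERE user_table = ?", (user_table,)
    ).fetchone()
    if row is None:
        raise LookupError(f"no data quality table registered for {user_table}")
    dq_table = row[0]

    # Table names are interpolated because they cannot be bound as parameters;
    # they are assumed to come from trusted configuration in this sketch.
    query = (
        f"SELECT u.interval_id, u.value, q.quality "
        f"FROM {user_table} AS u "
        f"JOIN {dq_table} AS q ON u.interval_id = q.interval_id "
        f"ORDER BY u.interval_id, u.rowid"
    )
    return conn.execute(query).fetchall()
```

Each returned row pairs a data item with the aggregated quality value of the interval it belongs to, so a caller can regroup rows by `interval_id` to rebuild the per-window view.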
22. A computer program product being tangibly embodied on a computer-readable medium and being configured to cause a data processing apparatus to:
obtain a first data stream interval including a first group of data items and a first aggregated data quality value associated with a quality of obtaining the first group of data items and a second data stream interval including a second group of data items and a second aggregated data quality value associated with a quality of obtaining the second group of data items, each data item including one or more data attribute values, each data quality item including one or more data quality attribute values associated with one of the data items of the first group of data items or the second group of data items;
select the first aggregated data quality value and a first indicator associating the first aggregated data quality value with the first group of data items and select the second aggregated data quality value and a second indicator associating the second aggregated data quality value with the second group of data items;
select the first group of data items;
select the second group of data items;
store the first group of data items and the first indicator and the second group of data items and the second indicator in a user table associated with a relational database;
determine a data quality table associated with the user table based on an entry in a system table; and
store the first aggregated data quality value and the first indicator and the second aggregated data quality value and the second indicator in the determined data quality table, wherein the data processing apparatus is caused to:
obtain a first jumping window including the first data stream interval including the first group of data items and the first aggregated data quality value, and
obtain a second jumping window including the second data stream interval including the second group of data items and the second aggregated data quality value.
Dependent claims: 23, 24
Specification