Method and system for including data quality in data streams
First Claim
1. A computer-implemented system including computer-executable code recorded on a computer-readable medium comprising:
- a data stream manager including;
a data manager configured via the computer-executable code to obtain a first group of data items and a second group of data items, each data item including one or more data attribute values;
a data quality manager configured via the computer-executable code to determine a first group of data quality items and a second group of data quality items, each data quality item including one or more data quality attribute values associated with one of the data items of the first group of data items or the second group of data items,an aggregation manager configured via the computer-executable code to determine a first aggregated data quality value based on the first group of data quality items and a second aggregated data quality value based on the second group of data quality items, anda stream interval manager configured via the computer-executable code to output a first data stream interval including the first group of data items and the first aggregated data quality value and to output a second data stream interval including the second group of data items and the second aggregated data quality value, wherein the stream interval manager is configured to;
output a first jumping window including the first data stream interval including the first group of data items and the first aggregated data quality value, andoutput a second jumping window including the second data stream interval including the second group of data items and the second aggregated data quality value.
2 Assignments
0 Petitions
Accused Products
Abstract
A method and system are described for including data quality in data streams. An example method may include obtaining a first group of data items, each data item including one or more data attribute values. A first group of data quality items may be determined, each data quality item including one or more data quality attribute values associated with one of the data items of the first group. A first aggregated data quality value may be determined based on the first group of data quality items. A first data stream interval including the first group of data items and the first aggregated data quality value may be output.
-
Citations
20 Claims
-
1. A computer-implemented system including computer-executable code recorded on a computer-readable medium comprising:
-
a data stream manager including; a data manager configured via the computer-executable code to obtain a first group of data items and a second group of data items, each data item including one or more data attribute values; a data quality manager configured via the computer-executable code to determine a first group of data quality items and a second group of data quality items, each data quality item including one or more data quality attribute values associated with one of the data items of the first group of data items or the second group of data items, an aggregation manager configured via the computer-executable code to determine a first aggregated data quality value based on the first group of data quality items and a second aggregated data quality value based on the second group of data quality items, and a stream interval manager configured via the computer-executable code to output a first data stream interval including the first group of data items and the first aggregated data quality value and to output a second data stream interval including the second group of data items and the second aggregated data quality value, wherein the stream interval manager is configured to; output a first jumping window including the first data stream interval including the first group of data items and the first aggregated data quality value, and output a second jumping window including the second data stream interval including the second group of data items and the second aggregated data quality value. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A method comprising:
-
obtaining a first group of data items and a second group of data items, each data item including one or more data attribute values; determining a first group of data quality items and a second group of data quality items, each data quality item including one or more data quality attribute values associated with one of the data items of the first group of data items or the second group of data items; determining a first aggregated data quality value based on the first group of data quality items and a second aggregated data quality value based on the second group of data quality items; and outputting a first data stream interval including the first group of data items and the first aggregated data quality value and a second data stream interval including the second group of data items and the second aggregated data quality value, wherein outputting comprises; outputting a first jumping window including the first data stream interval including the first group of data items and the first aggregated data quality value, and outputting a second jumping window including the second data stream interval including the second group of data items and the second aggregated data quality value. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A computer program product being tangibly embodied on a computer-readable medium and being configured to cause a data processing apparatus to:
-
obtain a first group of data items and a second group of data items, each data item including one or more data attribute values; determine a first group of data quality items and a second group of data quality items, each data quality item including one or more data quality attribute values associated with one of the data items of the first group of data items or the second group of data items; determine a first aggregated data quality value based on the first group of data quality items and a second aggregated data quality value based on the second group of data quality items; and output a first data stream interval including the first group of data items and the first aggregated data quality value and a second data stream interval including the second group of data items and the second aggregated data quality value, wherein the data processing apparatus is further caused to; output a first jumping window including the first data stream interval including the first group of data items and the first aggregated data quality value, and output a second jumping window including the second data stream interval including the second group of data items and the second aggregated data quality value. - View Dependent Claims (20)
-
Specification