SYSTEM AND METHOD FOR MULTIPLE DISTINCT AGGREGATE QUERIES
First Claim
1. A data processor implemented method of executing multiple distinct aggregate type queries, comprising:
- providing at least one Counting Bloom Filter for each distinct column of an input data stream;
reviewing count values in the at least one Counting Bloom Filter for the existence of duplicates in each distinct column; and
if necessary, using a distinct hash operator to remove duplicates from each distinct column of the input data stream, thereby removing the need for replicating the input data stream and minimizing distinct hash operator processing.
4 Assignments
0 Petitions
Accused Products
Abstract
There is disclosed a system and method for executing multiple distinct aggregate queries. In an embodiment, the method comprises: providing at least one Counting Bloom Filter for each distinct column of an input data stream; reviewing count values in the at least one Counting Bloom Filter for the existence of duplicates in each distinct column; and if necessary, using a distinct hash operator to remove duplicates from each distinct column of the input data stream, thereby removing the need for replicating the input data stream and minimizing distinct hash operator processing. Also, the use of Counting Bloom Filters for monitoring data streams allow an early duplicate removal of the input stream of data, resulting in savings in computation time and memory resources.
42 Citations
24 Claims
-
1. A data processor implemented method of executing multiple distinct aggregate type queries, comprising:
-
providing at least one Counting Bloom Filter for each distinct column of an input data stream; reviewing count values in the at least one Counting Bloom Filter for the existence of duplicates in each distinct column; and if necessary, using a distinct hash operator to remove duplicates from each distinct column of the input data stream, thereby removing the need for replicating the input data stream and minimizing distinct hash operator processing. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A system for executing multiple distinct aggregate type queries, comprising:
-
at least one Counting Bloom Filter for each distinct column of an input data stream; means for reviewing count values in the at least one Counting Bloom Filter for the existence of duplicates in each distinct column; and a distinct hash operator for removing duplicates, if necessary, from each distinct column of the input data stream, thereby removing the need for replicating the input data stream and minimizing distinct hash operator processing. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. A data processor readable medium storing data processor code that when loaded into a data processing device adapts the device to perform a method of executing multiple distinct aggregate type queries, the data processor readable medium comprising:
-
code for providing at least one Counting Bloom Filter for each distinct column of an input data stream; code for reviewing count values in the at least one Counting Bloom Filter for the existence of duplicates in each distinct column; and code for using a distinct hash operator, if necessary, to remove duplicates from each distinct column of the input data stream, thereby removing the need for replicating the input data stream and minimizing distinct hash operator processing. - View Dependent Claims (18, 19, 20, 21, 22, 23, 24)
-
Specification