QUERY-AWARE SAMPLING OF DATA STREAMS
Abstract
A system, method, and computer-readable medium provide for assigning sampling methods to each input stream for arbitrary query sets in a data stream management system. The method embodiment comprises splitting all query nodes in a query directed acyclic graph (DAG) having multiple parent nodes into sets of independent nodes having a single parent, computing a grouping set for every node in each set of independent nodes, reconciling each parent node with each child node in each set of independent nodes, reconciling between multiple child nodes that share a parent node, and generating a final grouping set for at least one node describing how to sample an input stream for that node.
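The five steps named in the abstract can be sketched in code. The following is a minimal illustration only, not the patented procedure. It assumes that a grouping set is a set of attribute names and that "reconciling" two grouping sets means intersecting them; neither operation is defined in the text above. The function name `assign_sampling`, the `parents` and `group_by` inputs, and the query and attribute names are all hypothetical.

```python
from collections import defaultdict


def assign_sampling(parents, group_by):
    """Illustrative sketch of the five claimed steps.

    parents:  dict mapping each query node to a list of its parent nodes.
    group_by: dict mapping each query node to its grouping attributes.
    ASSUMPTION: reconciling two grouping sets = set intersection.
    """
    # Step 1: split every multi-parent node into independent copies,
    # one (node, parent) pair per parent; roots get the parent None.
    copies = [(n, p) for n, ps in parents.items() for p in (ps or [None])]

    # Step 2: compute a grouping set for every independent copy.
    gs = {c: frozenset(group_by[c[0]]) for c in copies}

    # Step 3: reconcile each child copy with its single parent.
    for n, p in copies:
        if p is not None:
            gs[(n, p)] &= frozenset(group_by[p])

    # Step 4: reconcile between child copies that share a parent.
    by_parent = defaultdict(list)
    for n, p in copies:
        if p is not None:
            by_parent[p].append((n, p))
    for kids in by_parent.values():
        common = frozenset.intersection(*(gs[k] for k in kids))
        for k in kids:
            gs[k] = common

    # Step 5: merge the copies of each original node into one final
    # grouping set that says which attributes to sample on.
    final = {}
    for n, p in copies:
        final[n] = final[n] & gs[(n, p)] if n in final else gs[(n, p)]
    return final


# Hypothetical example: one low-level query feeding two higher-level queries.
parents = {"q_top1": [], "q_top2": [], "q_low": ["q_top1", "q_top2"]}
group_by = {
    "q_top1": {"srcIP", "destIP"},
    "q_top2": {"srcIP", "port"},
    "q_low": {"srcIP", "destIP", "port"},
}
final = assign_sampling(parents, group_by)
print(sorted(final["q_low"]))  # ['srcIP']
```

Under these assumptions, the node with two parents ends up with the grouping set `{srcIP}`, the only attribute compatible with both of its parents, so its input stream would be sampled on `srcIP`.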
18 Claims
1. A method of assigning sampling methods to each input stream for arbitrary query sets in a data stream management system, the method comprising:
splitting all query nodes in a query directed acyclic graph (DAG) having multiple parent nodes into sets of independent nodes having a single parent;
computing a grouping set for every node in each set of independent nodes;
reconciling each parent node with each child node in each set of independent nodes;
reconciling between multiple child nodes that share a parent node; and
generating a final grouping set for at least one node describing how to sample an input stream for that node.
Dependent claims: 2, 3, 4, 5, 6
7. A system for assigning sampling methods to each input stream for arbitrary query sets in a data stream management system, the system comprising:
a module configured to split all query nodes in a query directed acyclic graph (DAG) having multiple parent nodes into sets of independent nodes having a single parent;
a module configured to compute a grouping set for every node in each set of independent nodes;
a module configured to reconcile each parent node with each child node in each set of independent nodes;
a module configured to reconcile between multiple child nodes that share a parent node; and
a module configured to generate a final grouping set for at least one node describing how to sample an input stream for that node.
Dependent claims: 8, 9, 10, 11, 12
13. A computer readable medium storing instructions for controlling a computing device to assign sampling methods to each input stream for arbitrary query sets in a data stream management system, the instructions comprising:
splitting all query nodes in a query directed acyclic graph (DAG) having multiple parent nodes into sets of independent nodes having a single parent;
computing a grouping set for every node in each set of independent nodes;
reconciling each parent node with each child node in each set of independent nodes;
reconciling between multiple child nodes that share a parent node; and
generating a final grouping set for at least one node describing how to sample an input stream for that node.
Dependent claims: 14, 15, 16, 17, 18
Specification