Order-independent stream query processing
First Claim
1. A system for order-independent stream query processing comprising:
- one or more continuous nodes comprising a memory and a processor, wherein each of the one or more continuous nodes are configured to;
partition received data into one or more partitions;
decompose a received query into one or more sub-queries that can operate on the one or more partitions to produce partial results for each partition;
receive an out-of-order datum, the out-of-order datum being too late to process for an original partition;
select a partition whose latest data is closest to the out-of-order datum;
process the out-of-order datum in the selected partition;
consolidate the partial results for each partition on a continuous basis to produce final results.
0 Assignments
0 Petitions
Accused Products
Abstract
In a system and method for order-independent stream query processing, one or more input streams of data are received, and the one or more input streams are analyzed to determine data which is older than an already emitted progress indicator. The data which is older than the already emitted progress indicator is partitioned into one or more partitions, and each of the one or more partitions are independently processed using out-of-order processing techniques. A query is received, rewritten and decomposed into one or more sub-queries that produce partial results for each of the one or more partitions, where each of the one or more sub-queries correspond to a partition. A view is also produced that consolidates the partial results for each partition. The partial results are consolidated at a consolidation time specified by the query to produce final results, and the final results are provided.
-
Citations
13 Claims
-
1. A system for order-independent stream query processing comprising:
-
one or more continuous nodes comprising a memory and a processor, wherein each of the one or more continuous nodes are configured to; partition received data into one or more partitions; decompose a received query into one or more sub-queries that can operate on the one or more partitions to produce partial results for each partition; receive an out-of-order datum, the out-of-order datum being too late to process for an original partition; select a partition whose latest data is closest to the out-of-order datum;
process the out-of-order datum in the selected partition;
consolidate the partial results for each partition on a continuous basis to produce final results. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
Specification