Method for generating a distributed stream processing application
First Claim
Patent Images
1. A method for generating a distributed stream processing application, comprising the steps of:
- obtaining a declarative description of one or more data stream processing tasks from a graph of operators, wherein the declarative description expresses at least one stream processing task;
generating one or more execution units from the declarative description of one or more data stream processing tasks, wherein the one or more execution units are deployable across one or more distributed computing nodes, and comprise a distributed data stream processing application binary;
generating one or more coarse granularity containers that encompass one or more fine granularity stream processing operators;
using the one or more coarse granularity containers to generate a distributed stream processing application;
generating one or more containers that encompass a combination of one or more stream processing operators, wherein said generating comprises;
coalescing a combination of one or more operators into one or more containers, wherein said coalescing comprises;
using an optimizer to automatically decide which of the one or more operators are to be coalesced into which of the one or more containers; and
using user input to manually group the one or more operators into the one or more containers; and
fusing an outflow of an operator into an inflow of a downstream operator within a same container.
1 Assignment
0 Petitions
Accused Products
Abstract
Techniques for generating a distributed stream processing application are provided. The techniques include obtaining a declarative description of one or more data stream processing tasks, wherein the declarative description expresses at least one stream processing task, and generating one or more execution units from the declarative description of one or more data stream processing tasks, wherein the one or more execution units are deployable across one or more distributed computing nodes, and comprise a distributed data stream processing application.
32 Citations
22 Claims
-
1. A method for generating a distributed stream processing application, comprising the steps of:
-
obtaining a declarative description of one or more data stream processing tasks from a graph of operators, wherein the declarative description expresses at least one stream processing task; generating one or more execution units from the declarative description of one or more data stream processing tasks, wherein the one or more execution units are deployable across one or more distributed computing nodes, and comprise a distributed data stream processing application binary; generating one or more coarse granularity containers that encompass one or more fine granularity stream processing operators; using the one or more coarse granularity containers to generate a distributed stream processing application; generating one or more containers that encompass a combination of one or more stream processing operators, wherein said generating comprises; coalescing a combination of one or more operators into one or more containers, wherein said coalescing comprises; using an optimizer to automatically decide which of the one or more operators are to be coalesced into which of the one or more containers; and using user input to manually group the one or more operators into the one or more containers; and fusing an outflow of an operator into an inflow of a downstream operator within a same container. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A computer program product comprising a tangible computer readable recordable storage medium having computer readable program code for generating a distributed stream processing application, said computer program product including:
-
computer readable program code for obtaining a declarative description of one or more data stream processing tasks from a graph of operators, wherein the declarative description expresses at least one stream processing task; computer readable program code for generating one or more execution units from the declarative description of one or more data stream processing tasks, wherein the one or more execution units are deployable across one or more distributed computing nodes, and comprise a distributed data stream processing application binary; computer readable program code for generating one or more coarse granularity containers that encompass one or more fine granularity stream processing operators; computer readable program code for using the one or more coarse granularity containers to generate a distributed stream processing application; computer readable program code for generating one or more containers that encompass a combination of one or more stream processing operators, wherein said generating comprises; coalescing a combination of one or more operators into one or more containers, wherein said coalescing comprises; using an optimizer to automatically decide which of the one or more operators are to be coalesced into which of the one or more containers; and using user input to manually group the one or more operators into the one or more containers; and computer readable program code for fusing an outflow of an operator into an inflow of a downstream operator within a same container. - View Dependent Claims (10, 11, 12, 13, 14, 15)
-
-
16. An apparatus for generating a distributed stream processing application, comprising:
-
a memory; and at least one processor coupled to said memory and operative to; obtain a declarative description of one or more data stream processing tasks from a graph of operators, wherein the declarative description expresses at least one stream processing task; generate one or more execution units from the declarative description of one or more data stream processing tasks, wherein the one or more execution units are deployable across one or more distributed computing nodes, and comprise a distributed data stream processing application binary; generate one or more coarse granularity containers that encompass one or more fine granularity stream processing operators; use the one or more coarse granularity containers to generate a distributed stream processing application; generate one or more containers that encompass a combination of one or more stream processing operators, wherein said generating comprises; coalescing a combination of one or more operators into one or more containers, wherein said coalescing comprises; using an optimizer to automatically decide which of the one or more operators are to be coalesced into which of the one or more containers; and using user input to manually group the one or more operators into the one or more containers; and fuse an outflow of an operator into an inflow of a downstream operator within a same container. - View Dependent Claims (17, 18, 19, 20, 21, 22)
-
Specification