Method and Apparatus for Maintaining and Processing Provenance Data in Data Stream Processing System
First Claim
1. A method for processing data associated with a data stream received by a data stream processing system, wherein the system comprises a plurality of processing elements, the method comprising the steps of:
- maintaining a portion of data associated with the data stream, wherein the maintained data comprises inputs to each processing element that contributed to an output of each processing element;
in response to an alert generated by one of the processing elements, triggering a scheduler to determine when a pre-calculation of a prospective query related to the alert should be executed; and
in response to the scheduler, using at least a portion of the maintained data to determine a set of data that contributed to the alert such that the alert-contributing set of data can be used to respond to the prospective query upon arrival thereof.
5 Assignments
0 Petitions
Accused Products
Abstract
Techniques are disclosed for maintaining and processing provenance data in such data stream processing systems. For example, a method for processing data associated with a data stream received by a data stream processing system, wherein the system comprises a plurality of processing elements, comprising the following steps. A portion of data associated with the data stream is maintained. The maintained data comprises inputs to each processing element that contributed to an output of each processing element. In response to an alert generated by one of the processing elements, a scheduler is triggered to determine when a pre-calculation of a prospective query related to the alert should be executed. In response to the scheduler, at least a portion of the maintained data is used to determine a set of data that contributed to the alert such that the alert-contributing set of data can be used to respond to the prospective query upon arrival thereof.
15 Citations
20 Claims
-
1. A method for processing data associated with a data stream received by a data stream processing system, wherein the system comprises a plurality of processing elements, the method comprising the steps of:
-
maintaining a portion of data associated with the data stream, wherein the maintained data comprises inputs to each processing element that contributed to an output of each processing element; in response to an alert generated by one of the processing elements, triggering a scheduler to determine when a pre-calculation of a prospective query related to the alert should be executed; and in response to the scheduler, using at least a portion of the maintained data to determine a set of data that contributed to the alert such that the alert-contributing set of data can be used to respond to the prospective query upon arrival thereof. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. Apparatus for processing data associated with a data stream received by a data stream processing system, wherein the system comprises a plurality of processing elements, the apparatus comprising:
-
a memory; and a processor coupled to the memory and configured to;
maintain a portion of data associated with the data stream, wherein the maintained data comprises inputs to each processing element that contributed to an output of each processing element;
in response to an alert generated by one of the processing elements, trigger a scheduler to determine when a pre-calculation of a prospective query related to the alert should be executed; and
in response to the scheduler, use at least a portion of the maintained data to determine a set of data that contributed to the alert such that the alert-contributing set of data can be used to respond to the prospective query upon arrival thereof. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. An article of manufacture for processing data associated with a data stream received by a data stream processing system, wherein the system comprises a plurality of processing elements, the article comprising a computer readable storage medium having one or more programs embodied therewith wherein the one or more programs, when executed by a computer, perform steps of:
-
maintaining a portion of data associated with the data stream, wherein the maintained data comprises inputs to each processing element that contributed to an output of each processing element; in response to an alert generated by one of the processing elements, triggering a scheduler to determine when a pre-calculation of a prospective query related to the alert should be executed; and in response to the scheduler, using at least a portion of the maintained data to determine a set of data that contributed to the alert such that the alert-contributing set of data can be used to respond to the prospective query upon arrival thereof.
-
Specification