System and method for benchmarking correlated stream processing systems
2 Assignments
0 Petitions
Accused Products
Abstract
A system, method, and computer program product for benchmarking a stream processing system are disclosed. The method comprises generating a plurality of correlated test streams. A semantically related data set is embedded within each of the test streams in the plurality of correlated test streams. The plurality of correlated test streams is provided to at least one stream processing system. A summary is generated for each of the semantically related embedded data sets. A common identifier, which is transparent to the system being tested, is embedded within each stream in the plurality of correlated test streams. The common identifier is extracted from the output data set generated by the stream processing system. At least one of the stored copies of the summaries and the common identifier are compared to an output data set including a set of zero or more correlation results generated by the stream processing system.
-
Citations
20 Claims
-
1. (canceled)
-
2. A method on an information processing system for benchmarking a stream processing system, the method comprising:
-
generating a plurality of correlated test streams;
embedding a semantically related data set within each of the test streams in the plurality of correlated test streams;
providing the plurality of correlated test streams to at least one stream processing system, whereby the stream processing system produces an output data set including a set of zero or more correlation results;
generating a summary for each of the semantically related embedded data sets;
storing a copy of each summary in memory;
embedding a common identifier within each stream in the plurality of correlated test streams, wherein the common identifier is transparent to the at least one stream processing system so as not to affect the set of the correlation results, and wherein the common identifier uniquely identifies the plurality of correlated test streams;
extracting the common identifier from the output data set generated by the stream processing system; and
comparing at least one of the common identifier and the stored copies of the summaries to the output data set generated by the stream processing system. - View Dependent Claims (3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. (canceled)
-
14. An information processing system for benchmarking a stream processing system, the information processing system comprising:
-
a test stream generator for generating a plurality of correlated test streams wherein each test stream in the plurality of correlated test streams includes a semantically related embedded data set;
a transmitter for providing the plurality of correlated test streams to at least one stream processing system;
a summary generator for generating a summary of each semantically related embedded data set;
a memory for storing a copy of each summary;
a common identifier generator for generating a common identifier associated with each test stream in the plurality of correlated test streams and for embedding the common identifier within each stream in the plurality of correlated test streams, wherein the common identifier is transparent to the at least one stream processing system so as not to affect the set of the correlation results, and wherein the common identifier uniquely identifies the plurality of correlated test streams;
an extractor for extracting the common identifier from the output data set generated by the stream processing system; and
a comparator for comparing at least one of the copies of the summaries of the semantically related embedded data and the common identifier to an output data set including a set of zero or more correlation results generated by the stream processing system. wherein the comparator compares the stored copy of each summary to the common identifiers extracted from an output data set generated by the stream processing system. - View Dependent Claims (15)
-
-
16. (canceled)
-
17. A computer readable storage medium for benchmarking a stream processing system, the computer readable storage medium comprising instructions for:
-
generating a plurality of correlated test streams;
embedding a semantically related data set within each of the test streams in the plurality of correlated test streams;
providing the plurality of correlated test streams to at least one stream processing system, whereby the stream processing system produces an output data set including a set of zero or more correlation results;
generating a summary for each of the semantically related embedded data sets;
storing a copy of each summary in memory;
embedding a common identifier within each stream in the plurality of correlated test streams, wherein the common identifier is transparent to the at least one stream processing system so as not to affect the set of the correlation results, and wherein the common identifier uniquely identifies the plurality of correlated test streams;
extracting the common identifier from the output data set generated by the stream processing system; and
comparing at least one of the common identifier and the stored copies of the summaries to the output data set generated by the stream processing system. - View Dependent Claims (18, 19, 20)
-
Specification