SYSTEM AND METHOD FOR ANALYZING STREAMS AND COUNTING STREAM ITEMS ON MULTI-CORE PROCESSORS
First Claim
1. A method for parallel stream item counting, comprising:
- partitioning a data stream into portions;
assigning the portions to a plurality of processing cores; and
executing a sequential kernel on a computer useable medium including a computer readable program, wherein the computer readable program when executed on the processing core causes the processing core to compute a local count for items in an assigned portion of the data stream for that processing core.
1 Assignment
0 Petitions
Accused Products
Abstract
Systems and methods for parallel stream item counting are disclosed. A data stream is partitioned into portions and the portions are assigned to a plurality of processing cores. A sequential kernel is executed at each processing core to compute a local count for items in an assigned portion of the data stream for that processing core. The counts are aggregated for all the processing cores to determine a final count for the items in the data stream. A frequency-aware counting method (FCM) for data streams includes dynamically capturing relative frequency phases of items from a data stream and placing the items in a sketch structure using a plurality of hash functions where a number of hash functions is based on the frequency phase of the item. A zero-frequency table is provided to reduce errors due to absent items.
85 Citations
20 Claims
-
1. A method for parallel stream item counting, comprising:
-
partitioning a data stream into portions; assigning the portions to a plurality of processing cores; and executing a sequential kernel on a computer useable medium including a computer readable program, wherein the computer readable program when executed on the processing core causes the processing core to compute a local count for items in an assigned portion of the data stream for that processing core. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A frequency-aware counting method (FCM) for data streams, comprising:
-
dynamically capturing relative frequency phases of items from a data stream; placing the items in a sketch structure using a plurality of hash functions where a number of hash functions is based on the frequency phase of the item; and providing a zero-frequency table to reduce errors due to absent items. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17)
-
-
18. A multiprocessor, comprising:
-
a plurality of processing cores, which include a coordinating processor core, and at least one other processing core managed by the coordinating processing core; the plurality of processing cores being configured to receive and partition a data stream into portions to permit parallel counting for items in the data stream; a sequencing kernel executed by each of the core processors to implement a frequency-aware counting method, the frequency aware counting method including; a sketch structure configured to store item counts using a plurality of hash functions where a number of hash functions is based on the frequency phase of the item; and a zero-frequency table configured to reduce errors due to absent items using all of the hash functions. - View Dependent Claims (19, 20)
-
Specification