Random sampling from distributed streams
First Claim
1. A method for distributed sampling on a network with a plurality of sites and a coordinator, comprising:
- receiving at the coordinator a data element from a site of the plurality of sites, said data element having a weight randomly associated therewith deemed reportable by comparison at the site to a locally stored global value;
comparing the weight of the data element received with a global value stored at the coordinator; and
performing one of;
updating the global value stored at the coordinator to the weight of the data element received; and
communicating the global value stored at the coordinator back to the site of the plurality of sites.
3 Assignments
0 Petitions
Accused Products
Abstract
Described herein are methods, systems, apparatuses and products for random sampling from distributed streams. An aspect provides a method for distributed sampling on a network with a plurality of sites and a coordinator, including: receiving at the coordinator a data element from a site of the plurality of sites, said data element having a weight randomly associated therewith deemed reportable by comparison at the site to a locally stored global value; comparing the weight of the data element received with a global value stored at the coordinator; and performing one of: updating the global value stored at the coordinator to the weight of the data element received; and communicating the global value stored at the coordinator back to the site of the plurality of sites. Other embodiments are disclosed.
8 Citations
20 Claims
-
1. A method for distributed sampling on a network with a plurality of sites and a coordinator, comprising:
-
receiving at the coordinator a data element from a site of the plurality of sites, said data element having a weight randomly associated therewith deemed reportable by comparison at the site to a locally stored global value; comparing the weight of the data element received with a global value stored at the coordinator; and performing one of; updating the global value stored at the coordinator to the weight of the data element received; and communicating the global value stored at the coordinator back to the site of the plurality of sites. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A computer program product for distributed sampling on a network with a plurality of sites and a coordinator, comprising:
-
a computer readable storage medium having computer readable program code embodied therewith, the computer readable program code comprising; computer readable program code configured to receive, at the coordinator, a data element from a site of the plurality of sites, said data element having a weight randomly associated therewith deemed reportable by comparison at the site to a locally stored global value; computer readable program code configured to compare the weight of the data element received with a global value stored at the coordinator; and computer readable program code configured to perform one of; updating the global value stored at the coordinator to the weight of the data element received; and communicating the global value stored at the coordinator back to the site of the plurality of sites. - View Dependent Claims (14, 15, 16, 17, 18, 19)
-
-
20. A system comprising:
-
at least one processor; and a memory device operatively connected to the at least one processor; wherein, responsive to execution of program instructions accessible to the at least one processor, the at least one processor is configured to; receive at the coordinator a data element from a site of the plurality of sites, said data element having a weight randomly associated therewith deemed reportable by comparison at the site to a locally stored global value; compare the weight of the data element received with a global value stored at the coordinator; and perform one of; updating the global value stored at the coordinator to the weight of the data element received; and communicating the global value stored at the coordinator back to the site of the plurality of sites.
-
Specification