Optimal Random Sampling from Distributed Streams Revisited

03/28/2019
by Srikanta Tirthapura, et al.

We give an improved algorithm for drawing a random sample from a large data stream when the input elements are distributed across multiple sites that communicate via a central coordinator. At any point in time, the set of elements held by the coordinator represents a uniform random sample from the set of all elements observed so far. Compared with prior work, our algorithm asymptotically improves both the total number of messages sent in the system and the computation required of the coordinator. We also present a matching lower bound, showing that, with large probability, our protocol sends the optimal number of messages up to a constant factor. As a byproduct, we obtain an improved algorithm for finding the heavy hitters across multiple distributed sites.
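To make the setting concrete, here is a minimal sketch of one well-known way to maintain such a sample. It is not necessarily the protocol of this paper. It assumes each element receives an independent uniform random tag, the coordinator keeps the elements with the smallest tags, and each site forwards an element only when its tag falls below the last threshold the site heard from the coordinator. The names `Coordinator`, `Site`, and `run_demo` are hypothetical, and direct method calls stand in for network messages.

```python
import heapq
import random


class Coordinator:
    """Keeps the sample_size elements with the smallest random tags seen so far."""

    def __init__(self, sample_size):
        self.sample_size = sample_size
        self._heap = []  # max-heap on tag, stored as (-tag, seq, element)
        self._seq = 0    # tie-breaker so elements are never compared directly

    def threshold(self):
        # While the sample is not full, every element must be accepted.
        if len(self._heap) < self.sample_size:
            return 1.0
        return -self._heap[0][0]

    def receive(self, tag, element):
        """Handle one message from a site and reply with the current threshold."""
        self._seq += 1
        entry = (-tag, self._seq, element)
        if len(self._heap) < self.sample_size:
            heapq.heappush(self._heap, entry)
        elif tag < self.threshold():
            heapq.heapreplace(self._heap, entry)  # evict the largest tag
        return self.threshold()

    def sample(self):
        return [element for _, _, element in self._heap]


class Site:
    """Observes a local stream and forwards only promising elements."""

    def __init__(self, coordinator, rng):
        self.coordinator = coordinator
        self.rng = rng
        self.local_threshold = 1.0  # possibly stale, never below the true one
        self.messages_sent = 0

    def observe(self, element):
        tag = self.rng.random()  # uniform in [0, 1)
        if tag < self.local_threshold:
            self.messages_sent += 1
            self.local_threshold = self.coordinator.receive(tag, element)


def run_demo(num_sites, num_elements, sample_size, seed):
    """Scatter elements 0..num_elements-1 over the sites; return (sample, messages)."""
    rng = random.Random(seed)
    coordinator = Coordinator(sample_size)
    sites = [Site(coordinator, rng) for _ in range(num_sites)]
    for element in range(num_elements):
        sites[rng.randrange(num_sites)].observe(element)
    total_messages = sum(site.messages_sent for site in sites)
    return coordinator.sample(), total_messages


if __name__ == "__main__":
    sample, messages = run_demo(num_sites=4, num_elements=10_000, sample_size=10, seed=1)
    print(sorted(sample), messages)
```

Because a site's stale threshold is never smaller than the coordinator's current one, no element that belongs in the sample is ever withheld, so the coordinator always holds the elements with the smallest tags, which form a uniform sample without replacement. The saving in messages comes from sites staying silent for elements whose tags are too large to matter.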


