Streaming algorithms for the missing item finding problem

11/09/2022
by   Manuel Stoeckl, et al.
0

Many problems on data streams have been studied at two extremes of difficulty: either allowing randomized algorithms, in the static setting (where they should err with bounded probability on the worst case stream); or when only deterministic and infallible algorithms are required. Some recent works have considered the adversarial setting, in which a randomized streaming algorithm must succeed even on data streams provided by an adaptive adversary that can see the intermediate outputs of the algorithm. In order to better understand the differences between these models, we study a streaming task called "Missing Item Finding". In this problem, for r < n, one is given a data stream a_1,…,a_r of elements in [n], (possibly with repetitions), and must output some x ∈ [n] which does not equal any of the a_i. We prove that, for r = n^Θ(1) and δ = 1/poly(n), the space required for randomized algorithms that solve this problem in the static setting with error δ is Θ(polylog(n)); for algorithms in the adversarial setting with error δ, Θ((1 + r^2 / n) polylog(n)); and for deterministic algorithms, Θ(r / polylog(n)). Because our adversarially robust algorithm relies on free access to a string of O(r log n) random bits, we investigate a "random start" model of streaming algorithms where all random bits used are included in the space cost. Here we find a conditional lower bound on the space usage, which depends on the space that would be needed for a pseudo-deterministic algorithm to solve the problem. We also prove an Ω(r / polylog(n)) lower bound for the space needed by a streaming algorithm with < 1/2^polylog(n) error against "white-box" adversaries that can see the internal state of the algorithm, but not predict its future random decisions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/04/2023

Tight Space Lower Bound for Pseudo-Deterministic Approximate Counting

We investigate one of the most basic problems in streaming algorithms: a...
research
04/19/2022

The White-Box Adversarial Data Stream Model

We study streaming algorithms in the white-box adversarial model, where ...
research
11/26/2017

Online Facility Location on Semi-Random Streams

In the streaming model, the order of the stream can significantly affect...
research
03/28/2022

Efficient Algorithm for Deterministic Search of Hot Elements

When facing a very large stream of data, it is often desirable to extrac...
research
03/28/2023

Lower Bounds for Pseudo-Deterministic Counting in a Stream

Many streaming algorithms provide only a high-probability relative appro...
research
07/07/2023

Improved Algorithms for White-Box Adversarial Streams

We study streaming algorithms in the white-box adversarial stream model,...
research
11/26/2019

Pseudo-deterministic Streaming

A pseudo-deterministic algorithm is a (randomized) algorithm which, when...

Please sign up or login with your details

Forgot password? Click here to reset