On The Memory Complexity of Uniformity Testing

06/19/2022
by Tomer Berg, et al.

In this paper we consider the problem of uniformity testing with limited memory. We observe a sequence of independent, identically distributed random variables drawn from a distribution p over [n], which is either uniform or ε-far from uniform in total variation distance, and our goal is to determine the correct hypothesis. At each time point we are allowed to update the state of a finite-memory machine with S states, where each state of the machine is assigned one of the hypotheses, and we are interested in obtaining an asymptotic probability of error at most 0 < δ < 1/2 uniformly under both hypotheses. The main contribution of this paper is deriving upper and lower bounds on the number of states S needed to achieve a constant error probability δ, as a function of n and ε: our upper bound is O(n log n/ε) and our lower bound is Ω(n + 1/ε). Prior works in the field have almost exclusively used collision counting for upper bounds and the Paninski mixture for lower bounds. Somewhat surprisingly, in the limited-memory, unlimited-samples setup, the optimal solution does not involve counting collisions, and the Paninski prior is not hard. Thus, different proof techniques are needed to attain our bounds.
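To make the setup concrete, the following Python sketch builds the two kinds of input distribution mentioned in the abstract and runs a generic S-state machine over a sample sequence. It is only an illustration under stated assumptions, not the construction used in the paper: the function names (`uniform`, `paninski`, `tv_distance`, `run_machine`) are hypothetical, the Paninski-type distribution follows the standard pairwise ±2ε/n perturbation, and the machine is assumed to have deterministic transitions given as a lookup table.

```python
import random
from fractions import Fraction


def uniform(n):
    """Uniform distribution over [n], as a list of exact probabilities."""
    return [Fraction(1, n)] * n


def paninski(n, eps, rng):
    """Paninski-type perturbation of the uniform distribution (assumed form).

    Elements are paired up; within each pair one mass is raised to
    (1 + 2*eps)/n and the other lowered to (1 - 2*eps)/n, with the
    direction chosen by a fair coin. Requires even n and 0 < eps <= 1/2.
    """
    if n % 2 != 0:
        raise ValueError("n must be even")
    if not (0 < eps <= Fraction(1, 2)):
        raise ValueError("eps must lie in (0, 1/2]")
    p = []
    for _ in range(n // 2):
        sign = rng.choice((1, -1))
        p.append((1 + 2 * eps * sign) / n)
        p.append((1 - 2 * eps * sign) / n)
    return p


def tv_distance(p, q):
    """Total variation distance: half the L1 distance."""
    return sum(abs(a - b) for a, b in zip(p, q)) / 2


def run_machine(transition, labels, start, samples):
    """Run an S-state machine over the samples and return its hypothesis.

    transition[s][x] is the next state from state s on sample x
    (deterministic transitions are an assumption of this sketch);
    labels[s] is the hypothesis assigned to state s.
    """
    state = start
    for x in samples:
        state = transition[state][x]
    return labels[state]


if __name__ == "__main__":
    n, eps = 8, Fraction(1, 4)
    far = paninski(n, eps, random.Random(0))
    print(tv_distance(far, uniform(n)))  # distance of the perturbed distribution from uniform
```

Every distribution produced by `paninski` is exactly ε-far from uniform, since each of the n masses moves by 2ε/n and total variation is half the L1 distance.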


