Comparison Graphs: a Unified Method for Uniformity Testing

12/03/2020
by   Uri Meir, et al.
0

Distribution testing can be described as follows: q samples are being drawn from some unknown distribution P over a known domain [n]. After the sampling process, a decision must be made about whether P holds some property, or is far from it. The most studied problem in the field is arguably uniformity testing, where one needs to distinguish the case that P is uniform over [n] from the case that P is ϵ-far from being uniform (in ℓ_1). In the classic model, it is known that Θ(√(n)/ϵ^2) samples are necessary and sufficient for this task. This problem was recently considered in various restricted models that pose, for example, communication or memory constraints. In more than one occasion, the known optimal solution boils down to counting collisions among the drawn samples (each two samples that have the same value add one to the count), an idea that dates back to the first uniformity tester, and was coined the name "collision-based tester". In this paper, we introduce the notion of comparison graphs and use it to formally define a generalized collision-based tester. Roughly speaking, the edges of the graph indicate the tester which pairs of samples should be compared (that is, the original tester is induced by a clique, where all pairs are being compared). We prove a structural theorem that gives a sufficient condition for a comparison graph to induce a good uniformity tester. As an application, we develop a generic method to test uniformity, and devise nearly-optimal uniformity testers under various computational constraints. We improve and simplify a few known results, and introduce a new constrained model in which the method also produces an efficient tester. The idea behind our method is to translate computational constraints of a certain model to ones on the comparison graph, which paves the way to finding a good graph.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/07/2023

Testing Graph Properties with the Container Method

We establish nearly optimal sample complexity bounds for testing the ρ-c...
research
07/06/2019

Testing Mixtures of Discrete Distributions

There has been significant study on the sample complexity of testing pro...
research
08/15/2017

Generalized Uniformity Testing

In this work, we revisit the problem of uniformity testing of discrete p...
research
05/23/2019

Testing Graphs against an Unknown Distribution

The area of graph property testing seeks to understand the relation betw...
research
01/18/2019

Towards a General Direct Product Testing Theorem

The Direct Product encoding of a string a∈{0,1}^n on an underlying domai...
research
06/19/2022

On The Memory Complexity of Uniformity Testing

In this paper we consider the problem of uniformity testing with limited...
research
03/16/2020

Stochastic Frontier Analysis with Generalized Errors: inference, model comparison and averaging

Our main contribution lies in formulation of a generalized, parametric m...

Please sign up or login with your details

Forgot password? Click here to reset