Exploring the Gap between Tolerant and Non-tolerant Distribution Testing

10/19/2021
by   Sourav Chakraborty, et al.

The framework of distribution testing is currently ubiquitous in the field of property testing. In this model, the input is a probability distribution accessible via independently drawn samples from an oracle. The testing task is to distinguish a distribution that satisfies some property from a distribution that is far from satisfying it in the ℓ_1 distance. Tolerant testing adds the requirement that distributions close to satisfying the property are also accepted. This work focuses on the connection between the sample complexity of non-tolerant ("traditional") testing of distributions and that of tolerant testing. When limiting our scope to label-invariant (symmetric) properties of distributions, we prove that the gap is at most quadratic. Conversely, the property of being the uniform distribution is known to have an almost-quadratic gap. For general, not necessarily label-invariant properties, the situation is more complicated, and we show some partial results. We show that if a property requires the distributions to be non-concentrated, then it cannot be non-tolerantly tested with o(√(n)) samples, where n denotes the universe size. This implies at most a quadratic gap, because a distribution can be learned (and hence tolerantly tested against any property) using 𝒪(n) samples. Non-concentration is a strong requirement on a property: we also prove a nearly linear lower bound for tolerant testing of such properties. To provide evidence for other general cases (where the properties are not necessarily label-invariant), we show that if an input distribution is very concentrated, in the sense that it is mostly supported on a subset of size s of the universe, then it can be learned using only 𝒪(s) samples. The learning procedure adapts to the input and works without knowing s in advance.
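The remark that a learned distribution can be tolerantly tested against any property can be illustrated with a minimal sketch. This is not the paper's algorithm: it is the naive learn-then-test baseline, and it assumes the property is a single known target distribution over {0, ..., n-1}. The function names, the midpoint threshold, and the parameters eps1 and eps2 (the "close" and "far" distances) are hypothetical choices made for illustration.

```python
from collections import Counter


def empirical_distribution(samples, n):
    """Estimate a distribution over {0, ..., n-1} from sample frequencies."""
    counts = Counter(samples)
    m = len(samples)
    return [counts[i] / m for i in range(n)]


def l1_distance(p, q):
    """l_1 distance between two distributions given as equal-length lists."""
    return sum(abs(a - b) for a, b in zip(p, q))


def learn_then_test(samples, n, target, eps1, eps2):
    """Accept iff the learned distribution looks closer than the midpoint.

    Assumption (illustrative): with enough samples the empirical l_1
    estimate is accurate to within (eps2 - eps1) / 2, so thresholding at
    the midpoint separates eps1-close inputs from eps2-far inputs.
    """
    estimate = l1_distance(empirical_distribution(samples, n), target)
    return estimate <= (eps1 + eps2) / 2


n = 10
uniform = [1 / n] * n
balanced_samples = [i % n for i in range(1000)]  # every element equally often
point_mass_samples = [0] * 1000                  # all mass on one element

accepts_uniform = learn_then_test(balanced_samples, n, uniform, 0.1, 0.5)
accepts_point_mass = learn_then_test(point_mass_samples, n, uniform, 0.1, 0.5)
```

In this sketch the sample count is fixed by hand. The abstract's point is that on the order of n samples suffice for learning, so a non-tolerant lower bound of order √(n) leaves room for at most a quadratic gap.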


