Support Size Estimation: The Power of Conditioning

11/22/2022
by   Diptarka Chakraborty, et al.
0

We consider the problem of estimating the support size of a distribution D. Our investigations are pursued through the lens of distribution testing and seek to understand the power of conditional sampling (denoted as COND), wherein one is allowed to query the given distribution conditioned on an arbitrary subset S. The primary contribution of this work is to introduce a new approach to lower bounds for the COND model that relies on using powerful tools from information theory and communication complexity. Our approach allows us to obtain surprisingly strong lower bounds for the COND model and its extensions. 1) We bridge the longstanding gap between the upper (O(loglog n + 1/ϵ^2)) and the lower bound Ω(√(loglog n)) for COND model by providing a nearly matching lower bound. Surprisingly, we show that even if we get to know the actual probabilities along with COND samples, still Ω(loglog n + 1/ϵ^2 log (1/ϵ)) queries are necessary. 2) We obtain the first non-trivial lower bound for COND equipped with an additional oracle that reveals the conditional probabilities of the samples (to the best of our knowledge, this subsumes all of the models previously studied): in particular, we demonstrate that Ω(logloglog n + 1/ϵ^2 log (1/ϵ)) queries are necessary.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/20/2020

Tolerant Distribution Testing in the Conditional Sampling Model

Recently, there has been significant work studying distribution testing ...
research
11/15/2019

New Query Lower Bounds for Submodular Function MInimization

We consider submodular function minimization in the oracle model: given ...
research
07/12/2022

Optimal Clustering with Noisy Queries via Multi-Armed Bandit

Motivated by many applications, we study clustering with a faulty oracle...
research
01/04/2020

Discovering and Certifying Lower Bounds for the Online Bin Stretching Problem

There are several problems in the theory of online computation where tig...
research
05/03/2021

Lower Bounds on the Time/Memory Tradeoff of Function Inversion

We study time/memory tradeoffs of function inversion: an algorithm, i.e....
research
02/01/2020

Oracle lower bounds for stochastic gradient sampling algorithms

We consider the problem of sampling from a strongly log-concave density ...
research
04/19/2019

Uncertainty about Uncertainty: Near-Optimal Adaptive Algorithms for Estimating Binary Mixtures of Unknown Coins

Given a mixture between two populations of coins, "positive" coins that ...

Please sign up or login with your details

Forgot password? Click here to reset