Monotone probability distributions over the Boolean cube can be learned with sublinear samples

02/09/2020
by   Ronitt Rubinfeld, et al.
0

A probability distribution over the Boolean cube is monotone if flipping the value of a coordinate from zero to one can only increase the probability of an element. Given samples of an unknown monotone distribution over the Boolean cube, we give (to our knowledge) the first algorithm that learns an approximation of the distribution in statistical distance using a number of samples that is sublinear in the domain. To do this, we develop a structural lemma describing monotone probability distributions. The structural lemma has further implications to the sample complexity of basic testing tasks for analyzing monotone probability distributions over the Boolean cube: We use it to give nontrivial upper bounds on the tasks of estimating the distance of a monotone distribution to uniform and of estimating the support size of a monotone distribution. In the setting of monotone probability distributions over the Boolean cube, our algorithms are the first to have sample complexity lower than known lower bounds for the same testing tasks on arbitrary (not necessarily monotone) probability distributions. One further consequence of our learning algorithm is an improved sample complexity for the task of testing whether a distribution on the Boolean cube is monotone.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/06/2019

Towards Testing Monotonicity of Distributions Over General Posets

In this work, we consider the sample complexity required for testing the...
research
11/30/2017

Testing Conditional Independence of Discrete Distributions

We study the problem of testing conditional independence for discrete di...
research
08/27/2023

Testing Junta Truncation

We consider the basic statistical problem of detecting truncation of the...
research
10/31/2018

Testing Halfspaces over Rotation-Invariant Distributions

We present an algorithm for testing halfspaces over arbitrary, unknown r...
research
09/06/2023

Testing properties of distributions in the streaming model

We study distribution testing in the standard access model and the condi...
research
08/15/2017

Generalized Uniformity Testing

In this work, we revisit the problem of uniformity testing of discrete p...
research
01/14/2018

On Identifying a Massive Number of Distributions

Finding the underlying probability distributions of a set of observed se...

Please sign up or login with your details

Forgot password? Click here to reset