Testing properties of distributions in the streaming model

09/06/2023
by   Sampriti Roy, et al.
0

We study distribution testing in the standard access model and the conditional access model when the memory available to the testing algorithm is bounded. In both scenarios, the samples appear in an online fashion and the goal is to test the properties of distribution using an optimal number of samples subject to a memory constraint on how many samples can be stored at a given time. First, we provide a trade-off between the sample complexity and the space complexity for testing identity when the samples are drawn according to the conditional access oracle. We then show that we can learn a succinct representation of a monotone distribution efficiently with a memory constraint on the number of samples that are stored that is almost optimal. We also show that the algorithm for monotone distributions can be extended to a larger class of decomposable distributions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/11/2019

Communication and Memory Efficient Testing of Discrete Distributions

We study distribution testing with communication and memory constraints ...
research
11/17/2019

Testing Properties of Multiple Distributions with Few Samples

We propose a new setting for testing properties of distributions while r...
research
02/09/2020

Monotone probability distributions over the Boolean cube can be learned with sublinear samples

A probability distribution over the Boolean cube is monotone if flipping...
research
12/07/2020

VC Dimension and Distribution-Free Sample-Based Testing

We consider the problem of determining which classes of functions can be...
research
11/17/2019

Random Restrictions of High-Dimensional Distributions and Uniformity Testing with Subcube Conditioning

We give a nearly-optimal algorithm for testing uniformity of distributio...
research
07/26/2022

The Sample Complexity of Forecast Aggregation

We consider a Bayesian forecast aggregation model where n experts, after...
research
05/12/2022

Sequential algorithms for testing identity and closeness of distributions

What advantage do sequential procedures provide over batch algorithms fo...

Please sign up or login with your details

Forgot password? Click here to reset