Learning Entangled Single-Sample Gaussians in the Subset-of-Signals Model

07/10/2020
by   Yingyu Liang, et al.
0

In the setting of entangled single-sample distributions, the goal is to estimate some common parameter shared by a family of n distributions, given one single sample from each distribution. This paper studies mean estimation for entangled single-sample Gaussians that have a common mean but different unknown variances. We propose the subset-of-signals model where an unknown subset of m variances are bounded by 1 while there are no assumptions on the other variances. In this model, we analyze a simple and natural method based on iteratively averaging the truncated samples, and show that the method achieves error O (√(nln n)/m) with high probability when m=Ω(√(nln n)), matching existing bounds for this range of m. We further prove lower bounds, showing that the error is Ω((n/m^4)^1/2) when m is between Ω(ln n) and O(n^1/4), and the error is Ω((n/m^4)^1/6) when m is between Ω(n^1/4) and O(n^1 - ϵ) for an arbitrarily small ϵ>0, improving existing lower bounds and extending to a wider range of m.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/20/2020

Learning Entangled Single-Sample Distributions via Iterative Trimming

In the setting of entangled single-sample distributions, the goal is to ...
research
06/10/2022

List-Decodable Sparse Mean Estimation via Difference-of-Pairs Filtering

We study the problem of list-decodable sparse mean estimation. Specifica...
research
06/06/2021

Towards Practical Mean Bounds for Small Samples

Historically, to bound the mean for small sample sizes, practitioners ha...
research
08/07/2023

Partial identification of kernel based two sample tests with mismeasured data

Nonparametric two-sample tests such as the Maximum Mean Discrepancy (MMD...
research
02/21/2020

Private Mean Estimation of Heavy-Tailed Distributions

We give new upper and lower bounds on the minimax sample complexity of d...
research
10/28/2019

Testing Equivalence of Clustering

In this paper, we test whether two datasets share a common clustering st...
research
08/16/2018

Active Distribution Learning from Indirect Samples

This paper studies the problem of learning the probability distribution...

Please sign up or login with your details

Forgot password? Click here to reset