Minimax Theory for High-dimensional Gaussian Mixtures with Sparse Mean Separation

06/09/2013
by Martin Azizyan, et al.

While several papers have investigated computationally and statistically efficient methods for learning Gaussian mixtures, precise minimax bounds for their statistical performance and the fundamental limits in high-dimensional settings are not well understood. In this paper, we provide precise information-theoretic bounds on the clustering accuracy and sample complexity of learning a mixture of two isotropic Gaussians in high dimensions under small mean separation. If a sparse subset of relevant dimensions determines the mean separation, then the sample complexity depends only on the number of relevant dimensions and the mean separation, and can be achieved by a simple, computationally efficient procedure. Our results provide a first step toward a theoretical basis for recent methods that combine feature selection and clustering.
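To make the setting concrete, the following is a minimal sketch of combining feature selection with clustering for a two-component isotropic mixture whose mean separation lives on a few coordinates. The abstract does not spell out the procedure, so everything here is an illustrative assumption rather than the authors' method: the symmetric model with means +mu and -mu, unit noise variance, known sparsity `s`, selection by largest per-coordinate sample variance, and clustering by the sign of the projection onto the leading principal direction. All function names are hypothetical.

```python
import random

def sample_mixture(n, d, support, delta, rng):
    """Draw n points from 0.5*N(mu, I) + 0.5*N(-mu, I) in d dimensions.
    mu equals delta on `support` and 0 elsewhere (assumed model)."""
    mu = [0.0] * d
    for j in support:
        mu[j] = delta
    points, labels = [], []
    for _ in range(n):
        y = rng.choice((-1, 1))  # hidden cluster label
        labels.append(y)
        points.append([y * m + rng.gauss(0.0, 1.0) for m in mu])
    return points, labels

def select_features(points, s):
    """Keep the s coordinates with the largest sample variance.
    Under the assumed model a relevant coordinate has variance 1 + mu_j^2."""
    n, d = len(points), len(points[0])
    variances = []
    for j in range(d):
        col = [p[j] for p in points]
        mean = sum(col) / n
        variances.append(sum((x - mean) ** 2 for x in col) / n)
    top = sorted(range(d), key=lambda j: variances[j], reverse=True)[:s]
    return sorted(top)

def cluster(points, features, rng, iters=50):
    """Label each point by the sign of its projection onto the leading
    principal direction of the selected coordinates (power iteration)."""
    n, s = len(points), len(features)
    means = [sum(p[j] for p in points) / n for j in features]
    z = [[p[j] - m for j, m in zip(features, means)] for p in points]
    v = [rng.gauss(0.0, 1.0) for _ in range(s)]  # random start vector
    for _ in range(iters):
        proj = [sum(a * b for a, b in zip(row, v)) for row in z]
        w = [sum(proj[i] * z[i][k] for i in range(n)) for k in range(s)]
        norm = sum(x * x for x in w) ** 0.5
        v = [x / norm for x in w]
    return [1 if sum(a * b for a, b in zip(row, v)) >= 0 else -1 for row in z]

def accuracy(pred, truth):
    """Fraction of agreeing labels, up to swapping the two cluster names."""
    agree = sum(p == t for p, t in zip(pred, truth)) / len(truth)
    return max(agree, 1.0 - agree)

if __name__ == "__main__":
    rng = random.Random(0)
    support = [3, 11, 19, 27, 42]  # hypothetical relevant coordinates
    points, labels = sample_mixture(400, 50, support, 1.5, rng)
    feats = select_features(points, len(support))
    print("selected coordinates:", feats)
    print("clustering accuracy:", accuracy(cluster(points, feats, rng), labels))
```

The sketch uses a fairly large separation so that a small sample suffices; the regime studied in the paper is small separation, where the number of samples needed grows accordingly.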


research
02/14/2020

Optimal estimation of high-dimensional Gaussian mixtures

This paper studies the optimal rate of estimation in a finite Gaussian l...
research
02/19/2014

Near-optimal-sample estimators for spherical Gaussian mixtures

Statistical and machine-learning algorithms are frequently applied to hi...
research
07/26/2022

Efficient Algorithms for Sparse Moment Problems without Separation

We consider the sparse moment problem of learning a k-spike mixture in h...
research
07/21/2022

Bayesian Sparse Gaussian Mixture Model in High Dimensions

We establish the minimax risk for parameter estimation in sparse high-di...
research
06/25/2019

Spectral Properties of Radial Kernels and Clustering in High Dimensions

In this paper, we study the spectrum and the eigenvectors of radial kern...
research
11/07/2020

Limits on Testing Structural Changes in Ising Models

We present novel information-theoretic limits on detecting sparse change...
research
03/02/2017

Being Robust (in High Dimensions) Can Be Practical

Robust estimation is much more challenging in high dimensions than it is...
