Curse of Heterogeneity: Computational Barriers in Sparse Mixture Models and Phase Retrieval

08/21/2018
by   Jianqing Fan, et al.
0

We study the fundamental tradeoffs between statistical accuracy and computational tractability in the analysis of high dimensional heterogeneous data. As examples, we study sparse Gaussian mixture model, mixture of sparse linear regressions, and sparse phase retrieval model. For these models, we exploit an oracle-based computational model to establish conjecture-free computationally feasible minimax lower bounds, which quantify the minimum signal strength required for the existence of any algorithm that is both computationally tractable and statistically accurate. Our analysis shows that there exist significant gaps between computationally feasible minimax risks and classical ones. These gaps quantify the statistical price we must pay to achieve computational tractability in the presence of data heterogeneity. Our results cover the problems of detection, estimation, support recovery, and clustering, and moreover, resolve several conjectures of Azizyan et al. (2013, 2015); Verzelen and Arias-Castro (2017); Cai et al. (2016). Interestingly, our results reveal a new but counter-intuitive phenomenon in heterogeneous data analysis that more data might lead to less computation complexity.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/30/2015

Sharp Computational-Statistical Phase Transitions via Oracle Computational Model

We study the fundamental tradeoffs between computational tractability an...
research
08/08/2019

Average-Case Lower Bounds for Learning Sparse Mixtures, Robust Estimation and Semirandom Adversaries

This paper develops several average-case reduction techniques to show ne...
research
07/21/2022

Bayesian Sparse Gaussian Mixture Model in High Dimensions

We establish the minimax risk for parameter estimation in sparse high-di...
research
04/03/2013

Computational Lower Bounds for Sparse PCA

In the context of sparse principal component detection, we bring evidenc...
research
01/22/2022

Optimal Estimation and Computational Limit of Low-rank Gaussian Mixtures

Structural matrix-variate observations routinely arise in diverse fields...
research
11/01/2014

Learning Mixed Multinomial Logit Model from Ordinal Data

Motivated by generating personalized recommendations using ordinal (or p...
research
06/29/2020

Recovery of Sparse Signals from a Mixture of Linear Samples

Mixture of linear regressions is a popular learning theoretic model that...

Please sign up or login with your details

Forgot password? Click here to reset