Computational and Statistical Boundaries for Submatrix Localization in a Large Noisy Matrix

02/06/2015
by   T. Tony Cai, et al.
0

The interplay between computational efficiency and statistical accuracy in high-dimensional inference has drawn increasing attention in the literature. In this paper, we study computational and statistical boundaries for submatrix localization. Given one observation of (one or multiple non-overlapping) signal submatrix (of magnitude λ and size k_m × k_n) contaminated with a noise matrix (of size m × n), we establish two transition thresholds for the signal to noise λ/σ ratio in terms of m, n, k_m, and k_n. The first threshold, SNR_c, corresponds to the computational boundary. Below this threshold, it is shown that no polynomial time algorithm can succeed in identifying the submatrix, under the hidden clique hypothesis. We introduce adaptive linear time spectral algorithms that identify the submatrix with high probability when the signal strength is above the threshold SNR_c. The second threshold, SNR_s, captures the statistical boundary, below which no method can succeed with probability going to one in the minimax sense. The exhaustive search method successfully finds the submatrix above this threshold. The results show an interesting phenomenon that SNR_c is always significantly larger than SNR_s, which implies an essential gap between statistical optimality and computational efficiency for submatrix localization.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/21/2020

Tensor Clustering with Planted Structures: Statistical Optimality and Computational Limits

This paper studies the statistical and computational limits of high-orde...
research
07/14/2019

More Supervision, Less Computation: Statistical-Computational Tradeoffs in Weakly Supervised Learning

We consider the weakly supervised binary classification problem where th...
research
01/22/2022

Optimal Estimation and Computational Limit of Low-rank Gaussian Mixtures

Structural matrix-variate observations routinely arise in diverse fields...
research
05/31/2018

Statistical Problems with Planted Structures: Information-Theoretical and Computational Limits

Over the past few years, insights from computer science, statistical phy...
research
11/21/2020

Phase transition of graph Laplacian of high dimensional noisy random point cloud

We systematically explore the spectral distribution of kernel-based grap...
research
02/21/2017

Phase Transitions of Spectral Initialization for High-Dimensional Nonconvex Estimation

We study a spectral initialization method that serves a key role in rece...
research
06/14/2021

iNNformant: Boundary Samples as Telltale Watermarks

Boundary samples are special inputs to artificial neural networks crafte...

Please sign up or login with your details

Forgot password? Click here to reset