Statistical Problems with Planted Structures: Information-Theoretical and Computational Limits

05/31/2018
by   Yihong Wu, et al.
0

Over the past few years, insights from computer science, statistical physics, and information theory have revealed phase transitions in a wide array of high-dimensional statistical problems at two distinct thresholds: One is the information-theoretical (IT) threshold below which the observation is too noisy so that inference of the ground truth structure is impossible regardless of the computational cost; the other is the computational threshold above which inference can be performed efficiently, i.e., in time that is polynomial in the input size. In the intermediate regime, inference is information-theoretically possible, but conjectured to be computationally hard. This article provides a survey of the common techniques for determining the sharp IT and computational limits, using community detection and submatrix detection as illustrating examples. For IT limits, we discuss tools including the first and second moment method for analyzing the maximal likelihood estimator, information-theoretic methods for proving impossibility results using rate-distortion theory, and methods originated from statistical physics such as interpolation method. To investigate computational limits, we describe a common recipe to construct a randomized polynomial-time reduction scheme that approximately maps instances of the planted clique problem to the problem of interest in total variation distance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/29/2017

Community detection and stochastic block models: recent developments

The stochastic block model (SBM) is a random graph model with planted cl...
research
11/15/2020

Contextual Stochastic Block Model: Sharp Thresholds and Contiguity

We study community detection in the contextual stochastic block model ar...
research
01/18/2016

Statistical Mechanics of High-Dimensional Inference

To model modern large-scale datasets, we need efficient algorithms to in...
research
10/15/2022

Disordered Systems Insights on Computational Hardness

In this review article, we discuss connections between the physics of di...
research
02/06/2015

Computational and Statistical Boundaries for Submatrix Localization in a Large Noisy Matrix

The interplay between computational efficiency and statistical accuracy ...
research
11/08/2015

Statistical physics of inference: Thresholds and algorithms

Many questions of fundamental interest in todays science can be formulat...
research
01/01/2018

Statistical and Computational Limits for Sparse Matrix Detection

This paper investigates the fundamental limits for detecting a high-dime...

Please sign up or login with your details

Forgot password? Click here to reset