Detection and Recovery of Hidden Submatrices

06/11/2023
by Marom Dadon, et al.

In this paper, we study the problems of detection and recovery of hidden submatrices with elevated means inside a large Gaussian random matrix. We consider two different structures for the planted submatrices. In the first model, the planted submatrices are disjoint, and their row and column indices can be arbitrary. Inspired by scientific applications, the second model restricts the row and column indices to be consecutive. In the detection problem, under the null hypothesis, the observed matrix consists of independent and identically distributed standard normal entries. Under the alternative, there exists a set of hidden submatrices with elevated means inside the same standard normal matrix. Recovery refers to the task of locating the hidden submatrices. For both problems, and for both models, we characterize the statistical and computational barriers by deriving information-theoretic lower bounds, designing and analyzing algorithms that match those bounds, and proving computational lower bounds based on the low-degree polynomial conjecture. In particular, we show that the space of model parameters (i.e., the number of planted submatrices, their dimensions, and the elevated mean) can be partitioned into three regions: the impossible regime, where all algorithms fail; the hard regime, where detection or recovery is statistically possible but we give evidence that polynomial-time algorithms do not exist; and the easy regime, where polynomial-time algorithms exist.
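To make the detection setup concrete, the following is a minimal sketch of the first model (disjoint planted submatrices with arbitrary row and column indices). The function names, the parameter names (`n`, `k`, `m`, `lam`), the restriction to square `k x k` blocks in an `n x n` matrix, and the choice of a global-sum statistic are all assumptions made for illustration; the abstract does not say which tests the paper analyzes.

```python
import random


def sample_matrix(n, k, m, lam, planted, rng):
    """Draw an n x n matrix of i.i.d. N(0, 1) entries.

    If `planted` is True (alternative hypothesis), add `lam` to the entries
    of m disjoint k x k submatrices with arbitrary row and column indices.
    Requires m * k <= n so the blocks can be disjoint.
    """
    X = [[rng.gauss(0.0, 1.0) for _ in range(n)] for _ in range(n)]
    blocks = []
    if planted:
        # Sampling without replacement keeps row sets and column sets disjoint.
        rows = rng.sample(range(n), m * k)
        cols = rng.sample(range(n), m * k)
        for b in range(m):
            R = rows[b * k:(b + 1) * k]
            C = cols[b * k:(b + 1) * k]
            for i in R:
                for j in C:
                    X[i][j] += lam
            blocks.append((sorted(R), sorted(C)))
    return X, blocks


def sum_statistic(X):
    """Sum of all entries divided by n.

    Under the null this is exactly N(0, 1); under the alternative its mean
    is shifted by m * k**2 * lam / n.
    """
    n = len(X)
    return sum(sum(row) for row in X) / n


if __name__ == "__main__":
    rng = random.Random(0)
    null_X, _ = sample_matrix(60, 10, 2, 3.0, False, rng)
    alt_X, blocks = sample_matrix(60, 10, 2, 3.0, True, rng)
    print("null statistic:", sum_statistic(null_X))
    print("alternative statistic:", sum_statistic(alt_X))
    print("planted blocks:", blocks)
```

Comparing the statistic with a fixed threshold gives a simple detection rule, and the returned `blocks` are the ground truth a recovery procedure would try to locate. A global sum is only a baseline: it ignores where the elevated entries sit, so it says nothing about recovery.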


