Recovering Data Permutation from Noisy Observations: The Linear Regime

05/15/2020
by   Minoh Jeong, et al.
0

This paper considers a noisy data structure recovery problem. The goal is to investigate the following question: Given a noisy data observation, according to which permutation was the original data sorted? The focus is on scenarios where data is generated according to an isotropic Gaussian distribution, and the perturbation consists of adding Gaussian noise with an arbitrary covariance matrix. This problem is posed within a hypothesis testing framework. The objective is to study the linear regime in which the optimal decoder has a polynomial complexity in the data size, and it declares the permutation by simply computing a linear function of the noisy observation. The main result of the paper is a complete characterization of the linear regime in terms of the noise covariance matrix. Specifically, it is shown that this matrix must have a very flat spectrum with at most three distinct eigenvalues to induce the linear regime. Several practically relevant implications of this result are discussed, and the error probability incurred by the decision criterion in the linear regime is also characterized. A core technical component consists of using linear algebraic and geometric tools, such as Steiner symmetrization.

READ FULL TEXT

page 7

page 10

page 11

page 12

research
05/07/2021

Retrieving Data Permutations from Noisy Observations: High and Low Noise Asymptotics

This paper considers the problem of recovering the permutation of an n-d...
research
08/09/2016

Linear Regression with an Unknown Permutation: Statistical and Computational Limits

Consider a noisy linear observation model with an unknown permutation, b...
research
01/24/2021

Testing for subsphericity when n and p are of different asymptotic order

In this short note, we extend a classical test of subsphericity, based o...
research
01/30/2016

Spectrum Estimation from Samples

We consider the problem of approximating the set of eigenvalues of the c...
research
05/04/2018

Estimating Learnability in the Sublinear Data Regime

We consider the problem of estimating how well a model class is capable ...
research
02/22/2018

Robustness of classifiers to uniform ℓ_p and Gaussian noise

We study the robustness of classifiers to various kinds of random noise ...
research
04/13/2021

On Minimax Detection of Gaussian Stochastic Sequences and Gaussian Stationary Signals

Minimax detection of Gaussian stochastic sequences (signals) with unknow...

Please sign up or login with your details

Forgot password? Click here to reset