Entrywise Estimation of Singular Vectors of Low-Rank Matrices with Heteroskedasticity and Dependence

05/27/2021
by Joshua Agterberg, et al.

We propose an estimator for the singular vectors of high-dimensional low-rank matrices corrupted by additive subgaussian noise, where the noise matrix is allowed to have dependence within rows and heteroskedasticity between them. We prove finite-sample ℓ_{2,∞} bounds and a Berry-Esseen theorem for the individual entries of the estimator, and we apply these results to high-dimensional mixture models. Our Berry-Esseen theorem makes explicit the geometric relationship between the signal matrix, the covariance structure of the noise, and the distribution of the errors in the singular vector estimation task. These results are illustrated in numerical simulations. Previous results of this type rely on assumptions of Gaussianity or independence between the entries of the additive noise. In contrast, handling the dependence between entries in our proofs requires careful leave-one-out analysis and conditioning arguments. Our results depend only on the signal-to-noise ratio, the sample size, and the spectral properties of the signal matrix.
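To make the setting concrete, the following is a minimal simulation sketch of the noise model described above: a low-rank signal plus noise whose rows are internally correlated and have different scales. It then measures the ℓ_2,∞ error (the largest row-wise Euclidean error) of the estimated left singular vectors after orthogonal alignment. The plain truncated SVD used here is only a stand-in and is not the estimator proposed in the paper. The function names, the AR(1)-style covariance, the Gaussian noise, and all parameter values are assumptions chosen for illustration.

```python
import numpy as np


def two_to_infinity_error(U_hat, U):
    """Largest row-wise Euclidean error between U_hat and U after
    aligning U_hat to U with the best orthogonal matrix (Procrustes)."""
    # W minimizes ||U_hat @ W - U||_F over orthogonal matrices W.
    A, _, Bt = np.linalg.svd(U_hat.T @ U)
    W = A @ Bt
    return np.max(np.linalg.norm(U_hat @ W - U, axis=1))


def simulate(n=200, d=300, r=2, signal=500.0, seed=0):
    """Return the l_{2,infinity} error of truncated-SVD singular vectors
    under row-dependent, heteroskedastic noise (illustrative model only)."""
    rng = np.random.default_rng(seed)

    # Rank-r signal M = U diag(s) V^T with orthonormal U and V.
    U, _ = np.linalg.qr(rng.standard_normal((n, r)))
    V, _ = np.linalg.qr(rng.standard_normal((d, r)))
    s = signal * np.arange(r, 0, -1)  # singular values r*signal, ..., signal
    M = (U * s) @ V.T

    # Dependence within rows: each noise row has covariance Sigma
    # (an assumed AR(1)-style matrix with correlation 0.5).
    idx = np.arange(d)
    Sigma = 0.5 ** np.abs(idx[:, None] - idx[None, :])
    L = np.linalg.cholesky(Sigma)

    # Heteroskedasticity between rows: row i is scaled by scales[i].
    scales = rng.uniform(0.5, 2.0, size=n)
    E = scales[:, None] * (rng.standard_normal((n, d)) @ L.T)

    # Stand-in estimator: top-r left singular vectors of the noisy matrix.
    U_hat = np.linalg.svd(M + E, full_matrices=False)[0][:, :r]
    return two_to_infinity_error(U_hat, U)


if __name__ == "__main__":
    for sig in (500.0, 5000.0):
        print(f"signal={sig:g}  l2inf error={simulate(signal=sig):.4f}")
```

Running the script with a larger `signal` value should give a smaller error, in line with the abstract's statement that the results depend on the signal-to-noise ratio.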


research
11/01/2022

Fundamental Limits of Low-Rank Matrix Estimation with Diverging Aspect Ratios

We consider the problem of estimating the factors of a low-rank n × d ma...
research
07/07/2022

Optimal shrinkage of singular values under high-dimensional noise with separable covariance structure

We consider an optimal shrinkage algorithm that depends on an effective ...
research
02/07/2023

Mismatched estimation of non-symmetric rank-one matrices corrupted by structured noise

We study the performance of a Bayesian statistician who estimates a rank...
research
05/04/2018

Global testing under the sparse alternatives for single index models

For the single index model y=f(β^τx,ϵ) with Gaussian design, and β is a...
research
11/04/2014

A random algorithm for low-rank decomposition of large-scale matrices with missing entries

A Random SubMatrix method (RSM) is proposed to calculate the low-rank de...
research
12/20/2019

CDPA: Common and Distinctive Pattern Analysis between High-dimensional Datasets

A representative model in integrative analysis of two high-dimensional d...
research
08/01/2020

Simpler Proofs for Approximate Factor Models of Large Dimensions

Estimates of the approximate factor model are increasingly used in empir...
