Subspace Estimation from Unbalanced and Incomplete Data Matrices: ℓ_2,∞ Statistical Guarantees

10/09/2019
by   Changxiao Cai, et al.

This paper is concerned with estimating the column space of an unknown low-rank matrix A^⋆ ∈ R^{d_1 × d_2}, given noisy and partial observations of its entries. There is no shortage of scenarios where the observations, while being too noisy to support faithful recovery of the entire matrix, still convey sufficient information to enable reliable estimation of the column space of interest. This is particularly evident and crucial for the highly unbalanced case where the column dimension d_2 far exceeds the row dimension d_1, which is the focal point of the current paper. We investigate an efficient spectral method, which operates upon the sample Gram matrix with diagonal deletion. We establish statistical guarantees for this method in terms of both ℓ_2 and ℓ_2,∞ estimation accuracy, which improve upon prior results if d_2 is substantially larger than d_1. To illustrate the effectiveness of our findings, we develop consequences of our general theory for three applications of practical importance: (1) tensor completion from noisy data, (2) covariance estimation with missing data, and (3) community recovery in bipartite graphs. Our theory leads to improved performance guarantees for all three cases.
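
As a rough illustration of the spectral method described above, the sketch below forms the sample Gram matrix from the observed entries, deletes its diagonal, and takes the leading eigenvectors as the column-space estimate. It is a minimal sketch under assumptions not stated in the abstract: entries are observed uniformly at random, the sampling rate is estimated from the mask, and the target rank r is known. The function names (estimate_column_subspace, subspace_distance, two_to_infinity_error) are hypothetical and chosen only for this example.

```python
import numpy as np


def estimate_column_subspace(Y, mask, r):
    """Top-r eigenvectors of the diagonal-deleted sample Gram matrix.

    Y    : (d1, d2) array of noisy observations (values where mask is False are ignored)
    mask : (d1, d2) boolean array, True where an entry was observed
    r    : assumed rank of the underlying matrix
    """
    p_hat = mask.mean()                       # assumed: uniform sampling rate
    Y0 = np.where(mask, Y, 0.0) / p_hat       # zero-fill and rescale observed entries
    G = Y0 @ Y0.T                             # (d1, d1) sample Gram matrix
    np.fill_diagonal(G, 0.0)                  # diagonal deletion
    _, eigvecs = np.linalg.eigh(G)            # eigenvalues in ascending order
    return eigvecs[:, -r:]                    # eigenvectors of the r largest eigenvalues


def subspace_distance(U_hat, U):
    """Spectral-norm distance between the two projection matrices."""
    return np.linalg.norm(U_hat @ U_hat.T - U @ U.T, 2)


def two_to_infinity_error(U_hat, U):
    """Largest row-wise l2 error after aligning U_hat to U by an orthogonal rotation."""
    X, _, Zt = np.linalg.svd(U_hat.T @ U)     # orthogonal Procrustes alignment
    R = X @ Zt
    return np.max(np.linalg.norm(U_hat @ R - U, axis=1))
```

A typical use would draw a wide matrix (d_2 much larger than d_1), mask most of its entries, and compare the estimate against the true column space with both error measures. Deleting the diagonal is the step that distinguishes this from a plain eigendecomposition of the Gram matrix: the diagonal entries sum squared noisy observations and are therefore the ones most inflated by noise and missing data.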


