Deflated HeteroPCA: Overcoming the curse of ill-conditioning in heteroskedastic PCA

03/10/2023
by Yuchen Zhou, et al.

This paper is concerned with estimating the column subspace of a low-rank matrix X^⋆ ∈ ℝ^{n_1×n_2} from contaminated data. Obtaining optimal statistical accuracy while accommodating the widest range of signal-to-noise ratios (SNRs) becomes particularly challenging in the presence of heteroskedastic noise and unbalanced dimensionality (i.e., n_2 ≫ n_1). While the state-of-the-art algorithm HeteroPCA emerges as a powerful solution to this problem, it suffers from "the curse of ill-conditioning," namely, its performance degrades as the condition number of X^⋆ grows. To overcome this critical issue without compromising the range of allowable SNRs, we propose a novel algorithm, called Deflated HeteroPCA, that achieves near-optimal and condition-number-free theoretical guarantees in terms of both ℓ_2 and ℓ_{2,∞} statistical accuracy. The proposed algorithm divides the spectrum of X^⋆ into well-conditioned and mutually well-separated subblocks, and applies HeteroPCA to conquer each subblock successively. Further, applying our algorithm and theory to two canonical examples – the factor model and tensor PCA – leads to remarkable improvement in each application.
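
The divide-and-conquer idea in the abstract can be illustrated with a rough NumPy sketch. The abstract does not spell out the procedure, so everything below is an assumption made for illustration: HeteroPCA is taken in its commonly described form (delete the diagonal of the Gram matrix Y Yᵀ, then repeatedly re-impute it from a low-rank fit), and the block-selection rule (the hypothetical `next_rank`, with a condition-number cap `cond_cap` and a relative-gap test) is a simplified stand-in, not necessarily the rule analyzed in the paper. The function names are ours.

```python
import numpy as np

def top_r(G, r):
    """Top-r singular vectors of G and its best rank-r approximation."""
    U, s, Vt = np.linalg.svd(G)
    return U[:, :r], (U[:, :r] * s[:r]) @ Vt[:r]

def hetero_pca(G, r, n_iter=50):
    """Iteratively re-impute the diagonal of G from its rank-r fit."""
    G = G.copy()
    for _ in range(n_iter):
        _, low_rank = top_r(G, r)
        np.fill_diagonal(G, np.diag(low_rank))  # off-diagonal entries stay fixed
    U, _ = top_r(G, r)
    return U, G

def next_rank(s, r_prev, r, cond_cap=4.0):
    """Hypothetical rule: largest rank q in (r_prev, r] whose new block is
    well-conditioned (ratio <= cond_cap) and followed by a clear spectral gap."""
    ok = [q for q in range(r_prev + 1, r + 1)
          if q < len(s) and s[q - 1] > 0
          and s[r_prev] / s[q - 1] <= cond_cap
          and s[q - 1] - s[q] >= s[q - 1] / r]
    return max(ok) if ok else r  # fall back to the full target rank

def deflated_hetero_pca(Y, r, n_iter=50, cond_cap=4.0):
    """Grow the working rank block by block, warm-starting each stage."""
    Y = np.asarray(Y, dtype=float)
    G = Y @ Y.T
    np.fill_diagonal(G, 0.0)  # the diagonal is where heteroskedastic noise bias sits
    ranks, r_k, U = [], 0, None
    while r_k < r:
        s = np.linalg.svd(G, compute_uv=False)
        r_k = next_rank(s, r_k, r, cond_cap)
        ranks.append(r_k)
        U, G = hetero_pca(G, r_k, n_iter)  # reuse the imputed diagonal next stage
    return U, ranks
```

Calling `deflated_hetero_pca(Y, r)` on an n_1 × n_2 data matrix returns an n_1 × r orthonormal basis estimate together with the sequence of intermediate ranks. Each stage fits only a block of singular values that are close to one another, which is the intuition behind avoiding a dependence on the overall condition number.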


research
02/02/2023

The Power of Preconditioning in Overparameterized Low-Rank Matrix Sensing

We propose a preconditioned gradient descent method to tackle the low-...
research
01/15/2020

Bridging Convex and Nonconvex Optimization in Robust PCA: Noise, Outliers, and Missing Data

This paper delivers improved theoretical guarantees for the convex progr...
research
10/26/2020

Low-Rank Matrix Recovery with Scaled Subgradient Methods: Fast and Robust Convergence Without the Condition Number

Many problems in data science can be treated as estimating a low-rank ma...
research
04/29/2021

Scaling and Scalability: Provable Nonconvex Low-Rank Tensor Estimation from Incomplete Measurements

Tensors, which provide a powerful and flexible model for representing mu...
research
08/04/2020

Well-Conditioned Methods for Ill-Conditioned Systems: Linear Regression with Semi-Random Noise

Classical iterative algorithms for linear system solving and regression ...
research
10/09/2019

Subspace Estimation from Unbalanced and Incomplete Data Matrices: ℓ_{2,∞} Statistical Guarantees

This paper is concerned with estimating the column space of an unknown l...
research
04/09/2020

Well conditioned ptychographic imaging via lost subspace completion

Ptychography, a special case of the phase retrieval problem, is a popula...