The Power of Preconditioning in Overparameterized Low-Rank Matrix Sensing

02/02/2023
by   Xingyu Xu, et al.
0

We propose , a preconditioned gradient descent method to tackle the low-rank matrix sensing problem when the true rank is unknown, and when the matrix is possibly ill-conditioned. Using overparametrized factor representations, starts from a small random initialization, and proceeds by gradient descent with a specific form of damped preconditioning to combat bad curvatures induced by overparameterization and ill-conditioning. At the expense of light computational overhead incurred by preconditioners, is remarkably robust to ill-conditioning compared to vanilla gradient descent () even with overprameterization. Specifically, we show that, under the Gaussian design, converges to the true low-rank matrix at a constant linear rate after a small number of iterations that scales only logarithmically with respect to the condition number and the problem dimension. This significantly improves over the convergence rate of vanilla which suffers from a polynomial dependency on the condition number. Our work provides evidence on the power of preconditioning in accelerating the convergence without hurting generalization in overparameterized learning.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/18/2020

Accelerating Ill-Conditioned Low-Rank Matrix Estimation via Scaled Gradient Descent

Low-rank matrix estimation is a canonical problem that finds numerous ap...
research
03/10/2023

Deflated HeteroPCA: Overcoming the curse of ill-conditioning in heteroskedastic PCA

This paper is concerned with estimating the column subspace of a low-ran...
research
10/26/2020

Low-Rank Matrix Recovery with Scaled Subgradient Methods: Fast and Robust Convergence Without the Condition Number

Many problems in data science can be treated as estimating a low-rank ma...
research
04/22/2019

Low-rank matrix recovery with composite optimization: good conditioning and rapid convergence

The task of recovering a low-rank matrix from its noisy linear measureme...
research
01/27/2021

On the computational and statistical complexity of over-parameterized matrix sensing

We consider solving the low rank matrix sensing problem with Factorized ...
research
03/24/2023

Implicit Balancing and Regularization: Generalization and Convergence Guarantees for Overparameterized Asymmetric Matrix Sensing

Recently, there has been significant progress in understanding the conve...
research
11/15/2022

Stable rank-adaptive Dynamically Orthogonal Runge-Kutta schemes

We develop two new sets of stable, rank-adaptive Dynamically Orthogonal ...

Please sign up or login with your details

Forgot password? Click here to reset