PCA matrix denoising is uniform

06/22/2023
by   Xin T. Tong, et al.
0

Principal component analysis (PCA) is a simple and popular tool for processing high-dimensional data. We investigate its effectiveness for matrix denoising. We assume i.i.d. high dimensional Gaussian noises with standard deviation σ are added to clean data generated from a low dimensional subspace. We show that the distance between each pair of PCA-denoised data point and the clean data point is uniformly bounded by (σ), assuming a low-rank data matrix with mild singular value assumptions. We show such a condition could arise even if the data lies on curves. We then provide a general lower bound for the error of the denoised data matrix, which indicates PCA denoising gives a uniform error bound that is rate-optimal. Furthermore, we examine how the error bound impacts downstream applications such as empirical risk minimization, clustering, and manifold learning. Numerical results validate our theoretical findings and reveal the importance of the uniform error.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/16/2016

Near-Optimal Stochastic Approximation for Online Principal Component Estimation

Principal component analysis (PCA) has been a prominent tool for high-di...
research
07/06/2023

ALPCAH: Sample-wise Heteroscedastic PCA with Tail Singular Value Regularization

Principal component analysis (PCA) is a key tool in the field of data di...
research
02/08/2022

Entrywise Recovery Guarantees for Sparse PCA via Sparsistent Algorithms

Sparse Principal Component Analysis (PCA) is a prevalent tool across a p...
research
05/18/2023

High-dimensional Asymptotics of Denoising Autoencoders

We address the problem of denoising data from a Gaussian mixture using a...
research
10/21/2015

Dimensionality Reduction for Binary Data through the Projection of Natural Parameters

Principal component analysis (PCA) for binary data, known as logistic PC...
research
10/11/2021

Learned Robust PCA: A Scalable Deep Unfolding Approach for High-Dimensional Outlier Detection

Robust principal component analysis (RPCA) is a critical tool in modern ...
research
06/26/2019

Principal Component Analysis for Multivariate Extremes

The first order behavior of multivariate heavy-tailed random vectors abo...

Please sign up or login with your details

Forgot password? Click here to reset