Towards a Theoretical Analysis of PCA for Heteroscedastic Data

10/12/2016
by David Hong, et al.

Principal Component Analysis (PCA) is a method for estimating a subspace given noisy samples. It is useful in a variety of problems ranging from dimensionality reduction to anomaly detection and the visualization of high-dimensional data. PCA performs well in the presence of moderate noise and even with missing data, but it is sensitive to outliers. PCA is also known to have a phase transition when the noise is independent and identically distributed: recovery of the subspace declines sharply at a threshold noise variance. Effective use of PCA requires a rigorous understanding of these behaviors. This paper provides a step towards an analysis of PCA for samples with heteroscedastic noise, that is, samples that have non-uniform noise variances and so are no longer identically distributed. In particular, we provide a simple asymptotic prediction of the recovery of a one-dimensional subspace from noisy heteroscedastic samples. The prediction enables: a) easy and efficient calculation of the asymptotic performance, and b) qualitative reasoning to understand how PCA is impacted by heteroscedasticity (such as outliers).
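To make the setting concrete, the following is a minimal simulation sketch of recovering a one-dimensional subspace from samples with per-sample noise variances. It does not implement the paper's asymptotic prediction; it only measures recovery empirically. The data model (a planted unit vector `u` scaled by `theta`, plus Gaussian noise), the function name `pca_recovery`, and all parameter values are assumptions chosen for illustration.

```python
import numpy as np


def pca_recovery(noise_vars, d=200, theta=1.0, seed=0):
    """Empirical recovery |<u_hat, u>|^2 of a planted 1-D subspace.

    Assumed model (illustrative, not taken from the paper):
        y_i = theta * u * z_i + sqrt(noise_vars[i]) * eps_i,
    with z_i ~ N(0, 1) and eps_i ~ N(0, I_d).
    """
    rng = np.random.default_rng(seed)
    noise_vars = np.asarray(noise_vars, dtype=float)
    n = noise_vars.size

    # Planted unit vector spanning the true subspace.
    u = np.zeros(d)
    u[0] = 1.0

    # One column per sample; column i is scaled by its own noise std.
    z = rng.standard_normal(n)
    noise = rng.standard_normal((d, n)) * np.sqrt(noise_vars)
    Y = theta * np.outer(u, z) + noise

    # The top left singular vector of Y is the first principal component
    # (uncentered, since the model has zero mean).
    U, _, _ = np.linalg.svd(Y, full_matrices=False)
    u_hat = U[:, 0]

    # Squared inner product: 1 means perfect recovery, 0 means orthogonal.
    return float(np.dot(u_hat, u) ** 2)


if __name__ == "__main__":
    n = 1000
    # Homoscedastic: every sample has the same noise variance.
    homo = np.full(n, 1.0)
    # Heteroscedastic: 80% clean samples, 20% noisy, same average variance.
    hetero = np.concatenate([np.full(800, 0.1), np.full(200, 4.6)])
    print("homoscedastic recovery:  ", pca_recovery(homo))
    print("heteroscedastic recovery:", pca_recovery(hetero))
```

Both noise profiles in the usage example have an average variance of 1.0, so any difference in the printed recovery reflects how the variance is spread across samples rather than the total noise level. Averaging over several seeds would give a steadier comparison than a single run.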


