Uncertainty-Aware Principal Component Analysis

05/03/2019
by Jochen Görtler, et al.

We present a technique to perform dimensionality reduction on data that is subject to uncertainty. Our method is a generalization of traditional principal component analysis (PCA) to multivariate probability distributions. In comparison to non-linear methods, linear dimensionality reduction techniques have the advantage that the characteristics of such probability distributions remain intact after projection. We derive a representation of the covariance matrix that respects potential uncertainty in each of the observations, building the mathematical foundation of our new method, uncertainty-aware PCA. In addition to the accuracy and performance gained by our approach over sampling-based strategies, our formulation allows us to perform sensitivity analysis with regard to the uncertainty in the data. For this purpose, we propose factor traces as a novel visualization that enables us to better understand the influence of uncertainty on the chosen principal components. We provide multiple examples of our technique using real-world datasets and show how to propagate multivariate normal distributions through PCA in closed form. Furthermore, we discuss extensions and limitations of our approach.
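To make the idea of a covariance matrix that accounts for per-observation uncertainty more concrete, the following is a minimal sketch and not the authors' implementation. It assumes each observation is given by a mean vector and a covariance matrix, treats the data as an equal-weight mixture, and combines them with the law of total covariance. It also uses the standard fact that a linear map sends a multivariate normal to another multivariate normal. The function names `uncertain_covariance`, `uncertain_pca`, and `project_normal` are hypothetical and chosen only for illustration.

```python
import numpy as np

def uncertain_covariance(means, covs):
    """Covariance of an equal-weight mixture (law of total covariance).

    Assumption for this sketch: observation i is a distribution with
    mean means[i] (shape d) and covariance covs[i] (shape d x d).
    """
    means = np.asarray(means, dtype=float)   # shape (n, d)
    covs = np.asarray(covs, dtype=float)     # shape (n, d, d)
    centered = means - means.mean(axis=0)
    between = centered.T @ centered / len(means)  # spread of the means
    within = covs.mean(axis=0)                    # average uncertainty
    return between + within

def uncertain_pca(means, covs, k):
    """Return the k leading eigenvectors (columns) of the total covariance."""
    total = uncertain_covariance(means, covs)
    eigvals, eigvecs = np.linalg.eigh(total)      # eigenvalues ascending
    order = np.argsort(eigvals)[::-1][:k]         # pick the k largest
    return eigvecs[:, order]                      # shape (d, k)

def project_normal(mean, cov, components, center):
    """Project N(mean, cov) onto the components; the result is again normal."""
    proj_mean = components.T @ (mean - center)
    proj_cov = components.T @ cov @ components
    return proj_mean, proj_cov

# Illustrative data: means spread along x, large uncertainty along y.
means = np.array([[0.0, 0.0], [2.0, 0.0], [4.0, 0.0]])
covs = np.array([np.diag([0.1, 10.0])] * 3)

components = uncertain_pca(means, covs, k=1)
proj_mean, proj_cov = project_normal(means[0], covs[0], components,
                                     means.mean(axis=0))
print(components.ravel(), proj_mean, proj_cov)
```

In this toy setup, regular PCA on the means alone would pick the x-axis, whereas the uncertainty along y dominates the total covariance, so the leading component in the sketch aligns with the y-axis. Because the projection is linear, each observation remains a normal distribution after projection and no sampling is required.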

research
11/06/2022

Cauchy robust principal component analysis with applications to high-dimensional data sets

Principal component analysis (PCA) is a standard dimensionality reductio...
research
06/07/2023

Yet Another Algorithm for Supervised Principal Component Analysis: Supervised Linear Centroid-Encoder

We propose a new supervised dimensionality reduction technique called Su...
research
02/09/2022

Non-Linear Spectral Dimensionality Reduction Under Uncertainty

In this paper, we consider the problem of non-linear dimensionality redu...
research
02/17/2017

Maximally Correlated Principal Component Analysis

In the era of big data, reducing data dimensionality is critical in many...
research
03/08/2017

Exact Dimensionality Selection for Bayesian PCA

We present a Bayesian model selection approach to estimate the intrinsic...
research
06/20/2020

Estimating Model Uncertainty of Neural Networks in Sparse Information Form

We present a sparse representation of model uncertainty for Deep Neural ...
research
07/12/2022

Understanding High Dimensional Spaces through Visual Means Employing Multidimensional Projections

Data visualisation helps understanding data represented by multiple vari...
