Matrix Normal PCA for Interpretable Dimension Reduction and Graphical Noise Modeling

11/25/2019
by   Chihao Zhang, et al.

Principal component analysis (PCA) is one of the most widely used techniques for dimension reduction and multivariate statistical analysis. From a probabilistic perspective, PCA seeks a low-dimensional representation of data in the presence of independent and identically distributed (i.i.d.) Gaussian noise. Probabilistic PCA (PPCA) and its variants have been studied extensively for decades, and most of them assume that the underlying noise is drawn i.i.d. from some distribution. Real-world noise, however, is usually complicated and structured. Some non-linear variants of PPCA have been proposed to address this challenge, but they are generally difficult to interpret. We therefore propose MN-PCA, a powerful and intuitive PCA method that models graphical noise with the matrix normal distribution, which lets us explore the structure of the noise in both the feature space and the sample space. MN-PCA simultaneously obtains a low-rank representation of the data and the structure of the noise, and it can be interpreted as approximating the data under a generalized Mahalanobis distance. We develop two algorithms to fit the model: one maximizes the regularized likelihood, and the other, which is more robust, exploits the Wasserstein distance. Extensive experiments on various data sets demonstrate the effectiveness of both algorithms.
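
To make the generalized Mahalanobis distance mentioned in the abstract concrete, here is a minimal sketch in plain Python. It assumes the common form tr(Ω R Ψ Rᵀ), where R is the residual between the data and its approximation and Ω and Ψ are precision matrices for the rows and columns; this form matches the exponent of a matrix normal density, but the exact parameterization used in the paper may differ. The function names and the convention that rows are samples and columns are features are illustrative assumptions, not taken from the paper.

```python
def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def transpose(a):
    """Transpose a matrix stored as a list of rows."""
    return [list(col) for col in zip(*a)]


def generalized_mahalanobis_sq(x, approx, row_precision, col_precision):
    """Return tr(Omega R Psi R^T) with R = x - approx.

    Assumed shapes (hypothetical convention): x and approx are n x p,
    row_precision (Omega) is n x n, col_precision (Psi) is p x p.
    """
    resid = [[xi - ai for xi, ai in zip(x_row, a_row)]
             for x_row, a_row in zip(x, approx)]
    left = matmul(row_precision, resid)              # n x p
    right = matmul(col_precision, transpose(resid))  # p x n
    product = matmul(left, right)                    # n x n
    return sum(product[i][i] for i in range(len(product)))


def identity(n):
    """Return the n x n identity matrix."""
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]


x = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
zeros = [[0.0, 0.0] for _ in range(3)]

# With identity precisions the distance is the squared Frobenius norm,
# which is the loss that ordinary PCA minimizes.
frobenius_sq = generalized_mahalanobis_sq(x, zeros, identity(3), identity(2))

# Doubling the precision of the first sample weights its residual more.
omega = identity(3)
omega[0][0] = 2.0
weighted = generalized_mahalanobis_sq(x, zeros, omega, identity(2))
```

With identity precisions the sketch recovers the usual least-squares reconstruction error, so structured noise enters only through non-identity Ω and Ψ.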


research
08/24/2020

Torus Probabilistic Principal Component Analysis

One of the most common problems that any technique encounters is the hig...
research
12/13/2021

Robust factored principal component analysis for matrix-valued outlier accommodation and detection

Principal component analysis (PCA) is a popular dimension reduction tech...
research
11/05/2022

Efficient Convex PCA with applications to Wasserstein geodesic PCA and ranked data

Convex PCA, which was introduced by Bigot et al., is a dimension reducti...
research
01/31/2019

Phase Transition in the Recovery of Rank One Matrices Corrupted by Gaussian Noise

In datasets where the number of parameters is fixed and the number of sa...
research
08/22/2018

XPCA: Extending PCA for a Combination of Discrete and Continuous Variables

Principal component analysis (PCA) is arguably the most popular tool in ...
research
07/06/2023

ALPCAH: Sample-wise Heteroscedastic PCA with Tail Singular Value Regularization

Principal component analysis (PCA) is a key tool in the field of data di...
research
10/27/2021

Poisson PCA for matrix count data

We develop a dimension reduction framework for data consisting of matric...
