Robust factored principal component analysis for matrix-valued outlier accommodation and detection

12/13/2021
by   Xuan Ma, et al.
Principal component analysis (PCA) is a popular dimension reduction technique for vector data. Factored PCA (FPCA) is a probabilistic extension of PCA for matrix data, which can substantially reduce the number of parameters in PCA while still yielding satisfactory performance. However, FPCA is based on the Gaussian assumption and is therefore susceptible to outliers. Although the multivariate t distribution has a long history as a robust modeling tool for vector data, its application to matrix data has been very limited. The main reason is that the dimension of the vectorized matrix data is often very high, and the higher the dimension, the lower the breakdown point, which measures robustness. To solve the robustness problem suffered by FPCA and make it applicable to matrix data, in this paper we propose a robust extension of FPCA (RFPCA), which is built upon a t-type distribution called the matrix-variate t distribution. Like the multivariate t distribution, the matrix-variate t distribution can adaptively down-weight outliers and yield robust estimates. We develop a fast EM-type algorithm for parameter estimation. Experiments on synthetic and real-world datasets show that RFPCA compares favorably with several related methods and is a simple but powerful tool for matrix-valued outlier detection.
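
As a rough illustration of the adaptive down-weighting mentioned in the abstract, the sketch below computes the standard t-distribution weight (nu + p) / (nu + delta) for each matrix observation. It is not taken from the paper: it assumes the data are vectorized with a separable (Kronecker) covariance built from a column covariance `Sigma_c` and a row covariance `Sigma_r`, and the function name `t_weights` and all of its parameters are hypothetical. The paper's own EM updates may differ in detail.

```python
import numpy as np

def t_weights(X, M, Sigma_c, Sigma_r, nu):
    """Hypothetical helper: t-type weights for matrix observations.

    X       : array of shape (n, dc, dr), one dc-by-dr matrix per sample
    M       : array of shape (dc, dr), the mean matrix
    Sigma_c : (dc, dc) column covariance (assumed positive definite)
    Sigma_r : (dr, dr) row covariance (assumed positive definite)
    nu      : degrees of freedom (> 0)
    """
    n, dc, dr = X.shape
    R = X - M  # residual matrices, shape (n, dc, dr)
    Sc_inv = np.linalg.inv(Sigma_c)
    Sr_inv = np.linalg.inv(Sigma_r)
    # Squared Mahalanobis distance under the separable covariance:
    # delta_n = tr(Sigma_c^{-1} R_n Sigma_r^{-1} R_n^T)
    delta = np.einsum("ik,nkj,jl,nil->n", Sc_inv, R, Sr_inv, R)
    # Standard multivariate t weight with p = dc * dr dimensions:
    # large distances give small weights.
    return (nu + dc * dr) / (nu + delta)

# Usage: two typical samples and one gross outlier.
X = np.zeros((3, 2, 2))
X[2] = 10.0
w = t_weights(X, np.zeros((2, 2)), np.eye(2), np.eye(2), nu=3.0)
print(w)  # the last weight is much smaller than the first two
```

In an EM-type fit, weights of this kind would typically rescale each sample's contribution to the mean and covariance updates, and unusually small weights can serve as a simple outlier flag.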

research
04/13/2019

Self-Paced Probabilistic Principal Component Analysis for Data with Outliers

Principal Component Analysis (PCA) is a popular tool for dimensionality ...
research
05/07/2019

On Weighted Multivariate Sign Functions

Multivariate sign functions are often used for robust estimation and inf...
research
11/19/2021

Gaussian Determinantal Processes: a new model for directionality in data

Determinantal point processes (a.k.a. DPPs) have recently become popular...
research
01/31/2019

Phase Transition in the Recovery of Rank One Matrices Corrupted by Gaussian Noise

In datasets where the number of parameters is fixed and the number of sa...
research
11/22/2019

2SDR: Applying Kronecker Envelope PCA to denoise Cryo-EM Images

Principal component analysis (PCA) is arguably the most widely used dime...
research
11/25/2019

Matrix Normal PCA for Interpretable Dimension Reduction and Graphical Noise Modeling

Principal component analysis (PCA) is one of the most widely used dimens...
