Robust PCA as Bilinear Decomposition with Outlier-Sparsity Regularization

11/08/2011
by   Gonzalo Mateos, et al.
0

Principal component analysis (PCA) is widely used for dimensionality reduction, with well-documented merits in various applications involving high-dimensional data, including computer vision, preference measurement, and bioinformatics. In this context, the fresh look advocated here permeates benefits from variable selection and compressive sampling, to robustify PCA against outliers. A least-trimmed squares estimator of a low-rank bilinear factor analysis model is shown closely related to that obtained from an ℓ_0-(pseudo)norm-regularized criterion encouraging sparsity in a matrix explicitly modeling the outliers. This connection suggests robust PCA schemes based on convex relaxation, which lead naturally to a family of robust estimators encompassing Huber's optimal M-class as a special case. Outliers are identified by tuning a regularization parameter, which amounts to controlling sparsity of the outlier matrix along the whole robustification path of (group) least-absolute shrinkage and selection operator (Lasso) solutions. Beyond its neat ties to robust statistics, the developed outlier-aware PCA framework is versatile to accommodate novel and scalable algorithms to: i) track the low-rank signal subspace robustly, as new data are acquired in real time; and ii) determine principal components robustly in (possibly) infinite-dimensional feature spaces. Synthetic and real data tests corroborate the effectiveness of the proposed robust PCA schemes, when used to identify aberrant responses in personality assessment surveys, as well as unveil communities in social networks, and intruders from video surveillance data.

READ FULL TEXT

page 29

page 30

research
04/04/2011

Robust Nonparametric Regression via Sparsity Control with Application to Load Curve Data Cleansing

Nonparametric methods are widely applicable to statistical inference pro...
research
04/23/2015

Robust Principal Component Analysis on Graphs

Principal Component Analysis (PCA) is the most widely used tool for line...
research
10/11/2021

Learned Robust PCA: A Scalable Deep Unfolding Approach for High-Dimensional Outlier Detection

Robust principal component analysis (RPCA) is a critical tool in modern ...
research
05/11/2020

Robust PCA via Regularized REAPER with a Matrix-Free Proximal Algorithm

Principal component analysis (PCA) is known to be sensitive to outliers,...
research
11/26/2017

Robust PCA and Robust Subspace Tracking

Principal Components Analysis (PCA) is one of the most widely used dimen...
research
05/17/2017

Maximum Margin Principal Components

Principal Component Analysis (PCA) is a very successful dimensionality r...
research
03/25/2014

Selective Factor Extraction in High Dimensions

This paper studies simultaneous feature selection and extraction in supe...

Please sign up or login with your details

Forgot password? Click here to reset