All Sparse PCA Models Are Wrong, But Some Are Useful. Part I: Computation of Scores, Residuals and Explained Variance

07/09/2019
by   J. Camacho, et al.
0

Sparse Principal Component Analysis (sPCA) is a popular matrix factorization approach based on Principal Component Analysis (PCA) that combines variance maximization and sparsity with the ultimate goal of improving data interpretation. When moving from PCA to sPCA, there are a number of implications that the practitioner needs to be aware of. A relevant one is that scores and loadings in sPCA may not be orthogonal. For this reason, the traditional way of computing scores, residuals and variance explained that is used in the classical PCA cannot directly be applied to sPCA models. This also affects how sPCA components should be visualized. In this paper we illustrate this problem both theoretically and numerically using simulations for several state-of-the-art sPCA algorithms, and provide proper computation of the different elements mentioned. We show that sPCA approaches present disparate and limited performance when modeling noise-free, sparse data. In a follow-up paper, we discuss the theoretical properties that lead to this problem.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/01/2020

A New Basis for Sparse PCA

The statistical and computational performance of sparse principal compon...
research
05/01/2017

Group-sparse block PCA and explained variance

The paper addresses the simultneous determination of goup-sparse loading...
research
11/10/2020

Supervised PCA: A Multiobjective Approach

Methods for supervised principal component analysis (SPCA) aim to incorp...
research
10/12/2022

Sparse PCA: a Geometric Approach

We consider the problem of maximizing the variance explained from a data...
research
10/26/2012

Large-Scale Sparse Principal Component Analysis with Application to Text Data

Sparse PCA provides a linear combination of small number of features tha...
research
03/09/2023

Revisiting the relevance of traditional genres: a network analysis of fiction readers' preferences

We investigate how well traditional fiction genres like Fantasy, Thrille...
research
10/01/2020

EigenGame: PCA as a Nash Equilibrium

We present a novel view on principal component analysis (PCA) as a compe...

Please sign up or login with your details

Forgot password? Click here to reset