A New Basis for Sparse PCA

07/01/2020
by   Fan Chen, et al.
17

The statistical and computational performance of sparse principal component analysis (PCA) can be dramatically improved when the principal components are allowed to be sparse in a rotated eigenbasis. For this, we propose a new method for sparse PCA. In the simplest version of the algorithm, the component scores and loadings are initialized with a low-rank singular value decomposition. Then, the singular vectors are rotated with orthogonal rotations to make them approximately sparse. Finally, soft-thresholding is applied to the rotated singular vectors. This approach differs from prior approaches because it uses an orthogonal rotation to approximate a sparse basis. Our sparse PCA framework is versatile; for example, it extends naturally to the two-way analysis of a data matrix for simultaneous dimensionality reduction of rows and columns. We identify the close relationship between sparse PCA and independent component analysis for separating sparse signals. We provide empirical evidence showing that for the same level of sparsity, the proposed sparse PCA method is more stable and can explain more variance compared to alternative methods. Through three applications—sparse coding of images, analysis of transcriptome sequencing data, and large-scale clustering of Twitter accounts, we demonstrate the usefulness of sparse PCA in exploring modern multivariate data.

READ FULL TEXT

page 21

page 24

page 29

research
03/06/2014

Sparse Principal Component Analysis via Rotation and Truncation

Sparse principal component analysis (sparse PCA) aims at finding a spars...
research
07/09/2019

All Sparse PCA Models Are Wrong, But Some Are Useful. Part I: Computation of Scores, Residuals and Explained Variance

Sparse Principal Component Analysis (sPCA) is a popular matrix factoriza...
research
12/08/2020

Sparse Correspondence Analysis for Contingency Tables

Since the introduction of the lasso in regression, various sparse method...
research
04/26/2017

SOFAR: large-scale association network learning

Many modern big data applications feature large scale in both numbers of...
research
05/04/2015

Self-Expressive Decompositions for Matrix Approximation and Clustering

Data-aware methods for dimensionality reduction and matrix decomposition...
research
11/30/2012

A recursive divide-and-conquer approach for sparse principal component analysis

In this paper, a new method is proposed for sparse PCA based on the recu...
research
07/25/2023

Do algorithms and barriers for sparse principal component analysis extend to other structured settings?

We study a principal component analysis problem under the spiked Wishart...

Please sign up or login with your details

Forgot password? Click here to reset