Gradient-based Sparse Principal Component Analysis with Extensions to Online Learning

11/19/2019
by   Yixuan Qiu, et al.
14

Sparse principal component analysis (PCA) is an important technique for dimensionality reduction of high-dimensional data. However, most existing sparse PCA algorithms are based on non-convex optimization, which provide little guarantee on the global convergence. Sparse PCA algorithms based on a convex formulation, for example the Fantope projection and selection (FPS), overcome this difficulty, but are computationally expensive. In this work we study sparse PCA based on the convex FPS formulation, and propose a new algorithm that is computationally efficient and applicable to large and high-dimensional data sets. Nonasymptotic and explicit bounds are derived for both the optimization error and the statistical accuracy, which can be used for testing and inference problems. We also extend our algorithm to online learning problems, where data are obtained in a streaming fashion. The proposed algorithm is applied to high-dimensional gene expression data for the detection of functional gene groups.

READ FULL TEXT
research
01/30/2021

Spike and slab Bayesian sparse principal component analysis

Sparse principal component analysis (PCA) is a popular tool for dimensio...
research
04/20/2019

High Dimensional Process Monitoring Using Robust Sparse Probabilistic Principal Component Analysis

High dimensional data has introduced challenges that are difficult to ad...
research
11/25/2018

Sparse PCA from Sparse Linear Regression

Sparse Principal Component Analysis (SPCA) and Sparse Linear Regression ...
research
10/21/2015

Dimensionality Reduction for Binary Data through the Projection of Natural Parameters

Principal component analysis (PCA) for binary data, known as logistic PC...
research
03/27/2019

An Alternating Manifold Proximal Gradient Method for Sparse PCA and Sparse CCA

Sparse principal component analysis (PCA) and sparse canonical correlati...
research
09/29/2022

Automatic sparse PCA for high-dimensional data

Sparse principal component analysis (SPCA) methods have proven to effici...
research
12/29/2022

Theoretical Guarantees for Sparse Principal Component Analysis based on the Elastic Net

Sparse principal component analysis (SPCA) is widely used for dimensiona...

Please sign up or login with your details

Forgot password? Click here to reset