Large-Scale Paralleled Sparse Principal Component Analysis

12/21/2013
by   W. Liu, et al.
0

Principal component analysis (PCA) is a statistical technique commonly used in multivariate data analysis. However, PCA can be difficult to interpret and explain since the principal components (PCs) are linear combinations of the original variables. Sparse PCA (SPCA) aims to balance statistical fidelity and interpretability by approximating sparse PCs whose projections capture the maximal variance of original data. In this paper we present an efficient and paralleled method of SPCA using graphics processing units (GPUs), which can process large blocks of data in parallel. Specifically, we construct parallel implementations of the four optimization formulations of the generalized power method of SPCA (GP-SPCA), one of the most efficient and effective SPCA approaches, on a GPU. The parallel GPU implementation of GP-SPCA (using CUBLAS) is up to eleven times faster than the corresponding CPU implementation (using CBLAS), and up to 107 times faster than a MatLab implementation. Extensive comparative experiments in several real-world datasets confirm that SPCA offers a practical advantage.

READ FULL TEXT

page 6

page 7

page 9

page 11

page 12

research
02/18/2014

High Dimensional Semiparametric Scale-Invariant Principal Component Analysis

We propose a new high dimensional semiparametric principal component ana...
research
12/17/2012

Alternating Maximization: Unifying Framework for 8 Sparse PCA Formulations and Efficient Parallel Codes

Given a multivariate data set, sparse principal component analysis (SPCA...
research
05/11/2020

Solving Large-Scale Sparse PCA to Certifiable (Near) Optimality

Sparse principal component analysis (PCA) is a popular dimensionality re...
research
07/04/2018

Massively-Parallel Break Detection for Satellite Data

The field of remote sensing is nowadays faced with huge amounts of data....
research
12/31/2022

A Study on a User-Controlled Radial Tour for Variable Importance in High-Dimensional Data

Principal component analysis is a long-standing go-to method for explori...
research
11/13/2019

Fast Approximate Time-Delay Estimation in Ultrasound Elastography Using Principal Component Analysis

Time delay estimation (TDE) is a critical and challenging step in all ul...
research
10/16/2018

Fast Randomized PCA for Sparse Data

Principal component analysis (PCA) is widely used for dimension reductio...

Please sign up or login with your details

Forgot password? Click here to reset