Supervised Discriminative Sparse PCA for Com-Characteristic Gene Selection and Tumor Classification on Multiview Biological Data

05/28/2019
by   Chun-Mei Feng, et al.

Principal Component Analysis (PCA) has been used to study the pathogenesis of diseases. To enhance the interpretability of classical PCA, various improved PCA methods have been proposed. A typical example is sparse PCA, which seeks sparse loadings. However, the performance of these methods is still far from satisfactory because they rely on unsupervised learning; moreover, class ambiguity within the samples is high. To overcome this problem, this study develops a new PCA method named Supervised Discriminative Sparse PCA (SDSPCA). The main innovation of this method is the incorporation of both discriminative information and sparsity into the PCA model. Specifically, in contrast to traditional sparse PCA, which imposes sparsity on the loadings, here sparse components are obtained to represent the data. Furthermore, via a linear transformation, the sparse components approximate the given label information. On the one hand, sparse components improve interpretability over traditional PCA; on the other hand, they have discriminative ability suitable for classification. A simple algorithm is developed and a proof of its convergence is provided. SDSPCA has been applied to common-characteristic (com-characteristic) gene selection and tumor classification on multi-view biological data. The sparsity and classification performance of SDSPCA are verified empirically through extensive experiments, and the results demonstrate that SDSPCA outperforms other state-of-the-art methods.
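The two ideas named in the abstract, sparse components and a linear map from those components to the labels, can be illustrated with a small NumPy sketch. This is not the SDSPCA algorithm, whose objective and update rules are given only in the full text; it is a simplified stand-in that assumes ordinary PCA scores, entrywise soft-thresholding for sparsity, and an ordinary least-squares fit to one-hot labels. The function names, the threshold `t`, and the synthetic two-class data are all hypothetical.

```python
import numpy as np

def soft_threshold(M, t):
    # Shrink every entry toward zero by t; entries with |value| <= t become exactly 0.
    return np.sign(M) * np.maximum(np.abs(M) - t, 0.0)

def toy_sparse_label_fit(X, labels, k=2, t=0.5):
    """Illustrative stand-in, NOT the paper's method.

    X      : (n_samples, n_features) data matrix
    labels : (n_samples,) integer class labels
    k      : number of components to keep (assumed parameter)
    t      : soft-threshold level (assumed parameter)
    """
    # 1. Ordinary PCA scores from the SVD of the centered data.
    Xc = X - X.mean(axis=0)
    U, S, Vt = np.linalg.svd(Xc, full_matrices=False)
    Q = U[:, :k] * S[:k]

    # 2. Make the components (not the loadings) sparse by soft-thresholding.
    Q_sparse = soft_threshold(Q, t)

    # 3. One-hot label matrix B, then a least-squares linear map A with Q_sparse @ A ~ B.
    classes = np.unique(labels)
    B = (labels[:, None] == classes[None, :]).astype(float)
    A, *_ = np.linalg.lstsq(Q_sparse, B, rcond=None)

    # 4. Predict each sample's class from the largest fitted label score.
    pred = classes[np.argmax(Q_sparse @ A, axis=1)]
    return Q_sparse, A, pred

# Synthetic two-class data (hypothetical): class 1 is shifted in the first three features.
rng = np.random.default_rng(0)
labels = np.repeat([0, 1], 20)
X = rng.normal(size=(40, 10))
X[labels == 1, :3] += 5.0

Q_sparse, A, pred = toy_sparse_label_fit(X, labels)
print("zero entries in components:", int(np.sum(Q_sparse == 0)))
print("training accuracy:", float(np.mean(pred == labels)))
```

In this sketch the three steps run one after another, whereas the abstract describes a single model that incorporates sparsity and label information together; the sequential version is only meant to make the roles of the component matrix and the linear transformation concrete.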


research
03/06/2014

Sparse Principal Component Analysis via Rotation and Truncation

Sparse principal component analysis (sparse PCA) aims at finding a spars...
research
05/19/2016

Bayesian Variable Selection for Globally Sparse Probabilistic PCA

Sparse versions of principal component analysis (PCA) have imposed thems...
research
01/21/2019

Dual Graph-Laplacian PCA: A Closed-Form Solution for Bi-clustering to Find "Checkerboard" Structures on Gene Expression Data

In the context of cancer, internal "checkerboard" structures are normall...
research
02/23/2015

Optimal Sparse Linear Auto-Encoders and Sparse PCA

Principal components analysis (PCA) is the optimal linear auto-encoder o...
research
09/06/2016

Structured Sparse Principal Components Analysis with the TV-Elastic Net penalty

Principal component analysis (PCA) is an exploratory tool widely used in...
research
02/23/2015

Rectified Factor Networks

We propose rectified factor networks (RFNs) to efficiently construct ver...
research
01/27/2014

Sparsistency and agnostic inference in sparse PCA

The presence of a sparse "truth" has been a constant assumption in the t...
