Subject clustering by IF-PCA and several recent methods

06/08/2023
by   Dieyi Chen, et al.
0

Subject clustering (i.e., the use of measured features to cluster subjects, such as patients or cells, into multiple groups) is a problem of great interest. In recent years, many approaches were proposed, among which unsupervised deep learning (UDL) has received a great deal of attention. Two interesting questions are (a) how to combine the strengths of UDL and other approaches, and (b) how these approaches compare to one other. We combine Variational Auto-Encoder (VAE), a popular UDL approach, with the recent idea of Influential Feature PCA (IF-PCA), and propose IF-VAE as a new method for subject clustering. We study IF-VAE and compare it with several other methods (including IF-PCA, VAE, Seurat, and SC3) on 10 gene microarray data sets and 8 single-cell RNA-seq data sets. We find that IF-VAE significantly improves over VAE, but still underperforms IF-PCA. We also find that IF-PCA is quite competitive, which slightly outperforms Seurat and SC3 over the 8 single-cell data sets. IF-PCA is conceptually simple and permits delicate analysis. We demonstrate that IF-PCA is capable of achieving the phase transition in a Rare/Weak model. Comparatively, Seurat and SC3 are more complex and theoretically difficult to analyze (for these reasons, their optimality remains unclear).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/22/2022

Compressibility: Power of PCA in Clustering Problems Beyond Dimensionality Reduction

In this paper we take a step towards understanding the impact of princip...
research
02/24/2015

Phase Transitions for High Dimensional Clustering and Related Problems

Consider a two-class clustering problem where we observe X_i = ℓ_i μ + Z...
research
12/17/2018

Variational Autoencoders Pursue PCA Directions (by Accident)

The Variational Autoencoder (VAE) is a powerful architecture capable of ...
research
12/23/2019

A Compared Study Between Some Subspace Based Algorithms

The technology of face recognition has made some progress in recent year...
research
08/29/2023

Target PCA: Transfer Learning Large Dimensional Panel Data

This paper develops a novel method to estimate a latent factor model for...
research
11/07/2022

Learning Causal Representations of Single Cells via Sparse Mechanism Shift Modeling

Latent variable models such as the Variational Auto-Encoder (VAE) have b...

Please sign up or login with your details

Forgot password? Click here to reset