Cancer Subtyping by Improved Transcriptomic Features Using Vector Quantized Variational Autoencoder

07/20/2022
by   Zheng Chen, et al.
0

Defining and separating cancer subtypes is essential for facilitating personalized therapy modality and prognosis of patients. The definition of subtypes has been constantly recalibrated as a result of our deepened understanding. During this recalibration, researchers often rely on clustering of cancer data to provide an intuitive visual reference that could reveal the intrinsic characteristics of subtypes. The data being clustered are often omics data such as transcriptomics that have strong correlations to the underlying biological mechanism. However, while existing studies have shown promising results, they suffer from issues associated with omics data: sample scarcity and high dimensionality. As such, existing methods often impose unrealistic assumptions to extract useful features from the data while avoiding overfitting to spurious correlations. In this paper, we propose to leverage a recent strong generative model, Vector Quantized Variational AutoEncoder (VQ-VAE), to tackle the data issues and extract informative latent features that are crucial to the quality of subsequent clustering by retaining only information relevant to reconstructing the input. VQ-VAE does not impose strict assumptions and hence its latent features are better representations of the input, capable of yielding superior clustering performance with any mainstream clustering method. Extensive experiments and medical analysis on multiple datasets comprising 10 distinct cancers demonstrate the VQ-VAE clustering results can significantly and robustly improve prognosis over prevalent subtyping systems.

READ FULL TEXT

page 1

page 3

page 5

page 7

page 8

page 10

research
09/24/2019

Supervised Vector Quantized Variational Autoencoder for Learning Interpretable Global Representations

Learning interpretable representations of data remains a central challen...
research
04/03/2020

Epitomic Variational Graph Autoencoder

Variational autoencoder (VAE) is a widely used generative model for unsu...
research
05/16/2022

SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization

One noted issue of vector-quantized variational autoencoder (VQ-VAE) is ...
research
03/23/2023

Clinically Relevant Latent Space Embedding of Cancer Histopathology Slides through Variational Autoencoder Based Image Compression

In this paper, we introduce a Variational Autoencoder (VAE) based traini...
research
02/10/2021

Dynamic β-VAEs for quantifying biodiversity by clustering optically recorded insect signals

While insects are the largest and most diverse group of animals, constit...
research
07/11/2019

retina-VAE: Variationally Decoding the Spectrum of Macular Disease

In this paper, we seek a clinically-relevant latent code for representin...
research
04/02/2022

Cancer Subtyping via Embedded Unsupervised Learning on Transcriptomics Data

Cancer is one of the deadliest diseases worldwide. Accurate diagnosis an...

Please sign up or login with your details

Forgot password? Click here to reset