Integrated Multi-omics Analysis Using Variational Autoencoders: Application to Pan-cancer Classification

08/17/2019
by   Xiaoyu Zhang, et al.
0

Different aspects of a clinical sample can be revealed by multiple types of omics data. Integrated analysis of multi-omics data provides a comprehensive view of patients, which has the potential to facilitate more accurate clinical decision making. However, omics data are normally high dimensional with large number of molecular features and relatively small number of available samples with clinical labels. The "dimensionality curse" makes it challenging to train a machine learning model using high dimensional omics data like DNA methylation and gene expression profiles. Here we propose an end-to-end deep learning model called OmiVAE to extract low dimensional features and classify samples from multi-omics data. OmiVAE combines the basic structure of variational autoencoders with a classification network to achieve task-oriented feature extraction and multi-class classification. The training procedure of OmiVAE is comprised of an unsupervised phase without the classifier and a supervised phase with the classifier. During the unsupervised phase, a hierarchical cluster structure of samples can be automatically formed without the need for labels. And in the supervised phase, OmiVAE achieved an average classification accuracy of 97.49 normal samples, which shows better performance than other existing methods. The OmiVAE model learned from multi-omics data outperformed that using only one type of omics data, which indicates that the complementary information from different omics datatypes provides useful insights for biomedical tasks like cancer classification.

READ FULL TEXT
research
02/03/2022

SubOmiEmbed: Self-supervised Representation Learning of Multi-omics Data for Cancer Type Classification

For personalized medicines, very crucial intrinsic information is presen...
research
02/03/2021

OmiEmbed: reconstruct comprehensive phenotypic information from multi-omics data using multi-task deep learning

High-dimensional omics data contains intrinsic biomedical information th...
research
06/09/2023

Contrastive Learning for Predicting Cancer Prognosis Using Gene Expression Values

Several artificial neural networks (ANNs) have recently been developed a...
research
05/26/2021

XOmiVAE: an interpretable deep learning model for cancer classification using high-dimensional omics data

Deep learning based approaches have proven promising to model omics data...
research
11/20/2019

Learning Embeddings from Cancer Mutation Sets for Classification Tasks

Analysis of somatic mutation profiles from cancer patients is essential ...
research
12/04/2019

Mining Domain Knowledge: Improved Framework towards Automatically Standardizing Anatomical Structure Nomenclature in Radiotherapy

Automatically standardizing nomenclature for anatomical structures in ra...
research
08/07/2018

Inferring Molecular Pathology and micro-RNA Transcriptome from mRNA Profiles of Cancer Biopsies through Deep Multi-Task Learning

Despite great advances, molecular cancer pathology is often limited to u...

Please sign up or login with your details

Forgot password? Click here to reset