OmiEmbed: reconstruct comprehensive phenotypic information from multi-omics data using multi-task deep learning

02/03/2021
by Xiaoyu Zhang, et al.

High-dimensional omics data contains intrinsic biomedical information that is crucial for personalised medicine. Nevertheless, this information is difficult to capture from genome-wide data because of the large number of molecular features and the small number of available samples, a problem known in machine learning as "the curse of dimensionality". To tackle this problem and pave the way for machine-learning-aided precision medicine, we proposed a unified multi-task deep learning framework called OmiEmbed to capture a holistic and relatively precise phenotype profile from high-dimensional omics data. The deep embedding module of OmiEmbed learnt an omics embedding that mapped multiple omics data types into a latent space of lower dimensionality. Based on this new representation of the multi-omics data, the different downstream networks of OmiEmbed were trained together with the multi-task strategy to predict the comprehensive phenotype profile of each sample. We trained the model on two publicly available omics datasets to evaluate the performance of OmiEmbed. The OmiEmbed model achieved promising results on multiple downstream tasks, including dimensionality reduction, tumour type classification, multi-omics integration, demographic and clinical feature reconstruction, and survival prediction. Instead of training and applying the downstream networks separately, the multi-task strategy combined them and conducted multiple tasks simultaneously and efficiently. The model achieved better performance with the multi-task strategy than when the downstream networks were trained individually. OmiEmbed is a powerful tool for accurately capturing comprehensive phenotypic information from high-dimensional omics data and has great potential to facilitate more accurate and personalised clinical decision making.
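The data flow described in the abstract (several omics types mapped into one shared low-dimensional embedding, which then feeds several downstream networks trained together) can be illustrated with a minimal forward-pass sketch. This is not the authors' implementation: the single dense layer standing in for the deep embedding module, the `tanh` activation, the layer sizes, the field names (`expression`, `methylation`, `tumour_type`, `age`), the two example tasks, and the equal loss weights are all assumptions made for illustration, and no training loop is shown.

```python
import math
import random


def init_layer(n_in, n_out, rng):
    """Return small random weights and zero biases (hypothetical initialisation)."""
    w = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    return w, [0.0] * n_out


def linear(x, w, b):
    """Dense layer for a single sample: w @ x + b."""
    return [sum(wi * xi for wi, xi in zip(row, x)) + bi for row, bi in zip(w, b)]


def softmax(z):
    """Numerically stable softmax over a list of logits."""
    m = max(z)
    e = [math.exp(v - m) for v in z]
    s = sum(e)
    return [v / s for v in e]


def multitask_loss(sample, params, weights=(1.0, 1.0)):
    """Compute a weighted sum of two task losses from one shared embedding."""
    # 1. Concatenate the omics types into one feature vector (assumed layout).
    x = sample["expression"] + sample["methylation"]
    # 2. Shared embedding: one dense layer stands in for the embedding module.
    z = [math.tanh(v) for v in linear(x, *params["encoder"])]
    # 3. Task-specific heads read the same embedding.
    probs = softmax(linear(z, *params["tumour_head"]))
    age_pred = linear(z, *params["age_head"])[0]
    # 4. Per-task losses: cross-entropy for the class, squared error for age.
    ce = -math.log(probs[sample["tumour_type"]])
    mse = (age_pred - sample["age"]) ** 2
    # 5. The multi-task objective is the weighted sum of the task losses.
    total = weights[0] * ce + weights[1] * mse
    return total, z, {"classification": ce, "regression": mse}


rng = random.Random(0)
# Synthetic sample: 6 expression features, 5 methylation features (made up).
sample = {
    "expression": [rng.random() for _ in range(6)],
    "methylation": [rng.random() for _ in range(5)],
    "tumour_type": 2,   # class index out of 3 hypothetical tumour types
    "age": 0.6,         # hypothetical age, scaled to [0, 1]
}
params = {
    "encoder": init_layer(11, 4, rng),      # 11 input features -> 4-d embedding
    "tumour_head": init_layer(4, 3, rng),   # embedding -> 3 class logits
    "age_head": init_layer(4, 1, rng),      # embedding -> 1 regression output
}
total, z, parts = multitask_loss(sample, params)
print(len(z), parts)
```

Because both heads read the same embedding `z`, minimising the summed loss would push gradients from every task into the shared encoder. That shared signal is the general idea behind training downstream networks jointly rather than one at a time.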

