Disease Classification in Metagenomics with 2D Embeddings and Deep Learning

06/23/2018
by   Thanh Hai Nguyen, et al.
0

Deep learning (DL) techniques have shown unprecedented success when applied to images, waveforms, and text. Generally, when the sample size (N) is much bigger than the number of features (d), DL often outperforms other machine learning (ML) techniques, often through the use of Convolutional Neural Networks (CNNs). However, in many bioinformatics fields (including metagenomics), we encounter the opposite situation where d is significantly greater than N. In these situations, applying DL techniques would lead to severe overfitting. Here we aim to improve classification of various diseases with metagenomic data through the use of CNNs. For this we proposed to represent metagenomic data as images. The proposed Met2Img approach relies on taxonomic and t-SNE embeddings to transform abundance data into "synthetic images". We applied our approach to twelve benchmark data sets including more than 1400 metagenomic samples. Our results show significant improvements over the state-of-the-art algorithms (Random Forest (RF), Support Vector Machine (SVM)). We observe that the integration of phylogenetic information alongside abundance data improves classification. The proposed approach is not only important in classification setting but also allows to visualize complex metagenomic data. The Met2Img is implemented in Python.

READ FULL TEXT
research
12/01/2017

Deep Learning for Metagenomic Data: using 2D Embeddings and Convolutional Neural Networks

Deep learning (DL) techniques have had unprecedented success when applie...
research
09/12/2023

Selection of contributing factors for predicting landslide susceptibility using machine learning and deep learning models

Landslides are a common natural disaster that can cause casualties, prop...
research
12/22/2018

Image Embedding of PMU Data for Deep Learning towards Transient Disturbance Classification

This paper presents a study on power grid disturbance classification by ...
research
06/12/2018

DeepTerramechanics: Terrain Classification and Slip Estimation for Ground Robots via Deep Learning

Terramechanics plays a critical role in the areas of ground vehicles and...
research
09/09/2020

Method for classifying a noisy Raman spectrum based on a wavelet transform and a deep neural network

This paper proposes a new framework based on a wavelet transform and dee...
research
08/03/2021

AI Based Waste classifier with Thermo-Rapid Composting

Waste management is a certainly a very complex and difficult process esp...
research
09/26/2019

Deep Learning and Random Forest-Based Augmentation of sRNA Expression Profiles

The lack of well-structured annotations in a growing amount of RNA expre...

Please sign up or login with your details

Forgot password? Click here to reset