Read classification using semi-supervised deep learning

04/23/2019
by   Tomislav Šebrek, et al.
0

In this paper, we propose a semi-supervised deep learning method for detecting the specific types of reads that impede the de novo genome assembly process. Instead of dealing directly with sequenced reads, we analyze their coverage graphs converted to 1D-signals. We noticed that specific signal patterns occur in each relevant class of reads. Semi-supervised approach is chosen because manually labelling the data is a very slow and tedious process, so our goal was to facilitate the assembly process with as little labeled data as possible. We tested two models to learn patterns in the coverage graphs: M1+M2 and semi-GAN. We evaluated the performance of each model based on a manually labeled dataset that comprises various reads from multiple reference genomes with respect to the number of labeled examples that were used during the training process. In addition, we embedded our detection in the assembly process which improved the quality of assemblies.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/28/2019

A Review of Semi Supervised Learning Theories and Recent Advances

Semi-supervised learning, which has emerged from the beginning of this c...
research
12/19/2018

Semi-Supervised Deep Learning for Abnormality Classification in Retinal Images

Supervised deep learning algorithms have enabled significant performance...
research
08/19/2017

Semi-supervised Conditional GANs

We introduce a new model for building conditional generative models in a...
research
03/05/2023

Neuroevolutionary algorithms driven by neuron coverage metrics for semi-supervised classification

In some machine learning applications the availability of labeled instan...
research
06/04/2020

Semi-supervised and Unsupervised Methods for Heart Sounds Classification in Restricted Data Environments

Automated heart sounds classification is a much-required diagnostic tool...
research
11/25/2019

Detecting Unknown Behaviors by Pre-defined Behaviours: An Bayesian Non-parametric Approach

An automatic mouse behavior recognition system can considerably reduce t...
research
09/13/2021

Specified Certainty Classification, with Application to Read Classification for Reference-Guided Metagenomic Assembly

Specified Certainty Classification (SCC) is a new paradigm for employing...

Please sign up or login with your details

Forgot password? Click here to reset