Machine Learning based Prediction of Hierarchical Classification of Transposable Elements

07/02/2019
by Manisha Panta, et al.

Transposable Elements (TEs), or jumping genes, are DNA sequences that have an intrinsic capability to move within a host genome from one genomic location to another. Studies show that the presence of a TE within or adjacent to a functional gene may alter its expression. TEs can also increase the rate of mutation and can even mediate duplications and large insertions and deletions in the genome, promoting gross genetic rearrangements. Proper classification of identified jumping genes is therefore essential to understanding their genetic and evolutionary effects in the genome. While computational methods have been developed that perform either binary classification or multi-label classification of TEs, few studies have focused on their hierarchical classification. The state-of-the-art machine learning classification method uses a Multi-Layer Perceptron (MLP), a class of neural network, for hierarchical classification of TEs. However, the existing methods have limited accuracy in classifying TEs. A more effective classifier, one that can help explain the role of TEs in germline and somatic evolution, is needed. In this study, we examine the performance of a variety of machine learning (ML) methods and propose a robust approach for the hierarchical classification of TEs, with higher accuracy, using Support Vector Machines (SVM).
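
To make the idea of hierarchical classification concrete, the following is a minimal sketch of one common strategy: top-down routing with one classifier per internal node of the class tree. It is not the authors' implementation. The abstract does not specify features, the hierarchy, or the training setup, so everything here is an assumption for illustration: the 2-mer frequency features, the toy label tree (`ClassI`, `ClassII`, `LTR`, `nonLTR`), the synthetic sequences, and the helper names. To stay dependency-free, a nearest-centroid classifier stands in where the study would fit an SVM.

```python
from collections import Counter
from itertools import product

K = 2  # k-mer length; an assumed choice for this illustration
KMERS = ["".join(p) for p in product("ACGT", repeat=K)]


def kmer_features(seq):
    """Return normalized k-mer frequencies for a DNA sequence."""
    counts = Counter(seq[i:i + K] for i in range(len(seq) - K + 1))
    total = sum(counts[k] for k in KMERS) or 1
    return [counts[k] / total for k in KMERS]


class CentroidClassifier:
    """Nearest-centroid stand-in; an SVM would be fitted here instead."""

    def fit(self, X, y):
        groups = {}
        for x, label in zip(X, y):
            groups.setdefault(label, []).append(x)
        # One centroid (column-wise mean) per label.
        self.centroids = {
            label: [sum(col) / len(rows) for col in zip(*rows)]
            for label, rows in groups.items()
        }
        return self

    def predict_one(self, x):
        def sq_dist(c):
            return sum((a - b) ** 2 for a, b in zip(x, c))
        return min(self.centroids, key=lambda lab: sq_dist(self.centroids[lab]))


def train_hierarchy(samples):
    """Train one classifier per internal node.

    samples: list of (sequence, path), where path is a tuple of labels
    from the root of the hierarchy down to the sample's leaf.
    """
    by_parent = {}
    for seq, path in samples:
        x = kmer_features(seq)
        for depth, label in enumerate(path):
            X, y = by_parent.setdefault(path[:depth], ([], []))
            X.append(x)
            y.append(label)
    return {parent: CentroidClassifier().fit(X, y)
            for parent, (X, y) in by_parent.items()}


def predict_path(models, seq):
    """Route a sequence from the root until no child classifier exists."""
    x = kmer_features(seq)
    path = ()
    while path in models:
        path += (models[path].predict_one(x),)
    return path


# Synthetic toy data (not real TE sequences), labeled with hypothetical paths.
samples = [
    ("ATATATATATAT", ("ClassI", "LTR")),
    ("ATATATTTATAT", ("ClassI", "LTR")),
    ("AAAAAAAAATAA", ("ClassI", "nonLTR")),
    ("AAAAATAAAAAA", ("ClassI", "nonLTR")),
    ("GCGCGCGCGCGC", ("ClassII",)),
    ("GCGCGGGCGCGC", ("ClassII",)),
]
models = train_hierarchy(samples)
print(predict_path(models, "ATATATATATAT"))
print(predict_path(models, "GCGCGCGCGCGC"))
```

Each prediction is a path through the tree rather than a single flat label, so an error at an upper level propagates downward. That is one reason the choice of per-node classifier matters for overall accuracy.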
