Macromolecule Classification Based on the Amino-acid Sequence

by Faisal Ghaffar, et al.

Deep learning plays a vital role in every field that involves data. It has emerged as a strong and efficient framework that can be applied to a broad spectrum of complex learning problems that were previously difficult to solve with traditional machine learning techniques. In this study, we focus on the classification of protein sequences with deep learning techniques. The study of amino acid sequences is vital in the life sciences. We used different word embedding techniques from natural language processing to represent amino acid sequences as vectors. Our main goal was to classify sequences into four classes: DNA, RNA, protein, and hybrid. After several tests, we achieved almost 99% accuracy, experimenting with LSTM, Bidirectional LSTM, and GRU models.
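The abstract does not say how sequences are tokenized before embedding, so the following is only a minimal sketch of one common approach: splitting a sequence into overlapping k-mers ("words") and mapping them to integer ids that an embedding layer could consume. The choice of `k=3`, the 20-letter alphabet, the `<pad>`/`<unk>` tokens, and the function names are all assumptions for illustration, not the authors' method.

```python
from itertools import product

# Assumption: the 20 standard amino-acid one-letter codes.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def kmer_tokens(sequence, k=3):
    """Split a sequence into overlapping k-mers, treated as 'words'."""
    sequence = sequence.upper()
    return [sequence[i:i + k] for i in range(len(sequence) - k + 1)]


def build_vocab(k=3):
    """Map every possible k-mer to an integer id.

    Ids 0 and 1 are reserved for padding and unknown tokens
    (hypothetical conventions chosen for this sketch).
    """
    vocab = {"<pad>": 0, "<unk>": 1}
    for i, letters in enumerate(product(AMINO_ACIDS, repeat=k), start=2):
        vocab["".join(letters)] = i
    return vocab


def encode(sequence, vocab, k=3, max_len=8):
    """Convert a sequence to a fixed-length list of k-mer ids."""
    ids = [vocab.get(tok, vocab["<unk>"]) for tok in kmer_tokens(sequence, k)]
    ids = ids[:max_len]                                   # truncate long inputs
    return ids + [vocab["<pad>"]] * (max_len - len(ids))  # pad short inputs


vocab = build_vocab(k=3)
encoded = encode("MKTAYIAK", vocab, k=3, max_len=8)
print(kmer_tokens("MKTAYIAK"))
print(encoded)
```

Fixed-length id lists like `encoded` are the usual input to an embedding layer followed by a recurrent model such as an LSTM or GRU. A learned embedding (for example, one trained in a word2vec style) would replace the raw ids with dense vectors.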


Protein sequence design with deep generative models

Protein engineering seeks to identify protein sequences with optimized p...

Align-gram : Rethinking the Skip-gram Model for Protein Sequence Analysis

Background: The inception of next generations sequencing technologies ha...

Deep Learning Methods for Protein Family Classification on PDB Sequencing Data

Composed of amino acid chains that influence how they fold and thus dict...

Bioinformatics and Classical Literary Study

This paper describes the Quantitative Criticism Lab, a collaborative ini...

Dive into Machine Learning Algorithms for Influenza Virus Host Prediction with Hemagglutinin Sequences

Influenza viruses mutate rapidly and can pose a threat to public health,...