DeepAI AI Chat
Log In Sign Up

Convolutional LSTM Networks for Subcellular Localization of Proteins

by   Søren Kaae Sønderby, et al.

Machine learning is widely used to analyze biological sequence data. Non-sequential models such as SVMs or feed-forward neural networks are often used although they have no natural way of handling sequences of varying length. Recurrent neural networks such as the long short term memory (LSTM) model on the other hand are designed to handle sequences. In this study we demonstrate that LSTM networks predict the subcellular location of proteins given only the protein sequence with high accuracy (0.902) outperforming current state of the art algorithms. We further improve the performance by introducing convolutional filters and experiment with an attention mechanism which lets the LSTM focus on specific parts of the protein. Lastly we introduce new visualizations of both the convolutional filters and the attention mechanisms and show how they can be used to extract biological relevant knowledge from the LSTM networks.


page 2

page 8


Long Short-Term Attention

In order to learn effective features from temporal sequences, the long s...

Towards Interpreting Zoonotic Potential of Betacoronavirus Sequences With Attention

Current methods for viral discovery target evolutionarily conserved prot...

Continuous online sequence learning with an unsupervised neural network model

The ability to recognize and predict temporal sequences of sensory input...

Parallelizing Linear Recurrent Neural Nets Over Sequence Length

Recurrent neural networks (RNNs) are widely used to model sequential dat...

DeepProteomics: Protein family classification using Shallow and Deep Networks

The knowledge regarding the function of proteins is necessary as it give...

Clustering in Recurrent Neural Networks for Micro-Segmentation using Spending Personality

Customer segmentation has long been a productive field in banking. Howev...

Code Repositories