RNA Alternative Splicing Prediction with Discrete Compositional Energy Network

03/07/2021
by   Alvin Chan, et al.
0

A single gene can encode for different protein versions through a process called alternative splicing. Since proteins play major roles in cellular functions, aberrant splicing profiles can result in a variety of diseases, including cancers. Alternative splicing is determined by the gene's primary sequence and other regulatory factors such as RNA-binding protein levels. With these as input, we formulate the prediction of RNA splicing as a regression task and build a new training dataset (CAPD) to benchmark learned models. We propose discrete compositional energy network (DCEN) which leverages the hierarchical relationships between splice sites, junctions and transcripts to approach this task. In the case of alternative splicing prediction, DCEN models mRNA transcript probabilities through its constituent splice junctions' energy values. These transcript probabilities are subsequently mapped to relative abundance values of key nucleotides and trained with ground-truth experimental measurements. Through our experiments on CAPD, we show that DCEN outperforms baselines and ablation variants.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/01/2021

Leveraging Sequence Embedding and Convolutional Neural Network for Protein Function Prediction

The capability of accurate prediction of protein functions and propertie...
research
12/07/2022

Unsupervised language models for disease variant prediction

There is considerable interest in predicting the pathogenicity of protei...
research
12/06/2021

An Effective GCN-based Hierarchical Multi-label classification for Protein Function Prediction

We propose an effective method to improve Protein Function Prediction (P...
research
11/12/2017

A Sequence-Based Mesh Classifier for the Prediction of Protein-Protein Interactions

The worldwide surge of multiresistant microbial strains has propelled th...
research
08/10/2021

A Brief Review of Machine Learning Techniques for Protein Phosphorylation Sites Prediction

Reversible Post-Translational Modifications (PTMs) have vital roles in e...

Please sign up or login with your details

Forgot password? Click here to reset