An Integrated Deep Learning and Dynamic Programming Method for Predicting Tumor Suppressor Genes, Oncogenes, and Fusion from PDB Structures

Mutations in proto-oncogenes (ONGO) and loss of the regulatory function of tumor suppressor genes (TSG) are common underlying mechanisms of uncontrolled tumor growth. While cancer is a heterogeneous complex of distinct diseases, determining computationally whether a gene's function is likely related to ONGO or TSG can help develop drugs that target the disease. This paper proposes a classification method that begins with a preprocessing stage that extracts feature map sets from the input 3D protein structural information. The next stage is a deep convolutional neural network (DCNN) that outputs the probability of each functional class for a gene. We explored and tested two approaches: in Approach 1, all filtered and cleaned 3D protein structures (PDB) are pooled together, whereas in Approach 2, the primary structures and their corresponding PDBs are separated according to the genes' primary structural information. Following the DCNN stage, a dynamic programming-based method determines the final prediction of each primary structure's functionality. We validated the proposed method using the COSMIC online database. For the ONGO vs. TSG classification problem, the AUROCs of the DCNN stage for Approach 1 and Approach 2 are 0.978 and 0.765, respectively. The AUROCs of the final classification of the genes' primary structure functionality for Approach 1 and Approach 2 are 0.989 and 0.879, respectively. For comparison, the current state-of-the-art reported AUROC is 0.924.
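
To make the reported metric concrete, the following is a minimal sketch of how per-structure probabilities could be combined into one score per gene and then evaluated with AUROC. It is an illustration under stated assumptions, not the method of the paper: the abstract does not specify the dynamic programming step, so a plain mean over each gene's PDB scores stands in for it here. The gene names, scores, and labels are hypothetical toy values, and the function names `aggregate_by_gene` and `auroc` are introduced only for this example.

```python
from collections import defaultdict


def aggregate_by_gene(pdb_scores):
    """Average the per-structure probabilities of each gene.

    pdb_scores: iterable of (gene, probability) pairs, one per PDB structure.
    NOTE: the mean is an assumed stand-in; the paper uses a dynamic
    programming-based step that the abstract does not describe.
    """
    grouped = defaultdict(list)
    for gene, prob in pdb_scores:
        grouped[gene].append(prob)
    return {gene: sum(probs) / len(probs) for gene, probs in grouped.items()}


def auroc(labels, scores):
    """AUROC as the probability that a random positive outscores a random
    negative, counting ties as one half (the Mann-Whitney formulation)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


if __name__ == "__main__":
    # Hypothetical toy data: label 1 = ONGO, label 0 = TSG.
    pdb_scores = [
        ("GENE_A", 0.9), ("GENE_A", 0.7),
        ("GENE_B", 0.2), ("GENE_B", 0.4),
        ("GENE_C", 0.6),
        ("GENE_D", 0.7),
    ]
    gene_labels = {"GENE_A": 1, "GENE_B": 0, "GENE_C": 1, "GENE_D": 0}

    gene_scores = aggregate_by_gene(pdb_scores)
    genes = sorted(gene_scores)
    print(auroc([gene_labels[g] for g in genes], [gene_scores[g] for g in genes]))
```

An AUROC of 0.5 corresponds to random ranking and 1.0 to perfect separation of the two classes, which is the scale on which the 0.989, 0.879, and 0.924 figures above should be read.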
