ProteiNN: Intrinsic-Extrinsic Convolution and Pooling for Scalable Deep Protein Analysis

07/13/2020
by   Pedro Hermosilla, et al.
0

Proteins perform a large variety of functions in living organisms, thus playing a key role in biology. As of now, available learning algorithms to process protein data do not consider several particularities of such data and/or do not scale well for large protein conformations. To fill this gap, we propose two new learning operations enabling deep 3D analysis of large-scale protein data. First, we introduce a novel convolution operator which considers both, the intrinsic (invariant under protein folding) as well as extrinsic (invariant under bonding) structure, by using n-D convolutions defined on both the Euclidean distance, as well as multiple geodesic distances between atoms in a multi-graph. Second, we enable a multi-scale protein analysis by introducing hierarchical pooling operators, exploiting the fact that proteins are a recombination of a finite set of amino acids, which can be pooled using shared pooling matrices. Lastly, we evaluate the accuracy of our algorithms on several large-scale data sets for common protein analysis tasks, where we outperform state-of-the-art methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/10/2023

OpenProteinSet: Training data for structural biology at scale

Multiple sequence alignments (MSAs) of proteins encode rich biological i...
research
07/24/2023

DeepGATGO: A Hierarchical Pretraining-Based Graph-Attention Model for Automatic Protein Function Prediction

Automatic protein function prediction (AFP) is classified as a large-sca...
research
11/04/2018

Deep Robust Framework for Protein Function Prediction using Variable-Length Protein Sequences

Amino acid sequence portrays most intrinsic form of a protein and expres...
research
07/09/2019

Multiscale Visual Drilldown for the Analysis of Large Ensembles of Multi-Body Protein Complexes

When studying multi-body protein complexes, biochemists use computationa...
research
02/01/2022

AlphaDesign: A graph protein design method and benchmark on AlphaFoldDB

While DeepMind has tentatively solved protein folding, its inverse probl...
research
11/22/2019

Learnable Pooling in Graph Convolution Networks for Brain Surface Analysis

Brain surface analysis is essential to neuroscience, however, the comple...
research
04/04/2022

Multi-Scale Representation Learning on Proteins

Proteins are fundamental biological entities mediating key roles in cell...

Please sign up or login with your details

Forgot password? Click here to reset