Neural Embeddings for Protein Graphs

06/07/2023
by Francesco Ceccarelli, et al.

Proteins perform much of the work in living organisms, and consequently the development of efficient computational methods for protein representation is essential for advancing large-scale biological research. Most current approaches struggle to efficiently integrate the wealth of information contained in the protein sequence and structure. In this paper, we propose a novel framework for embedding protein graphs in geometric vector spaces, by learning an encoder function that preserves the structural distance between protein graphs. Utilizing Graph Neural Networks (GNNs) and Large Language Models (LLMs), the proposed framework generates structure- and sequence-aware protein representations. We demonstrate that our embeddings are successful in the task of comparing protein structures, while providing a significant speed-up compared to traditional approaches based on structural alignment. Our framework achieves remarkable results in the task of protein structure classification; in particular, when compared to other work, the proposed method shows an average F1-Score improvement of 26% on out-of-distribution (OOD) samples and of 32% when tested on samples coming from the same distribution as the training data. Our approach finds applications in areas such as drug prioritization, drug repurposing, disease sub-type analysis and elsewhere.
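The idea of an encoder that "preserves the structural distance between protein graphs" can be illustrated with a small sketch. The abstract does not state the exact objective, so the mean squared error over Euclidean distances below is an assumption, and the names `euclidean` and `distance_preservation_loss` are hypothetical. The encoder itself (the GNN and LLM components) is not shown; the embeddings are taken as given.

```python
import math
from itertools import combinations


def euclidean(u, v):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def distance_preservation_loss(embeddings, target):
    """Mean squared gap between embedding distances and target distances.

    embeddings: list of vectors, one per protein graph (assumed to come
                from some encoder, which is not modelled here).
    target:     symmetric matrix where target[i][j] is a precomputed
                structural distance between proteins i and j (assumed to
                come from an alignment-based method).
    """
    pairs = list(combinations(range(len(embeddings)), 2))
    if not pairs:
        raise ValueError("need at least two embeddings")
    total = 0.0
    for i, j in pairs:
        gap = euclidean(embeddings[i], embeddings[j]) - target[i][j]
        total += gap ** 2
    return total / len(pairs)


# Toy data: three 2-D embeddings whose pairwise distances match the targets.
embeddings = [[0.0, 0.0], [3.0, 4.0], [6.0, 8.0]]
matching_target = [[0.0, 5.0, 10.0], [5.0, 0.0, 5.0], [10.0, 5.0, 0.0]]
shifted_target = [[0.0, 6.0, 10.0], [6.0, 0.0, 5.0], [10.0, 5.0, 0.0]]

perfect_loss = distance_preservation_loss(embeddings, matching_target)
imperfect_loss = distance_preservation_loss(embeddings, shifted_target)
```

Once such an encoder is trained, comparing two proteins reduces to one vector distance computation rather than a full structural alignment, which is consistent with the speed-up the abstract reports.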

research
07/03/2022

Advancing protein language models with linguistics: a roadmap for improved interpretability

Deep neural-network-based language models (LMs) are increasingly applied...
research
05/20/2022

EGR: Equivariant Graph Refinement and Assessment of 3D Protein Complex Structures

Protein complexes are macromolecules essential to the functioning and we...
research
07/25/2023

Prot2Text: Multimodal Protein's Function Generation with GNNs and Transformers

The complex nature of big biological systems pushed some scientists to c...
research
01/16/2020

Graph Attentional Autoencoder for Anticancer Hyperfood Prediction

Recent research efforts have shown the possibility to discover anticance...
research
12/12/2020

TALI: Protein Structure Alignment Using Backbone Torsion Angles

This article introduces a novel protein structure alignment method (name...
research
06/21/2023

Predicting protein variants with equivariant graph neural networks

Pre-trained models have been successful in many protein engineering task...
research
12/12/2022

Graph algorithms for predicting subcellular localization at the pathway level

Protein subcellular localization is an important factor in normal cellul...
