Exploring evolution-based & -free protein language models as protein function predictors

06/14/2022
by   Mingyang Hu, et al.

Large-scale Protein Language Models (PLMs) have improved performance in protein prediction tasks, ranging from 3D structure prediction to various function predictions. In particular, AlphaFold, a ground-breaking AI system, could potentially reshape structural biology. However, the utility of Evoformer, the PLM module in AlphaFold, has not been explored beyond structure prediction. In this paper, we investigate the representation ability of three popular PLMs: ESM-1b (single sequence), MSA-Transformer (multiple sequence alignment) and Evoformer (structural), with a special focus on Evoformer. Specifically, we aim to answer the following key questions: (i) Does the Evoformer trained as part of AlphaFold produce representations amenable to predicting protein function? (ii) If yes, can Evoformer replace ESM-1b and MSA-Transformer? (iii) How much do these PLMs rely on evolution-related protein data, and are they complementary to each other in this regard? We compare these models through an empirical study and report new insights and conclusions. Finally, we release code and datasets for reproducibility.

research
06/17/2022

Transformer Neural Networks Attending to Both Sequence and Structure for Protein Prediction Tasks

The increasing number of protein sequences decoded from genomes is openi...
research
02/24/2023

Retrieved Sequence Augmentation for Protein Representation Learning

Protein language models have excelled in a variety of tasks, ranging fro...
research
06/26/2020

BERTology Meets Biology: Interpreting Attention in Protein Language Models

Transformer architectures have proven to learn useful representations fo...
research
03/29/2022

Protein language models trained on multiple sequence alignments learn phylogenetic relationships

Self-supervised neural language models with attention have recently been...
research
09/07/2023

Insights Into the Inner Workings of Transformer Models for Protein Function Prediction

Motivation: We explored how explainable AI (XAI) can help to shed light ...
research
12/03/2018

FoldingZero: Protein Folding from Scratch in Hydrophobic-Polar Model

De novo protein structure prediction from amino acid sequence is one of ...
research
06/23/2022

ICOS Protein Expression Segmentation: Can Transformer Networks Give Better Results?

Biomarkers identify a patient's response to treatment. With the recent ad...
