Evaluation of GPT and BERT-based models on identifying protein-protein interactions in biomedical text

03/30/2023
by Hasin Rehana, et al.

Detecting protein-protein interactions (PPIs) is crucial for understanding genetic mechanisms, disease pathogenesis, and drug design. However, with the fast-paced growth of biomedical literature, there is a growing need for automated and accurate extraction of PPIs to facilitate scientific knowledge discovery. Pre-trained language models, such as generative pre-trained transformer (GPT) and bidirectional encoder representations from transformers (BERT), have shown promising results in natural language processing (NLP) tasks. We evaluated the PPI identification performance of various GPT and BERT models using a manually curated benchmark corpus of 164 PPIs in 77 sentences from Learning Language in Logic (LLL). BERT-based models achieved the best overall performance, with PubMedBERT achieving the highest precision (85.17%) and F1-score (86.47%) [...] Despite not being explicitly trained for biomedical texts, GPT-4 achieved comparable performance to the best BERT models with 83.34% [...] recall, and 79.18% [...] effectively detect PPIs from text data and have the potential for use in biomedical literature mining tasks.
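The abstract reports precision, recall, and F1-score for PPI identification. As a rough illustration of how such scores are typically computed, here is a minimal sketch that compares predicted interaction pairs against gold annotations. It is not the authors' evaluation code: the function names, the placeholder protein names, and the choice to treat a pair as unordered are all assumptions made for this example.

```python
from typing import FrozenSet, Iterable, Set, Tuple


def as_pairs(pairs: Iterable[Tuple[str, str]]) -> Set[FrozenSet[str]]:
    """Normalize (a, b) tuples into unordered pairs.

    Assumption: an interaction is undirected, so (a, b) equals (b, a).
    """
    return {frozenset(p) for p in pairs}


def precision_recall_f1(
    gold: Iterable[Tuple[str, str]],
    predicted: Iterable[Tuple[str, str]],
) -> Tuple[float, float, float]:
    """Score predicted pairs against gold pairs (hypothetical helper)."""
    gold_set = as_pairs(gold)
    pred_set = as_pairs(predicted)
    true_positives = len(gold_set & pred_set)

    # Guard against empty sets to avoid division by zero.
    precision = true_positives / len(pred_set) if pred_set else 0.0
    recall = true_positives / len(gold_set) if gold_set else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Placeholder data, invented purely for illustration.
gold = [("ProtA", "ProtB"), ("ProtC", "ProtD"), ("ProtE", "ProtF")]
predicted = [("ProtB", "ProtA"), ("ProtC", "ProtD"), ("ProtX", "ProtY")]

p, r, f1 = precision_recall_f1(gold, predicted)
print(f"precision={p:.4f} recall={r:.4f} f1={f1:.4f}")
```

Here two of the three predicted pairs match gold pairs, so precision, recall, and F1 all come out to two thirds. If a benchmark annotates interactions as directed, ordered tuples should be compared instead of unordered sets.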


research
02/03/2023

Bioformer: an efficient transformer language model for biomedical text mining

Pretrained language models such as Bidirectional Encoder Representations...
research
11/30/2021

Text Mining Drug/Chemical-Protein Interactions using an Ensemble of BERT and T5 Based Models

In Track-1 of the BioCreative VII Challenge participants are asked to id...
research
10/31/2021

R-BERT-CNN: Drug-target interactions extraction from biomedical literature

In this research, we present our work participation for the DrugProt tas...
research
09/30/2020

Extracting Concepts for Precision Oncology from the Biomedical Literature

This paper describes an initial dataset and automatic natural language p...
research
06/26/2016

This before That: Causal Precedence in the Biomedical Domain

Causal precedence between biochemical interactions is crucial in the bio...
research
06/08/2023

Extensive Evaluation of Transformer-based Architectures for Adverse Drug Events Extraction

Adverse Event (ADE) extraction is one of the core tasks in digital pharm...
research
11/04/2020

Extracting Chemical-Protein Interactions via Calibrated Deep Neural Network and Self-training

The extraction of interactions between chemicals and proteins from sever...
