FoodChem: A food-chemical relation extraction model

10/05/2021
by   Gjorgjina Cenikj, et al.
0

In this paper, we present FoodChem, a new Relation Extraction (RE) model for identifying chemicals present in the composition of food entities, based on textual information provided in biomedical peer-reviewed scientific literature. The RE task is treated as a binary classification problem, aimed at identifying whether the contains relation exists between a food-chemical entity pair. This is accomplished by fine-tuning BERT, BioBERT and RoBERTa transformer models. For evaluation purposes, a novel dataset with annotated contains relations in food-chemical entity pairs is generated, in a golden and silver version. The models are integrated into a voting scheme in order to produce the silver version of the dataset which we use for augmenting the individual models, while the manually annotated golden version is used for their evaluation. Out of the three evaluated models, the BioBERT model achieves the best results, with a macro averaged F1 score of 0.902 in the unbalanced augmentation setting.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/10/2022

AIFB-WebScience at SemEval-2022 Task 12: Relation Extraction First – Using Relation Extraction to Identify Entities

In this paper, we present an end-to-end joint entity and relation extrac...
research
04/03/2023

End-to-End Models for Chemical-Protein Interaction Extraction: Better Tokenization and Span-Based Pipeline Strategies

End-to-end relation extraction (E2ERE) is an important task in informati...
research
11/01/2020

Investigation of BERT Model on Biomedical Relation Extraction Based on Revised Fine-tuning Mechanism

With the explosive growth of biomedical literature, designing automatic ...
research
02/20/2023

A Two-step Approach for Handling Zero-Cardinality in Relation Extraction

Relation tuple extraction from text is an important task for building kn...
research
11/30/2021

Text Mining Drug/Chemical-Protein Interactions using an Ensemble of BERT and T5 Based Models

In Track-1 of the BioCreative VII Challenge participants are asked to id...
research
02/05/2018

Chemical-protein relation extraction with ensembles of SVM, CNN, and RNN models

Text mining the relations between chemicals and proteins is an increasin...
research
11/20/2020

Learning Informative Representations of Biomedical Relations with Latent Variable Models

Extracting biomedical relations from large corpora of scientific documen...

Please sign up or login with your details

Forgot password? Click here to reset