Annotating and normalizing biomedical NEs with limited knowledge

12/19/2019
by   Fernando Sánchez León, et al.
0

Named entity recognition (NER) is the very first step in the linguistic processing of any new domain. It is currently a common process in BioNLP on English clinical text. However, it is still in its infancy in other major languages, as it is the case for Spanish. Presented under the umbrella of the PharmaCoNER shared task, this paper describes a very simple method for the annotation and normalization of pharmacological, chemical and, ultimately, biomedical named entities in clinical cases. The system developed for the shared task is based on limited knowledge, collected, structured and munged in a way that clearly outperforms scores obtained by similar dictionary-based systems for English in the past. Along with this recovering of the knowledge-based methods for NER in subdomains, the paper also highlights the key contribution of resource-based systems in the validation and consolidation of both the annotation guidelines and the human annotation practices. In this sense, some of the authors discoverings on the overall quality of human annotated datasets question the above-mentioned `official' results obtained by this system, that ranked second (0.91 F1-score) and first (0.916 F1-score), respectively, in the two PharmaCoNER subtasks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/08/2019

Linguistically Informed Relation Extraction and Neural Architectures for Nested Named Entity Recognition in BioNLP-OST 2019

Named Entity Recognition (NER) and Relation Extraction (RE) are essentia...
research
10/14/2021

MIMICause : Defining, identifying and predicting types of causal relationships between biomedical concepts from clinical notes

Understanding of causal narratives communicated in clinical notes can he...
research
05/29/2023

Extrinsic Factors Affecting the Accuracy of Biomedical NER

Biomedical named entity recognition (NER) is a critial task that aims to...
research
06/10/2021

Neural Text Classification and Stacked Heterogeneous Embeddings for Named Entity Recognition in SMM4H 2021

This paper presents our findings from participating in the SMM4H Shared ...
research
06/03/2023

Impact of translation on biomedical information extraction from real-life clinical notes

The objective of our study is to determine whether using English tools t...
research
07/16/2019

MedCATTrainer: A Biomedical Free Text Annotation Interface with Active Learning and Research Use Case Specific Customisation

We present MedCATTrainer an interface for building, improving and custom...
research
04/25/2020

A Named Entity Based Approach to Model Recipes

Traditional cooking recipes follow a structure which can be modelled ver...

Please sign up or login with your details

Forgot password? Click here to reset