Improving Multi-Word Entity Recognition for Biomedical Texts

08/15/2019
by   Hamada A. Nayel, et al.
0

Biomedical Named Entity Recognition (BioNER) is a crucial step for analyzing Biomedical texts, which aims at extracting biomedical named entities from a given text. Different supervised machine learning algorithms have been applied for BioNER by various researchers. The main requirement of these approaches is an annotated dataset used for learning the parameters of machine learning algorithms. Segment Representation (SR) models comprise of different tag sets used for representing the annotated data, such as IOB2, IOE2 and IOBES. In this paper, we propose an extension of IOBES model to improve the performance of BioNER. The proposed SR model, FROBES, improves the representation of multi-word entities. We used Bidirectional Long Short-Term Memory (BiLSTM) network; an instance of Recurrent Neural Networks (RNN), to design a baseline system for BioNER and evaluated the new SR model on two datasets, i2b2/VA 2010 challenge dataset and JNLPBA 2004 shared task dataset. The proposed SR model outperforms other models for multi-word entities with length greater than two. Further, the outputs of different SR models have been combined using majority voting ensemble method which outperforms the baseline models performance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/05/2019

Integrating Dictionary Feature into A Deep Learning Model for Disease Named Entity Recognition

In recent years, Deep Learning (DL) models are becoming important due to...
research
07/01/2020

Improving NER for Clinical Texts by Ensemble Approach using Segment Representations

Clinical Named Entity Recognition (Clinical-NER), which aims at identify...
research
06/05/2019

KAS-term: Extracting Slovene Terms from Doctoral Theses via Supervised Machine Learning

This paper presents a dataset and supervised learning experiments for te...
research
09/22/2018

A Byte-sized Approach to Named Entity Recognition

In biomedical literature, it is common for entity boundaries to not alig...
research
01/30/2018

Cross-type Biomedical Named Entity Recognition with Deep Multi-Task Learning

Motivation: Biomedical named entity recognition (BioNER) is the most fun...
research
10/26/2020

Using Unlabeled Texts for Named-Entity Recognition

Named Entity Recognition (NER) poses the problem of learning with multip...
research
01/15/2020

Transfer learning for biomedical named entity recognition with neural networks.

Motivation The explosive increase of biomedical literature has made i...

Please sign up or login with your details

Forgot password? Click here to reset