A Byte-sized Approach to Named Entity Recognition

09/22/2018
by Emily Sheng, et al.

In biomedical literature, entity boundaries often do not align with word boundaries. Effective identification of entity spans therefore requires approaches that can consider tokens smaller than words. We introduce a novel subword approach for named entity recognition (NER) that uses byte-pair encodings (BPE) in combination with convolutional and recurrent neural networks to produce byte-level tags of entities. We present experimental results on several standard biomedical datasets, namely the BioCreative VI Bio-ID, JNLPBA, and GENETAG datasets. We demonstrate competitive performance while bypassing the specialized domain expertise needed to create biomedical text tokenization rules.
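To make the idea of byte-level tags concrete, here is a minimal sketch of how character-offset entity spans might be converted into one tag per UTF-8 byte. The function name `byte_tags`, the B/I/O label set, and the example sentence are assumptions made for illustration; the abstract does not specify the tagging scheme, and this sketch does not cover the BPE or neural network components.

```python
def byte_tags(text, spans):
    """Return one tag per UTF-8 byte of `text`: 'B', 'I', or 'O'.

    `spans` is a list of (start, end) character offsets, end exclusive.
    The B/I/O scheme is an illustrative assumption, not the paper's
    confirmed label set.
    """
    tags = []
    for i, ch in enumerate(text):
        # Decide the label for this character from the entity spans.
        label = "O"
        for start, end in spans:
            if start <= i < end:
                label = "B" if i == start else "I"
                break
        # A character may occupy several bytes; only the very first
        # byte of an entity is tagged 'B', the rest are 'I'.
        n_bytes = len(ch.encode("utf-8"))
        for k in range(n_bytes):
            tags.append("I" if (label == "B" and k > 0) else label)
    return tags


# The entity "IL-2" ends inside the whitespace token "IL-2-mediated",
# so a word-level tagger could not mark its boundary exactly.
example = "IL-2-mediated activation"
print("".join(byte_tags(example, [(0, 4)])[:6]))  # prints "BIIIOO"
```

Because tags are assigned per byte rather than per word, the span boundary can fall anywhere in the text, which is the property the abstract relies on to avoid hand-written tokenization rules.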
