How Do Your Biomedical Named Entity Models Generalize to Novel Entities?

01/01/2021
by   Hyunjae Kim, et al.
0

The number of biomedical literature on new biomedical concepts is rapidly increasing, which necessitates a reliable biomedical named entity recognition (BioNER) model for identifying new and unseen entity mentions. However, it is questionable whether existing BioNER models can effectively handle them. In this work, we systematically analyze the three types of recognition abilities of BioNER models: memorization, synonym generalization, and concept generalization. We find that (1) BioNER models are overestimated in terms of their generalization ability, and (2) they tend to exploit dataset biases, which hinders the models' abilities to generalize. To enhance the generalizability, we present a simple debiasing method based on the data statistics. Our method consistently improves the generalizability of the state-of-the-art (SOTA) models on five benchmark datasets, allowing them to better perform on unseen entity mentions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/15/2021

Regularizing Models via Pointwise Mutual Information for Named Entity Recognition

In Named Entity Recognition (NER), pre-trained language models have been...
research
05/22/2023

Biomedical Named Entity Recognition via Dictionary-based Synonym Generalization

Biomedical named entity recognition is one of the core tasks in biomedic...
research
05/01/2020

Biomedical Entity Representations with Synonym Marginalization

Biomedical named entities often play important roles in many biomedical ...
research
10/26/2016

Knowledge-Based Biomedical Word Sense Disambiguation with Neural Concept Embeddings

Biomedical word sense disambiguation (WSD) is an important intermediate ...
research
07/02/2022

A Biomedical Pipeline to Detect Clinical and Non-Clinical Named Entities

There are a few challenges related to the task of biomedical named entit...
research
09/23/2019

Biomedical Mention Disambiguation using a Deep Learning Approach

Automatically locating named entities in natural language text - named e...
research
05/14/2016

Occurrence Statistics of Entities, Relations and Types on the Web

The problem of collecting reliable estimates of occurrence of entities o...

Please sign up or login with your details

Forgot password? Click here to reset