Biomedical Named Entity Recognition via Dictionary-based Synonym Generalization

05/22/2023
by   Zihao Fu, et al.
0

Biomedical named entity recognition is one of the core tasks in biomedical natural language processing (BioNLP). To tackle this task, numerous supervised/distantly supervised approaches have been proposed. Despite their remarkable success, these approaches inescapably demand laborious human effort. To alleviate the need of human effort, dictionary-based approaches have been proposed to extract named entities simply based on a given dictionary. However, one downside of existing dictionary-based approaches is that they are challenged to identify concept synonyms that are not listed in the given dictionary, which we refer as the synonym generalization problem. In this study, we propose a novel Synonym Generalization (SynGen) framework that recognizes the biomedical concepts contained in the input text using span-based predictions. In particular, SynGen introduces two regularization terms, namely, (1) a synonym distance regularizer; and (2) a noise perturbation regularizer, to minimize the synonym generalization error. To demonstrate the effectiveness of our approach, we provide a theoretical analysis of the bound of synonym generalization error. We extensively evaluate our approach on a wide range of benchmarks and the results verify that SynGen outperforms previous dictionary-based models by notable margins. Lastly, we provide a detailed analysis to further reveal the merits and inner-workings of our approach.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/01/2021

How Do Your Biomedical Named Entity Models Generalize to Novel Entities?

The number of biomedical literature on new biomedical concepts is rapidl...
research
12/03/2019

HAMNER: Headword Amplified Multi-span Distantly Supervised Method for Domain Specific Named Entity Recognition

To tackle Named Entity Recognition (NER) tasks, supervised methods need ...
research
11/30/2022

AIONER: All-in-one scheme-based biomedical named entity recognition using deep learning

Biomedical named entity recognition (BioNER) seeks to automatically reco...
research
09/22/2018

A Byte-sized Approach to Named Entity Recognition

In biomedical literature, it is common for entity boundaries to not alig...
research
10/26/2016

Knowledge-Based Biomedical Word Sense Disambiguation with Neural Concept Embeddings

Biomedical word sense disambiguation (WSD) is an important intermediate ...
research
04/21/2021

End-to-end Biomedical Entity Linking with Span-based Dictionary Matching

Disease name recognition and normalization, which is generally called bi...
research
09/23/2019

Biomedical Mention Disambiguation using a Deep Learning Approach

Automatically locating named entities in natural language text - named e...

Please sign up or login with your details

Forgot password? Click here to reset