Cross-Domain Data Integration for Named Entity Disambiguation in Biomedical Text

10/15/2021
by Maya Varma, et al.

Named entity disambiguation (NED), which involves mapping textual mentions to structured entities, is particularly challenging in the medical domain due to the presence of rare entities. Existing approaches are limited by the presence of coarse-grained structural resources in biomedical knowledge bases as well as the use of training datasets that provide low coverage over uncommon resources. In this work, we address these issues by proposing a cross-domain data integration method that transfers structural knowledge from a general text knowledge base to the medical domain. We utilize our integration scheme to augment structural resources and generate a large biomedical NED dataset for pretraining. Our pretrained model with injected structural knowledge achieves state-of-the-art performance on two benchmark medical NED datasets: MedMentions and BC5CDR. Furthermore, we improve disambiguation of rare entities by up to 57 accuracy points.

