Beyond MeSH: Fine-Grained Semantic Indexing of Biomedical Literature based on Weak Supervision

05/15/2020
by   Anastasios Nentidis, et al.
0

In this work, we propose a method for the automated refinement of subject annotations in biomedical literature at the level of concepts. Semantic indexing and search of biomedical articles in MEDLINE/PubMed are based on semantic subject annotations with MeSH descriptors that may correspond to several related but distinct biomedical concepts. Such semantic annotations do not adhere to the level of detail available in the domain knowledge and may not be sufficient to fulfil the information needs of experts in the domain. To this end, we propose a new method that uses weak supervision to train a concept annotator on the literature available for a particular disease. We test this method on the MeSH descriptors for two diseases: Alzheimer's Disease and Duchenne Muscular Dystrophy. The results indicate that concept-occurrence is a strong heuristic for automated subject annotation refinement and its use as weak supervision can lead to improved concept-level annotations. The fine-grained semantic annotations can enable more precise literature retrieval, sustain the semantic integration of subject annotations with other domain resources and ease the maintenance of consistent subject annotations, as new more detailed entries are added in the MeSH thesaurus over time.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/23/2023

Large-scale fine-grained semantic indexing of biomedical literature based on weakly-supervised deep learning

Semantic indexing of biomedical literature is usually done at the level ...
research
01/20/2021

What is all this new MeSH about? Exploring the semantic provenance of new descriptors in the MeSH thesaurus

The Medical Subject Headings (MeSH) thesaurus is a controlled vocabulary...
research
01/09/2022

Zero-Shot and Few-Shot Classification of Biomedical Articles in Context of the COVID-19 Pandemic

MeSH (Medical Subject Headings) is a large thesaurus created by the Nati...
research
10/13/2022

Overview of BioASQ 2022: The tenth BioASQ challenge on Large-Scale Biomedical Semantic Indexing and Question Answering

This paper presents an overview of the tenth edition of the BioASQ chall...
research
12/18/2019

Semantic integration of disease-specific knowledge

Biomedical researchers working on a specific disease need up-to-date and...
research
03/19/2021

Biomedical Convergence Facilitated by the Emergence of Technological and Informatic Capabilities

We analyzed Medical Subject Headings (MeSH) from 21.6 million research a...
research
03/30/2022

The Weak Supervision Landscape

Many ways of annotating a dataset for machine learning classification ta...

Please sign up or login with your details

Forgot password? Click here to reset