BioLORD: Learning Ontological Representations from Definitions (for Biomedical Concepts and their Textual Descriptions)

10/21/2022
by   François Remy, et al.
0

This work introduces BioLORD, a new pre-training strategy for producing meaningful representations for clinical sentences and biomedical concepts. State-of-the-art methodologies operate by maximizing the similarity in representation of names referring to the same concept, and preventing collapse through contrastive learning. However, because biomedical names are not always self-explanatory, it sometimes results in non-semantic representations. BioLORD overcomes this issue by grounding its concept representations using definitions, as well as short descriptions derived from a multi-relational knowledge graph consisting of biomedical ontologies. Thanks to this grounding, our model produces more semantic concept representations that match more closely the hierarchical structure of ontologies. BioLORD establishes a new state of the art for text similarity on both clinical sentences (MedSTS) and biomedical concepts (MayoSRS).

READ FULL TEXT

page 2

page 12

research
06/01/2023

Automatic Glossary of Clinical Terminology: a Large-Scale Dictionary of Biomedical Definitions Generated from Ontological Knowledge

Background: More than 400,000 biomedical concepts and some of their rela...
research
08/19/2016

Using Distributed Representations to Disambiguate Biomedical and Clinical Concepts

In this paper, we report a knowledge-based method for Word Sense Disambi...
research
05/11/2023

Detecting Idiomatic Multiword Expressions in Clinical Terminology using Definition-Based Representation Learning

This paper shines a light on the potential of definition-based semantic ...
research
09/21/2017

Retrofitting Concept Vector Representations of Medical Concepts to Improve Estimates of Semantic Similarity and Relatedness

Estimation of semantic similarity and relatedness between biomedical con...
research
09/20/2022

DetCLIP: Dictionary-Enriched Visual-Concept Paralleled Pre-training for Open-world Detection

Open-world object detection, as a more general and challenging goal, aim...
research
02/29/2020

Clinical Text Summarization with Syntax-Based Negation and Semantic Concept Identification

In the era of clinical information explosion, a good strategy for clinic...
research
09/23/2013

An evolutionary approach to Function

Background: Understanding the distinction between function and role is v...

Please sign up or login with your details

Forgot password? Click here to reset