Exploring deep learning methods for recognizing rare diseases and their clinical manifestations from texts

09/01/2021
by   Isabel Segura-Bedmar, et al.
12

Although rare diseases are characterized by low prevalence, approximately 300 million people are affected by a rare disease. The early and accurate diagnosis of these conditions is a major challenge for general practitioners, who do not have enough knowledge to identify them. In addition to this, rare diseases usually show a wide variety of manifestations, which might make the diagnosis even more difficult. A delayed diagnosis can negatively affect the patient's life. Therefore, there is an urgent need to increase the scientific and medical knowledge about rare diseases. Natural Language Processing (NLP) and Deep Learning can help to extract relevant information about rare diseases to facilitate their diagnosis and treatments. The paper explores the use of several deep learning techniques such as Bidirectional Long Short Term Memory (BiLSTM) networks or deep contextualized word representations based on Bidirectional Encoder Representations from Transformers (BERT) to recognize rare diseases and their clinical manifestations (signs and symptoms) in the RareDis corpus. This corpus contains more than 5,000 rare diseases and almost 6,000 clinical manifestations. BioBERT, a domain-specific language representation based on BERT and trained on biomedical corpora, obtains the best results. In particular, this model obtains an F1-score of 85.2 diseases, outperforming all the other models.

READ FULL TEXT

page 2

page 3

page 4

page 5

page 6

page 7

page 9

page 10

research
08/02/2021

The RareDis corpus: a corpus annotated with rare diseases, their signs and symptoms

The RareDis corpus contains more than 5,000 rare diseases and almost 6,0...
research
11/25/2018

A Model-Based Reinforcement Learning Approach for a Rare Disease Diagnostic Task

In this work, we present our various contributions to the objective of b...
research
02/16/2018

How does undone science get funded? A bibliometric analysis linking rare diseases publications to national and European funding sources

One of the notable features of undone science debates is how formation o...
research
05/05/2021

Rare Disease Identification from Clinical Notes with Ontologies and Weak Supervision

The identification of rare diseases from clinical notes with Natural Lan...
research
10/09/2021

Unsupervised Representation Learning Meets Pseudo-Label Supervised Self-Distillation: A New Approach to Rare Disease Classification

Rare diseases are characterized by low prevalence and are often chronica...
research
11/26/2019

CONAN: Complementary Pattern Augmentation for Rare Disease Detection

Rare diseases affect hundreds of millions of people worldwide but are ha...
research
12/24/2020

Pain Assessment based on fNIRS using Bidirectional LSTMs

Assessing pain in patients unable to speak (also called non-verbal patie...

Please sign up or login with your details

Forgot password? Click here to reset