The RareDis corpus: a corpus annotated with rare diseases, their signs and symptoms

The RareDis corpus contains more than 5,000 rare diseases and almost 6,000 clinical manifestations are annotated. Moreover, the Inter Annotator Agreement evaluation shows a relatively high agreement (F1-measure equal to 83.5 exact match criteria for the entities and equal to 81.3 Based on these results, this corpus is of high quality, supposing a significant step for the field since there is a scarcity of available corpus annotated with rare diseases. This could open the door to further NLP applications, which would facilitate the diagnosis and treatment of these rare diseases and, therefore, would improve dramatically the quality of life of these patients.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/01/2021

Exploring deep learning methods for recognizing rare diseases and their clinical manifestations from texts

Although rare diseases are characterized by low prevalence, approximatel...
research
06/22/2023

Identifying and Extracting Rare Disease Phenotypes with Large Language Models

Rare diseases (RDs) are collectively common and affect 300 million peopl...
research
12/06/2018

Adpositional Supersenses for Mandarin Chinese

This study adapts Semantic Network of Adposition and Case Supersenses (S...
research
04/25/2019

Terminologies augmented recurrent neural network model for clinical named entity recognition

We aimed to enhance the performance of a supervised model for clinical n...
research
03/24/2021

Finnish Paraphrase Corpus

In this paper, we introduce the first fully manually annotated paraphras...
research
05/05/2021

Rare Disease Identification from Clinical Notes with Ontologies and Weak Supervision

The identification of rare diseases from clinical notes with Natural Lan...
research
01/19/2017

Rare Disease Physician Targeting: A Factor Graph Approach

In rare disease physician targeting, a major challenge is how to identif...

Please sign up or login with your details

Forgot password? Click here to reset