A Silver Standard Corpus of Human Phenotype-Gene Relations

03/26/2019
by Diana Sousa, et al.

Human phenotype-gene relations are fundamental to fully understanding the origin of some phenotypic abnormalities and their associated diseases. Biomedical literature is the most comprehensive source of these relations; however, we need Relation Extraction tools to recognize them automatically. Most of these tools require an annotated corpus and, to the best of our knowledge, no corpus annotated with human phenotype-gene relations is available. This paper presents the Phenotype-Gene Relations (PGR) corpus, a silver standard corpus of human phenotype and gene annotations and their relations. The corpus consists of 1712 abstracts, 5676 human phenotype annotations, 13835 gene annotations, and 4283 relations. We generated this corpus using Named-Entity Recognition tools, whose results were partially evaluated by eight curators, obtaining a precision of 87.01%. Using the corpus, we obtained promising results with two state-of-the-art deep learning tools, namely 78.05% precision. The PGR corpus was made publicly available to the research community.


