Constructing large scale biomedical knowledge bases from scratch with rapid annotation of interpretable patterns

07/02/2019
by   Julien Fauqueur, et al.
0

Knowledge base construction is crucial for summarising, understanding and inferring relationships between biomedical entities. However, for many practical applications such as drug discovery, the scarcity of relevant facts (e.g. gene X is therapeutic target for disease Y) severely limits a domain expert's ability to create a usable knowledge base, either directly or by training a relation extraction model. In this paper, we present a simple and effective method of extracting new facts with a pre-specified binary relationship type from the biomedical literature, without requiring any training data or hand-crafted rules. Our system discovers, ranks and presents the most salient patterns to domain experts in an interpretable form. By marking patterns as compatible with the desired relationship type, experts indirectly batch-annotate candidate pairs whose relationship is expressed with such patterns in the literature. Even with a complete absence of seed data, experts are able to discover thousands of high-quality pairs with the desired relationship within minutes. When a small number of relevant pairs do exist - even when their relationship is more general (e.g. gene X is biologically associated with disease Y) than the relationship of interest - our system leverages them in order to i) learn a better ranking of the patterns to be annotated or ii) generate weakly labelled pairs in a fully automated manner. We evaluate our method both intrinsically and via a downstream knowledge base completion task, and show that it is an effective way of constructing knowledge bases when few or no relevant facts are already available.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/30/2021

Knowledge Base Completion Meets Transfer Learning

The aim of knowledge base completion is to predict unseen facts from exi...
research
05/26/2019

Path Ranking with Attention to Type Hierarchies

The knowledge base completion problem is the problem of inferring missin...
research
07/13/2023

Towards Populating Generalizable Engineering Design Knowledge

Aiming to populate generalizable engineering design knowledge, we propos...
research
04/10/2022

MedDistant19: A Challenging Benchmark for Distantly Supervised Biomedical Relation Extraction

Relation Extraction in the biomedical domain is challenging due to the l...
research
04/21/2019

Fact Discovery from Knowledge Base via Facet Decomposition

During the past few decades, knowledge bases (KBs) have experienced rapi...
research
07/01/2021

Essence of Factual Knowledge

Knowledge bases are collections of domain-specific and commonsense facts...
research
02/04/2021

Towards a Flexible System Architecture for Automated Knowledge Base Construction Frameworks

Although knowledge bases play an important role in many domains (includi...

Please sign up or login with your details

Forgot password? Click here to reset