Scientific Language Models for Biomedical Knowledge Base Completion: An Empirical Study

06/17/2021
by   Rahul Nadkarni, et al.
0

Biomedical knowledge graphs (KGs) hold rich information on entities such as diseases, drugs, and genes. Predicting missing links in these graphs can boost many important applications, such as drug design and repurposing. Recent work has shown that general-domain language models (LMs) can serve as "soft" KGs, and that they can be fine-tuned for the task of KG completion. In this work, we study scientific LMs for KG completion, exploring whether we can tap into their latent knowledge to enhance biomedical link prediction. We evaluate several domain-specific LMs, fine-tuning them on datasets centered on drugs and diseases that we represent as KGs and enrich with textual entity descriptions. We integrate the LM-based models with KG embedding models, using a router method that learns to assign each input example to either type of model and provides a substantial boost in performance. Finally, we demonstrate the advantage of LM models in the inductive setting with novel scientific entities. Our datasets and code are made publicly available.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/26/2023

Exploring Large Language Models for Knowledge Graph Completion

Knowledge graphs play a vital role in numerous artificial intelligence t...
research
06/19/2023

Fine-tuning Large Enterprise Language Models via Ontological Reasoning

Large Language Models (LLMs) exploit fine-tuning as a technique to adapt...
research
08/18/2023

Domain Adaptive Code Completion via Language Models and Decoupled Domain Databases

Large Language Models (LLMs) have demonstrated remarkable performance in...
research
06/06/2023

BioBLP: A Modular Framework for Learning on Multimodal Biomedical Knowledge Graphs

Knowledge graphs (KGs) are an important tool for representing complex re...
research
06/20/2023

Harnessing the Power of Adversarial Prompting and Large Language Models for Robust Hypothesis Generation in Astronomy

This study investigates the application of Large Language Models (LLMs),...
research
02/04/2022

Transformers and the representation of biomedical background knowledge

BioBERT and BioMegatron are Transformers models adapted for the biomedic...
research
04/05/2021

BBAEG: Towards BERT-based Biomedical Adversarial Example Generation for Text Classification

Healthcare predictive analytics aids medical decision-making, diagnosis ...

Please sign up or login with your details

Forgot password? Click here to reset