BioBLP: A Modular Framework for Learning on Multimodal Biomedical Knowledge Graphs

06/06/2023
by   Daniel Daza, et al.
0

Knowledge graphs (KGs) are an important tool for representing complex relationships between entities in the biomedical domain. Several methods have been proposed for learning embeddings that can be used to predict new links in such graphs. Some methods ignore valuable attribute data associated with entities in biomedical KGs, such as protein sequences, or molecular graphs. Other works incorporate such data, but assume that entities can be represented with the same data modality. This is not always the case for biomedical KGs, where entities exhibit heterogeneous modalities that are central to their representation in the subject domain. We propose a modular framework for learning embeddings in KGs with entity attributes, that allows encoding attribute data of different modalities while also supporting entities with missing attributes. We additionally propose an efficient pretraining strategy for reducing the required training runtime. We train models using a biomedical KG containing approximately 2 million triples, and evaluate the performance of the resulting entity embeddings on the tasks of link prediction, and drug-protein interaction prediction, comparing against methods that do not take attribute data into account. In the standard link prediction evaluation, the proposed method results in competitive, yet lower performance than baselines that do not use attribute data. When evaluated in the task of drug-protein interaction prediction, the method compares favorably with the baselines. We find settings involving low degree entities, which make up for a substantial amount of the set of entities in the KG, where our method outperforms the baselines. Our proposed pretraining strategy yields significantly higher performance while reducing the required training runtime. Our implementation is available at https://github.com/elsevier-AI-Lab/BioBLP .

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/28/2021

Heterogeneous Graph based Deep Learning for Biomedical Network Link Prediction

Multi-scale biomedical knowledge networks are expanding with emerging ex...
research
10/22/2021

Drug Similarity and Link Prediction Using Graph Embeddings on Medical Knowledge Graphs

The paper utilizes the graph embeddings generated for entities of a larg...
research
06/17/2021

Scientific Language Models for Biomedical Knowledge Base Completion: An Empirical Study

Biomedical knowledge graphs (KGs) hold rich information on entities such...
research
08/16/2017

Multi-task Neural Network for Non-discrete Attribute Prediction in Knowledge Graphs

Many popular knowledge graphs such as Freebase, YAGO or DBPedia maintain...
research
02/06/2020

Message Passing for Query Answering over Knowledge Graphs

Logic-based systems for query answering over knowledge graphs return onl...
research
06/28/2021

Modelling Monotonic and Non-Monotonic Attribute Dependencies with Embeddings: A Theoretical Analysis

During the last decade, entity embeddings have become ubiquitous in Arti...
research
12/10/2019

OpenBioLink: A resource and benchmarking framework for large-scale biomedical link prediction

SUMMARY: Recently, novel machine-learning algorithms have shown potentia...

Please sign up or login with your details

Forgot password? Click here to reset