End-to-End Entity Classification on Multimodal Knowledge Graphs

03/25/2020
by   W. X. Wilcke, et al.
0

End-to-end multimodal learning on knowledge graphs has been left largely unaddressed. Instead, most end-to-end models such as message passing networks learn solely from the relational information encoded in graphs' structure: raw values, or literals, are either omitted completely or are stripped from their values and treated as regular nodes. In either case we lose potentially relevant information which could have otherwise been exploited by our learning methods. To avoid this, we must treat literals and non-literals as separate cases. We must also address each modality separately and accordingly: numbers, texts, images, geometries, et cetera. We propose a multimodal message passing network which not only learns end-to-end from the structure of graphs, but also from their possibly divers set of multimodal node features. Our model uses dedicated (neural) encoders to naturally learn embeddings for node features belonging to five different types of modalities, including images and geometries, which are projected into a joint representation space together with their relational information. We demonstrate our model on a node classification task, and evaluate the effect that each modality has on the overall performance. Our result supports our hypothesis that including information from multiple modalities can help our models obtain a better overall performance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/03/2023

End-to-End Learning on Multimodal Knowledge Graphs

Knowledge graphs enable data scientists to learn end-to-end on heterogen...
research
11/10/2020

Node Attribute Completion in Knowledge Graphs with Multi-Relational Propagation

The existing literature on knowledge graph completion mostly focuses on ...
research
10/26/2022

Meta-node: A Concise Approach to Effectively Learn Complex Relationships in Heterogeneous Graphs

Existing message passing neural networks for heterogeneous graphs rely o...
research
03/04/2022

R-GCN: The R Could Stand for Random

The inception of Relational Graph Convolutional Networks (R-GCNs) marked...
research
07/13/2023

MaxCorrMGNN: A Multi-Graph Neural Network Framework for Generalized Multimodal Fusion of Medical Data for Outcome Prediction

With the emergence of multimodal electronic health records, the evidence...
research
11/14/2016

Zero-resource Machine Translation by Multimodal Encoder-decoder Network with Multimedia Pivot

We propose an approach to build a neural machine translation system with...
research
05/13/2020

Towards Better Graph Representation: Two-Branch Collaborative Graph Neural Networks for Multimodal Marketing Intention Detection

Inspired by the fact that spreading and collecting information through t...

Please sign up or login with your details

Forgot password? Click here to reset