DGL-KE: Training Knowledge Graph Embeddings at Scale

04/18/2020
by Da Zheng, et al.

Knowledge graphs have emerged as a key abstraction for organizing information in diverse domains, and their embeddings are increasingly used to harness that information in various information retrieval and machine learning tasks. However, the ever-growing size of knowledge graphs requires computationally efficient algorithms capable of scaling to graphs with millions of nodes and billions of edges. This paper presents DGL-KE, an open-source package to efficiently compute knowledge graph embeddings. DGL-KE introduces various novel optimizations that accelerate training on knowledge graphs with millions of nodes and billions of edges using multi-processing, multi-GPU, and distributed parallelism. These optimizations are designed to increase data locality, reduce communication overhead, overlap computations with memory accesses, and achieve high operation efficiency. Experiments on knowledge graphs consisting of over 86M nodes and 338M edges show that DGL-KE can compute embeddings in 100 minutes on an EC2 instance with 8 GPUs and in 30 minutes on an EC2 cluster of 4 machines with 48 cores per machine. These results represent a 2x to 5x speedup over the best competing approaches. DGL-KE is available at https://github.com/awslabs/dgl-ke.

