Distributed Graph Embedding with Information-Oriented Random Walks

03/28/2023
by   Peng Fang, et al.
0

Graph embedding maps graph nodes to low-dimensional vectors, and is widely adopted in machine learning tasks. The increasing availability of billion-edge graphs underscores the importance of learning efficient and effective embeddings on large graphs, such as link prediction on Twitter with over one billion edges. Most existing graph embedding methods fall short of reaching high data scalability. In this paper, we present a general-purpose, distributed, information-centric random walk-based graph embedding framework, DistGER, which can scale to embed billion-edge graphs. DistGER incrementally computes information-centric random walks. It further leverages a multi-proximity-aware, streaming, parallel graph partitioning strategy, simultaneously achieving high local partition quality and excellent workload balancing across machines. DistGER also improves the distributed Skip-Gram learning model to generate node embeddings by optimizing the access locality, CPU throughput, and synchronization efficiency. Experiments on real-world graphs demonstrate that compared to state-of-the-art distributed graph embedding frameworks, including KnightKing, DistDGL, and Pytorch-BigGraph, DistGER exhibits 2.33x-129x acceleration, 45 communication, and > 10

READ FULL TEXT

page 5

page 6

page 8

page 9

page 11

page 12

page 13

page 16

research
09/15/2022

Hub-aware Random Walk Graph Embedding Methods for Classification

In the last two decades we are witnessing a huge increase of valuable bi...
research
08/11/2023

Node Embedding for Homophilous Graphs with ARGEW: Augmentation of Random walks by Graph Edge Weights

Representing nodes in a network as dense vectors node embeddings is impo...
research
06/07/2020

Distributed-Memory Vertex-Centric Network Embedding for Large-Scale Graphs

Network embedding is an important step in many different computations ba...
research
05/28/2019

Triple2Vec: Learning Triple Embeddings from Knowledge Graphs

Graph embedding techniques allow to learn high-quality feature vectors f...
research
01/16/2020

ExEm: Expert Embedding using dominating set theory with deep learning approaches

A collaborative network is a social network that is comprised of experts...
research
07/03/2019

Graph Embeddings at Scale

Graph embedding is a popular algorithmic approach for creating vector re...
research
05/16/2019

Scalable Graph Embeddings via Sparse Transpose Proximities

Graph embedding learns low-dimensional representations for nodes in a gr...

Please sign up or login with your details

Forgot password? Click here to reset