DeHIN: A Decentralized Framework for Embedding Large-scale Heterogeneous Information Networks

01/08/2022
by   Mubashir Imran, et al.
6

Modeling heterogeneity by extraction and exploitation of high-order information from heterogeneous information networks (HINs) has been attracting immense research attention in recent times. Such heterogeneous network embedding (HNE) methods effectively harness the heterogeneity of small-scale HINs. However, in the real world, the size of HINs grow exponentially with the continuous introduction of new nodes and different types of links, making it a billion-scale network. Learning node embeddings on such HINs creates a performance bottleneck for existing HNE methods that are commonly centralized, i.e., complete data and the model are both on a single machine. To address large-scale HNE tasks with strong efficiency and effectiveness guarantee, we present Decentralized Embedding Framework for Heterogeneous Information Network (DeHIN) in this paper. In DeHIN, we generate a distributed parallel pipeline that utilizes hypergraphs in order to infuse parallelization into the HNE task. DeHIN presents a context preserving partition mechanism that innovatively formulates a large HIN as a hypergraph, whose hyperedges connect semantically similar nodes. Our framework then adopts a decentralized strategy to efficiently partition HINs by adopting a tree-like pipeline. Then, each resulting subnetwork is assigned to a distributed worker, which employs the deep information maximization theorem to locally learn node embeddings from the partition it receives. We further devise a novel embedding alignment scheme to precisely project independently learned node embeddings from all subnetworks onto a common vector space, thus allowing for downstream tasks like link prediction and node classification.

READ FULL TEXT

page 2

page 4

page 7

page 8

page 9

page 10

page 11

page 12

research
08/23/2020

MultiVERSE: a multiplex and multiplex-heterogeneous network embedding approach

Network embedding approaches are gaining momentum to analyse a large var...
research
11/30/2020

A Survey on Heterogeneous Graph Embedding: Methods, Techniques, Applications and Sources

Heterogeneous graphs (HGs) also known as heterogeneous information netwo...
research
07/12/2017

Deep Gaussian Embedding of Graphs: Unsupervised Inductive Learning via Ranking

Methods that learn representations of graph nodes play a critical role i...
research
04/17/2019

Compositional Network Embedding

Network embedding has proved extremely useful in a variety of network an...
research
08/20/2021

Semi-supervised Network Embedding with Differentiable Deep Quantisation

Learning accurate low-dimensional embeddings for a network is a crucial ...
research
01/29/2019

Representation Learning for Heterogeneous Information Networks via Embedding Events

Network representation learning (NRL) has been widely used to help analy...
research
07/03/2019

Graph Embeddings at Scale

Graph embedding is a popular algorithmic approach for creating vector re...

Please sign up or login with your details

Forgot password? Click here to reset