Heterogeneous Network Representation Learning: Survey, Benchmark, Evaluation, and Beyond

04/01/2020
by Carl Yang, et al.

Because real-world objects and their interactions are often multi-modal and multi-typed, heterogeneous networks have been widely used as a more powerful, realistic, and generic superclass of traditional homogeneous networks (graphs). Meanwhile, representation learning (also known as embedding) has recently been studied intensively and shown to be effective for various network mining and analytical tasks. Although a broad body of heterogeneous network embedding (HNE) algorithms already exists, there is no dedicated survey; as the first contribution of this work, we provide a unified paradigm for systematically categorizing existing HNE algorithms and analyzing their merits. Moreover, existing HNE algorithms, though mostly claimed to be generic, are often evaluated on different datasets. This is understandable given that HNE is naturally application-driven, but such indirect comparisons make it hard to tell whether an improvement in task performance comes from effective data preprocessing or from novel technical design, especially given the many possible ways to construct a heterogeneous network from real-world application data. Therefore, as the second contribution, we create four benchmark datasets from different sources, with various properties regarding scale, structure, attribute/label availability, and more, for the comprehensive evaluation of HNE algorithms. As the third contribution, we carefully refactor and amend the implementations of ten popular HNE algorithms, create friendly interfaces for them, and provide all-around comparisons among them over multiple tasks and experimental settings.


research
09/14/2021

Network representation learning systematic review: ancestors and current development state

Real-world information networks are increasingly occurring across variou...
research
11/21/2021

Network representation learning: A macro and micro view

Graph is a universe data structure that is widely used to organize data ...
research
07/10/2023

Source-Aware Embedding Training on Heterogeneous Information Networks

Heterogeneous information networks (HINs) have been extensively applied ...
research
07/19/2020

A Multi-Semantic Metapath Model for Large Scale Heterogeneous Network Representation Learning

Network Embedding has been widely studied to model and manage data in a ...
research
10/14/2021

Network Representation Learning: From Preprocessing, Feature Extraction to Node Embedding

Network representation learning (NRL) advances the conventional graph mi...
research
04/28/2020

Heterogeneous Representation Learning: A Review

The real-world data usually exhibits heterogeneous properties such as mo...
