Semi-Supervised Heterogeneous Graph Learning with Multi-level Data Augmentation

11/30/2022
by Ying Chen, et al.

In recent years, semi-supervised graph learning with data augmentation (DA) has become the most commonly used and best-performing approach for improving model robustness in sparse scenarios with few labeled samples. Compared with homogeneous graphs, DA on heterogeneous graphs poses greater challenges: the heterogeneity of information requires DA strategies to handle heterogeneous relations effectively, accounting for the information that different types of neighbors and edges contribute to the target nodes. Furthermore, information over-squashing is caused by negative curvature, which arises from the non-uniform distribution and strong clustering found in complex graphs. To address these challenges, this paper presents a novel method named Semi-Supervised Heterogeneous Graph Learning with Multi-level Data Augmentation (HG-MDA). To handle the heterogeneity of information in DA, node and topology augmentation strategies are proposed that are tailored to the characteristics of heterogeneous graphs, and meta-relation-based attention is used as one of the indexes for selecting augmented nodes and edges. To handle information over-squashing, triangle-based edge adding and removing are designed to alleviate negative curvature and bring topological gains. Finally, the loss function consists of a cross-entropy loss for labeled data and a consistency regularization term for unlabeled data. Sharpening is used to fuse the prediction results of the various DA strategies effectively. Experiments on the public datasets ACM, DBLP, and OGB and on the industry dataset MB show that HG-MDA outperforms current SOTA models. Additionally, HG-MDA is applied to user identification in internet finance scenarios, helping the business to add 30 and 9.8
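The abstract names the pieces of the loss (cross-entropy on labeled nodes, consistency regularization on unlabeled nodes, and sharpening to fuse predictions from several augmentations) but does not give formulas. The following is a minimal sketch of one common way to combine them, not the paper's exact formulation. It assumes temperature-based sharpening of the averaged prediction and a squared-distance consistency term; the function names, the `temperature` parameter, and the weight `lam` are all hypothetical.

```python
import math

def sharpen(probs, temperature=0.5):
    # Assumed sharpening rule: raise each probability to 1/T and renormalize.
    # A lower temperature gives a more peaked distribution.
    powered = [p ** (1.0 / temperature) for p in probs]
    total = sum(powered)
    return [p / total for p in powered]

def cross_entropy(probs, label):
    # Supervised loss for one labeled node with a known class index.
    return -math.log(probs[label])

def consistency(preds_per_aug, temperature=0.5):
    # Fuse the predictions from all augmentations by averaging, sharpen the
    # average, then penalize each augmented prediction's squared distance
    # to that sharpened target (an assumed choice of distance).
    n_aug = len(preds_per_aug)
    n_cls = len(preds_per_aug[0])
    mean = [sum(p[c] for p in preds_per_aug) / n_aug for c in range(n_cls)]
    target = sharpen(mean, temperature)
    return sum(
        sum((p[c] - target[c]) ** 2 for c in range(n_cls))
        for p in preds_per_aug
    ) / n_aug

def total_loss(labeled, unlabeled, lam=1.0, temperature=0.5):
    # labeled:   list of (class_probabilities, true_label) pairs
    # unlabeled: list of per-node lists, one prediction per augmentation
    # lam:       hypothetical weight on the consistency term
    sup = sum(cross_entropy(p, y) for p, y in labeled) / len(labeled)
    unsup = sum(consistency(preds, temperature) for preds in unlabeled) / len(unlabeled)
    return sup + lam * unsup
```

In this sketch the consistency term is zero when every augmentation yields the same confident prediction and grows as the augmented predictions disagree with the sharpened consensus, which is the behavior consistency regularization is meant to encourage.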


research
07/27/2020

Semi-Supervised Learning with Data Augmentation for End-to-End ASR

In this paper, we apply Semi-Supervised Learning (SSL) along with Data A...
research
05/25/2023

Jointprop: Joint Semi-supervised Learning for Entity and Relation Extraction with Heterogeneous Graph-based Propagation

Semi-supervised learning has been an important approach to address chall...
research
08/10/2023

Cross-heterogeneity Graph Few-shot Learning

In recent years, heterogeneous graph few-shot learning has been proposed...
research
02/16/2022

A Data-Augmentation Is Worth A Thousand Samples: Exact Quantification From Analytical Augmented Sample Moments

Data-Augmentation (DA) is known to improve performance across tasks and ...
research
09/16/2022

Confidence-Guided Data Augmentation for Deep Semi-Supervised Training

We propose a new data augmentation technique for semi-supervised learnin...
research
06/20/2019

Efficient data augmentation using graph imputation neural networks

Recently, data augmentation in the semi-supervised regime, where unlabel...
research
05/14/2019

ActiveHNE: Active Heterogeneous Network Embedding

Heterogeneous network embedding (HNE) is a challenging task due to the d...
