Metropolis-Hastings Data Augmentation for Graph Neural Networks

03/26/2022
by   Hyeonjin Park, et al.
46

Graph Neural Networks (GNNs) often suffer from weak-generalization due to sparsely labeled data despite their promising results on various graph-based tasks. Data augmentation is a prevalent remedy to improve the generalization ability of models in many domains. However, due to the non-Euclidean nature of data space and the dependencies between samples, designing effective augmentation on graphs is challenging. In this paper, we propose a novel framework Metropolis-Hastings Data Augmentation (MH-Aug) that draws augmented graphs from an explicit target distribution for semi-supervised learning. MH-Aug produces a sequence of augmented graphs from the target distribution enables flexible control of the strength and diversity of augmentation. Since the direct sampling from the complex target distribution is challenging, we adopt the Metropolis-Hastings algorithm to obtain the augmented samples. We also propose a simple and effective semi-supervised learning strategy with generated samples from MH-Aug. Our extensive experiments demonstrate that MH-Aug can generate a sequence of samples according to the target distribution to significantly improve the performance of GNNs.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/11/2020

Data Augmentation for Graph Neural Networks

Data augmentation has been widely used to improve generalizability of ma...
research
03/12/2022

GRAND+: Scalable Graph Random Neural Networks

Graph neural networks (GNNs) have been widely adopted for semi-supervise...
research
03/26/2021

DivAug: Plug-in Automated Data Augmentation with Explicit Diversity Maximization

Human-designed data augmentation strategies have been replaced by automa...
research
09/14/2023

SC-MAD: Mixtures of Higher-order Networks for Data Augmentation

The myriad complex systems with multiway interactions motivate the exten...
research
08/16/2023

Graph Out-of-Distribution Generalization with Controllable Data Augmentation

Graph Neural Network (GNN) has demonstrated extraordinary performance in...
research
07/20/2022

Revisiting data augmentation for subspace clustering

Subspace clustering is the classical problem of clustering a collection ...
research
02/21/2023

Diffusion Probabilistic Models for Graph-Structured Prediction

This paper studies graph-structured prediction for supervised learning o...

Please sign up or login with your details

Forgot password? Click here to reset