Graph Out-of-Distribution Generalization with Controllable Data Augmentation

08/16/2023
by   Bin Lu, et al.
0

Graph Neural Network (GNN) has demonstrated extraordinary performance in classifying graph properties. However, due to the selection bias of training and testing data (e.g., training on small graphs and testing on large graphs, or training on dense graphs and testing on sparse graphs), distribution deviation is widespread. More importantly, we often observe hybrid structure distribution shift of both scale and density, despite of one-sided biased data partition. The spurious correlations over hybrid distribution deviation degrade the performance of previous GNN methods and show large instability among different datasets. To alleviate this problem, we propose to jointly manipulate the training distribution with controllable data augmentation in metric space. Specifically, we first extract the graph rationales to eliminate the spurious correlations due to irrelevant information. Secondly, we generate virtual samples with perturbation on graph rationale representation domain to obtain potential OOD training samples. Finally, we propose OOD calibration to measure the distribution deviation of virtual samples by leveraging Extreme Value Theory, and further actively control the training distribution by emphasizing the impact of virtual OOD samples. Extensive studies on several real-world datasets on graph classification demonstrate the superiority of our proposed method over state-of-the-art baselines.

READ FULL TEXT

page 1

page 10

page 13

research
12/07/2021

OOD-GNN: Out-of-Distribution Generalized Graph Neural Network

Graph neural networks (GNNs) have achieved impressive performance when t...
research
03/26/2022

Metropolis-Hastings Data Augmentation for Graph Neural Networks

Graph Neural Networks (GNNs) often suffer from weak-generalization due t...
research
11/10/2021

Graph Transplant: Node Saliency-Guided Graph Mixup with Local Structure Preservation

Graph-structured datasets usually have irregular graph sizes and connect...
research
03/29/2021

Graph Classification by Mixture of Diverse Experts

Graph classification is a challenging research problem in many applicati...
research
11/20/2021

Generalizing Graph Neural Networks on Out-Of-Distribution Graphs

Graph Neural Networks (GNNs) are proposed without considering the agnost...
research
10/02/2022

Metric Distribution to Vector: Constructing Data Representation via Broad-Scale Discrepancies

Graph embedding provides a feasible methodology to conduct pattern class...
research
02/06/2023

Energy-based Out-of-Distribution Detection for Graph Neural Networks

Learning on graphs, where instance nodes are inter-connected, has become...

Please sign up or login with your details

Forgot password? Click here to reset