GraphSHA: Synthesizing Harder Samples for Class-Imbalanced Node Classification

06/16/2023
by Wen-Zhi Li, et al.

Class imbalance, in which some classes have far fewer instances than others, is ubiquitous in real-world graph-structured scenarios. Recent studies find that off-the-shelf Graph Neural Networks (GNNs) under-represent minor-class samples. We investigate this phenomenon and find that its main cause is that the subspaces of minor classes are squeezed by those of the major classes in the latent space. This motivates us to enlarge the decision boundaries of minor classes, and we propose GraphSHA, a general framework that does so by Synthesizing HArder minor samples. Furthermore, to keep the enlarged minor boundary from violating the subspaces of neighbor classes, we propose a module called SemiMixup, which transmits enlarged boundary information to the interior of the minor classes while blocking information propagation from minor classes to neighbor classes. Empirically, GraphSHA is effective at enlarging the decision boundaries of minor classes: it outperforms various baseline methods in class-imbalanced node classification with different GNN backbone encoders on seven public benchmark datasets. Code is available at https://github.com/wenzhilics/GraphSHA.
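The abstract gives no formulas, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes a mixup-style interpolation between a minor-class anchor node and an auxiliary node from a neighboring class, and it assumes that the synthetic node inherits edges from the anchor only, as one way to keep information from flowing toward the neighbor class. The function name `synthesize_harder_sample` and the parameter `delta` are hypothetical.

```python
def synthesize_harder_sample(x_anchor, x_aux, anchor_neighbors, delta):
    """Mix a minor-class anchor with an auxiliary node from a neighbor class.

    Assumption: features are linearly interpolated, so the synthetic node
    lies between the two classes and is "harder" than the anchor itself.
    Assumption: edges are copied from the anchor only, so the synthetic
    node never connects into the auxiliary node's neighborhood.
    """
    if not 0.0 <= delta <= 1.0:
        raise ValueError("delta must lie in [0, 1]")
    if len(x_anchor) != len(x_aux):
        raise ValueError("feature vectors must have the same length")

    # Feature mixup: delta weights the minor-class anchor.
    x_syn = [delta * a + (1.0 - delta) * b for a, b in zip(x_anchor, x_aux)]

    # One-sided ("semi") edge construction: reuse the anchor's neighbors only.
    edges_syn = list(anchor_neighbors)
    return x_syn, edges_syn


# Example: a 2-dimensional anchor with neighbors 3 and 7, mixed 75/25.
x_syn, nbrs = synthesize_harder_sample([1.0, 0.0], [0.0, 1.0], [3, 7], delta=0.75)
```

A smaller `delta` moves the synthetic node closer to the neighbor class, which makes it harder to classify but raises the risk of intruding on that class; how the mixing weight and the auxiliary node are actually chosen is described in the full paper and the linked repository.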

research
12/01/2021

Imbalanced Graph Classification via Graph-of-Graph Neural Networks

Graph Neural Networks (GNNs) have achieved unprecedented success in lear...
research
06/21/2021

GraphMixup: Improving Class-Imbalanced Node Classification on Graphs by Self-supervised Context Prediction

Recent years have witnessed great success in handling node classificatio...
research
06/10/2022

Synthetic Over-sampling for Imbalanced Node Classification with Graph Neural Networks

In recent years, graph neural networks (GNNs) have achieved state-of-the...
research
04/05/2021

Procrustean Training for Imbalanced Deep Learning

Neural networks trained with class-imbalanced data are known to perform ...
research
07/12/2017

Influence of Resampling on Accuracy of Imbalanced Classification

In many real-world binary classification tasks (e.g. detection of certai...
research
08/22/2022

LTE4G: Long-Tail Experts for Graph Neural Networks

Existing Graph Neural Networks (GNNs) usually assume a balanced situatio...
research
12/06/2018

RDEC: Integrating Regularization into Deep Embedded Clustering for Imbalanced Datasets

Clustering is a fundamental machine learning task and can be used in man...
