Synthetic Over-sampling for Imbalanced Node Classification with Graph Neural Networks

06/10/2022
by   Tianxiang Zhao, et al.
0

In recent years, graph neural networks (GNNs) have achieved state-of-the-art performance for node classification. However, most existing GNNs would suffer from the graph imbalance problem. In many real-world scenarios, node classes are imbalanced, with some majority classes making up most parts of the graph. The message propagation mechanism in GNNs would further amplify the dominance of those majority classes, resulting in sub-optimal classification performance. In this work, we seek to address this problem by generating pseudo instances of minority classes to balance the training data, extending previous over-sampling-based techniques. This task is non-trivial, as those techniques are designed with the assumption that instances are independent. Neglection of relation information would complicate this oversampling process. Furthermore, the node classification task typically takes the semi-supervised setting with only a few labeled nodes, providing insufficient supervision for the generation of minority instances. Generated new nodes of low quality would harm the trained classifier. In this work, we address these difficulties by synthesizing new nodes in a constructed embedding space, which encodes both node attributes and topology information. Furthermore, an edge generator is trained simultaneously to model the graph structure and provide relations for new samples. To further improve the data efficiency, we also explore synthesizing mixed “in-between” nodes to utilize nodes from the majority class in this over-sampling process. Experiments on real-world datasets validate the effectiveness of our proposed framework.

READ FULL TEXT
research
03/16/2021

GraphSMOTE: Imbalanced Node Classification on Graphs with Graph Neural Networks

Node classification is an important research topic in graph learning. Gr...
research
06/05/2021

ImGAGN:Imbalanced Network Embedding via Generative Adversarial Graph Networks

Imbalanced classification on graphs is ubiquitous yet challenging in man...
research
02/24/2023

GraphSR: A Data Augmentation Algorithm for Imbalanced Node Classification

Graph neural networks (GNNs) have achieved great success in node classif...
research
03/18/2023

UNREAL:Unlabeled Nodes Retrieval and Labeling for Heavily-imbalanced Node Classification

Extremely skewed label distributions are common in real-world node class...
research
04/28/2023

Imbalanced Node Classification Beyond Homophilic Assumption

Imbalanced node classification widely exists in real-world networks wher...
research
06/16/2023

GraphSHA: Synthesizing Harder Samples for Class-Imbalanced Node Classification

Class imbalance is the phenomenon that some classes have much fewer inst...
research
06/21/2021

GraphMixup: Improving Class-Imbalanced Node Classification on Graphs by Self-supervised Context Prediction

Recent years have witnessed great success in handling node classificatio...

Please sign up or login with your details

Forgot password? Click here to reset