Class-Distribution-Aware Pseudo Labeling for Semi-Supervised Multi-Label Learning

05/04/2023
by   Ming-Kun Xie, et al.
0

Pseudo labeling is a popular and effective method to leverage the information of unlabeled data. Conventional instance-aware pseudo labeling methods often assign each unlabeled instance with a pseudo label based on its predicted probabilities. However, due to the unknown number of true labels, these methods cannot generalize well to semi-supervised multi-label learning (SSMLL) scenarios, since they would suffer from the risk of either introducing false positive labels or neglecting true positive ones. In this paper, we propose to solve the SSMLL problems by performing Class-distribution-Aware Pseudo labeling (CAP), which encourages the class distribution of pseudo labels to approximate the true one. Specifically, we design a regularized learning framework consisting of the class-aware thresholds to control the number of pseudo labels for each class. Given that the labeled and unlabeled examples are sampled according to the same distribution, we determine the thresholds by exploiting the empirical class distribution, which can be treated as a tight approximation to the true one. Theoretically, we show that the generalization performance of the proposed method is dependent on the pseudo labeling error, which can be significantly reduced by the CAP strategy. Extensive experimental results on multiple benchmark datasets validate that CAP can effectively solve the SSMLL problems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/10/2021

Distribution-Aware Semantics-Oriented Pseudo-label for Imbalanced Semi-Supervised Learning

The capability of the traditional semi-supervised learning (SSL) methods...
research
08/31/2022

Seq-UPS: Sequential Uncertainty-aware Pseudo-label Selection for Semi-Supervised Text Recognition

This paper looks at semi-supervised learning (SSL) for image-based text ...
research
08/30/2022

PercentMatch: Percentile-based Dynamic Thresholding for Multi-Label Semi-Supervised Classification

While much of recent study in semi-supervised learning (SSL) has achieve...
research
08/23/2023

Semi-Supervised Learning via Weight-aware Distillation under Class Distribution Mismatch

Semi-Supervised Learning (SSL) under class distribution mismatch aims to...
research
09/29/2021

Active Refinement for Multi-Label Learning: A Pseudo-Label Approach

The goal of multi-label learning (MLL) is to associate a given instance ...
research
02/17/2023

Learning from Label Proportion with Online Pseudo-Label Decision by Regret Minimization

This paper proposes a novel and efficient method for Learning from Label...
research
01/31/2022

Positive-Unlabeled Learning with Uncertainty-aware Pseudo-label Selection

Pseudo-labeling solutions for positive-unlabeled (PU) learning have the ...

Please sign up or login with your details

Forgot password? Click here to reset