Positive-Unlabeled Learning with Uncertainty-aware Pseudo-label Selection

01/31/2022
by   Emilio Dorigatti, et al.
12

Pseudo-labeling solutions for positive-unlabeled (PU) learning have the potential to result in higher performance compared to cost-sensitive learning but are vulnerable to incorrectly estimated pseudo-labels. In this paper, we provide a theoretical analysis of a risk estimator that combines risk on PU and pseudo-labeled data. Furthermore, we show analytically as well as experimentally that such an estimator results in lower excess risk compared to using PU data alone, provided that enough samples are pseudo-labeled with acceptable error rates. We then propose PUUPL, a novel training procedure for PU learning that leverages the epistemic uncertainty of an ensemble of deep neural networks to minimize errors in pseudo-label selection. We conclude with extensive experiments showing the effectiveness of our proposed algorithm over different datasets, modalities, and learning tasks. These show that PUUPL enables a reduction of up to 20 negative samples are not provided for validation, setting a new state-of-the-art for PU learning.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/29/2021

Multi-class Probabilistic Bounds for Self-learning

Self-learning is a classical approach for learning with both labeled and...
research
08/31/2022

Seq-UPS: Sequential Uncertainty-aware Pseudo-label Selection for Semi-Supervised Text Recognition

This paper looks at semi-supervised learning (SSL) for image-based text ...
research
01/15/2021

In Defense of Pseudo-Labeling: An Uncertainty-Aware Pseudo-label Selection Framework for Semi-Supervised Learning

The recent research in semi-supervised learning (SSL) is mostly dominate...
research
05/04/2023

Class-Distribution-Aware Pseudo Labeling for Semi-Supervised Multi-Label Learning

Pseudo labeling is a popular and effective method to leverage the inform...
research
08/22/2022

PLMCL: Partial-Label Momentum Curriculum Learning for Multi-Label Image Classification

Multi-label image classification aims to predict all possible labels in ...
research
10/15/2022

How Does Pseudo-Labeling Affect the Generalization Error of the Semi-Supervised Gibbs Algorithm?

This paper provides an exact characterization of the expected generaliza...
research
03/25/2021

Prediction in the presence of response-dependent missing labels

In a variety of settings, limitations of sensing technologies or other s...

Please sign up or login with your details

Forgot password? Click here to reset