Semi-supervised learning by selective training with pseudo labels via confidence estimation

03/15/2021
by   Masato Ishii, et al.
0

We propose a novel semi-supervised learning (SSL) method that adopts selective training with pseudo labels. In our method, we generate hard pseudo-labels and also estimate their confidence, which represents how likely each pseudo-label is to be correct. Then, we explicitly select which pseudo-labeled data should be used to update the model. Specifically, assuming that loss on incorrectly pseudo-labeled data sensitively increase against data augmentation, we select the data corresponding to relatively small loss after applying data augmentation. The confidence is used not only for screening candidates of pseudo-labeled data to be selected but also for automatically deciding how many pseudo-labeled data should be selected within a mini-batch. Since accurate estimation of the confidence is crucial in our method, we also propose a new data augmentation method, called MixConf, that enables us to obtain confidence-calibrated models even when the number of training data is small. Experimental results with several benchmark datasets validate the advantage of our SSL method as well as MixConf.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/18/2022

Dense FixMatch: a simple semi-supervised learning method for pixel-wise prediction tasks

We propose Dense FixMatch, a simple method for online semi-supervised le...
research
09/18/2023

Towards Self-Adaptive Pseudo-Label Filtering for Semi-Supervised Learning

Recent semi-supervised learning (SSL) methods typically include a filter...
research
01/03/2022

An analysis of over-sampling labeled data in semi-supervised learning with FixMatch

Most semi-supervised learning methods over-sample labeled data when cons...
research
09/16/2022

Confidence-Guided Data Augmentation for Deep Semi-Supervised Training

We propose a new data augmentation technique for semi-supervised learnin...
research
07/13/2023

Intent-calibrated Self-training for Answer Selection in Open-domain Dialogues

Answer selection in open-domain dialogues aims to select an accurate ans...
research
01/25/2022

AggMatch: Aggregating Pseudo Labels for Semi-Supervised Learning

Semi-supervised learning (SSL) has recently proven to be an effective pa...
research
03/02/2023

In all LikelihoodS: How to Reliably Select Pseudo-Labeled Data for Self-Training in Semi-Supervised Learning

Self-training is a simple yet effective method within semi-supervised le...

Please sign up or login with your details

Forgot password? Click here to reset