Iterative Pseudo-Labeling with Deep Feature Annotation and Confidence-Based Sampling

09/06/2021
by   Barbara C Benato, et al.
0

Training deep neural networks is challenging when large and annotated datasets are unavailable. Extensive manual annotation of data samples is time-consuming, expensive, and error-prone, notably when it needs to be done by experts. To address this issue, increased attention has been devoted to techniques that propagate uncertain labels (also called pseudo labels) to large amounts of unsupervised samples and use them for training the model. However, these techniques still need hundreds of supervised samples per class in the training set and a validation set with extra supervised samples to tune the model. We improve a recent iterative pseudo-labeling technique, Deep Feature Annotation (DeepFA), by selecting the most confident unsupervised samples to iteratively train a deep neural network. Our confidence-based sampling strategy relies on only dozens of annotated training samples per class with no validation set, considerably reducing user effort in data annotation. We first ascertain the best configuration for the baseline – a self-trained deep neural network – and then evaluate our confidence DeepFA for different confidence thresholds. Experiments on six datasets show that DeepFA already outperforms the self-trained baseline, but confidence DeepFA can considerably outperform the original DeepFA and the baseline.

READ FULL TEXT

page 3

page 6

research
10/19/2022

Non-iterative optimization of pseudo-labeling thresholds for training object detection models from multiple datasets

We propose a non-iterative method to optimize pseudo-labeling thresholds...
research
01/10/2023

Neighborhood-Regularized Self-Training for Learning with Few Labels

Training deep neural networks (DNNs) with limited supervision has been a...
research
03/04/2020

Annotation-free Learning of Deep Representations for Word Spotting using Synthetic Data and Self Labeling

Word spotting is a popular tool for supporting the first exploration of ...
research
08/02/2020

Semi-supervised deep learning based on label propagation in a 2D embedded space

While convolutional neural networks need large labeled sets for training...
research
06/06/2019

Extreme Points Derived Confidence Map as a Cue For Class-Agnostic Segmentation Using Deep Neural Network

To automate the process of segmenting an anatomy of interest, we can lea...
research
02/17/2020

Subset Sampling For Progressive Neural Network Learning

Progressive Neural Network Learning is a class of algorithms that increm...
research
10/12/2020

A catalog of broad morphology of Pan-STARRS galaxies based on deep learning

Autonomous digital sky surveys such as Pan-STARRS have the ability to im...

Please sign up or login with your details

Forgot password? Click here to reset