Diffusion Models and Semi-Supervised Learners Benefit Mutually with Few Labels

02/21/2023
by   Zebin You, et al.
0

We propose a three-stage training strategy called dual pseudo training (DPT) for conditional image generation and classification in semi-supervised learning. First, a classifier is trained on partially labeled data and predicts pseudo labels for all data. Second, a conditional generative model is trained on all data with pseudo labels and generates pseudo images given labels. Finally, the classifier is trained on real data augmented by pseudo images with labels. We demonstrate large-scale diffusion models and semi-supervised learners benefit mutually with a few labels via DPT. In particular, on the ImageNet 256x256 generation benchmark, DPT can generate realistic, diverse, and semantically correct images with very few labels. With two (i.e., < 0.2 five (i.e., < 0.4 respectively, outperforming strong diffusion models with full labels, such as IDDPM, CDM, ADM, and LDM. Besides, DPT outperforms competitive semi-supervised baselines substantially on ImageNet classification benchmarks with one, two, and five labels per class, achieving state-of-the-art top-1 accuracies of 59.0 (+2.8), 69.5 (+3.0), and 73.6 (+1.2) respectively.

READ FULL TEXT

page 3

page 4

page 7

page 8

page 15

page 16

page 17

research
08/09/2019

Repetitive Reprediction Deep Decipher for Semi-Supervised Learning

Most recent semi-supervised deep learning (deep SSL) methods used a simi...
research
03/30/2022

Semi-Supervised Learning of Semantic Correspondence with Pseudo-Labels

Establishing dense correspondences across semantically similar images re...
research
01/05/2022

Debiased Learning from Naturally Imbalanced Pseudo-Labels for Zero-Shot and Semi-Supervised Learning

This work studies the bias issue of pseudo-labeling, a natural phenomeno...
research
02/18/2022

R2-D2: Repetitive Reprediction Deep Decipher for Semi-Supervised Deep Learning

Most recent semi-supervised deep learning (deep SSL) methods used a simi...
research
03/06/2019

High-Fidelity Image Generation With Fewer Labels

Deep generative models are becoming a cornerstone of modern machine lear...
research
01/12/2023

SemPPL: Predicting pseudo-labels for better contrastive representations

Learning from large amounts of unsupervised data and a small amount of s...
research
08/27/2020

A Consistent Diffusion-Based Algorithm for Semi-Supervised Classification on Graphs

Semi-supervised classification on graphs aims at assigning labels to all...

Please sign up or login with your details

Forgot password? Click here to reset