Class-Balancing Diffusion Models

04/30/2023
by   Yiming Qin, et al.
0

Diffusion-based models have shown the merits of generating high-quality visual data while preserving better diversity in recent studies. However, such observation is only justified with curated data distribution, where the data samples are nicely pre-processed to be uniformly distributed in terms of their labels. In practice, a long-tailed data distribution appears more common and how diffusion models perform on such class-imbalanced data remains unknown. In this work, we first investigate this problem and observe significant degradation in both diversity and fidelity when the diffusion model is trained on datasets with class-imbalanced distributions. Especially in tail classes, the generations largely lose diversity and we observe severe mode-collapse issues. To tackle this problem, we set from the hypothesis that the data distribution is not class-balanced, and propose Class-Balancing Diffusion Models (CBDM) that are trained with a distribution adjustment regularizer as a solution. Experiments show that images generated by CBDM exhibit higher diversity and quality in both quantitative and qualitative ways. Our method benchmarked the generation results on CIFAR100/CIFAR100LT dataset and shows outstanding performance on the downstream recognition task.

READ FULL TEXT

page 5

page 8

page 14

page 15

research
08/21/2022

Improving GANs for Long-Tailed Data through Group Spectral Regularization

Deep long-tailed learning aims to train useful deep networks on practica...
research
04/24/2023

Towards Mode Balancing of Generative Models via Diversity Weights

Large data-driven image models are extensively used to support creative ...
research
06/25/2023

DiffMix: Diffusion Model-based Data Synthesis for Nuclei Segmentation and Classification in Imbalanced Pathology Image Datasets

Nuclei segmentation and classification is a significant process in patho...
research
07/17/2023

Manifold-Guided Sampling in Diffusion Models for Unbiased Image Generation

Diffusion models are a powerful class of generative models that can prod...
research
01/21/2022

To SMOTE, or not to SMOTE?

In imbalanced binary classification problems the objective metric is oft...
research
06/14/2023

Data Augmentation for Seizure Prediction with Generative Diffusion Model

Objective: Seizure prediction is of great importance to improve the life...
research
04/25/2022

StyleGAN-Human: A Data-Centric Odyssey of Human Generation

Unconditional human image generation is an important task in vision and ...

Please sign up or login with your details

Forgot password? Click here to reset