On the Generalization of Diffusion Model

05/24/2023
by   Mingyang Yi, et al.
0

The diffusion probabilistic generative models are widely used to generate high-quality data. Though they can synthetic data that does not exist in the training set, the rationale behind such generalization is still unexplored. In this paper, we formally define the generalization of the generative model, which is measured by the mutual information between the generated data and the training set. The definition originates from the intuition that the model which generates data with less correlation to the training set exhibits better generalization ability. Meanwhile, we show that for the empirical optimal diffusion model, the data generated by a deterministic sampler are all highly related to the training set, thus poor generalization. This result contradicts the observation of the trained diffusion model's (approximating empirical optima) extrapolation ability (generating unseen data). To understand this contradiction, we empirically verify the difference between the sufficiently trained diffusion model and the empirical optima. We found, though obtained through sufficient training, there still exists a slight difference between them, which is critical to making the diffusion model generalizable. Moreover, we propose another training objective whose empirical optimal solution has no potential generalization problem. We empirically show that the proposed training objective returns a similar model to the original one, which further verifies the generalization ability of the trained diffusion model.

READ FULL TEXT

page 6

page 23

page 24

page 25

page 26

page 27

page 28

page 29

research
02/03/2023

AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners

Diffusion models have demonstrated their powerful generative capability ...
research
02/21/2023

Provable Copyright Protection for Generative Models

There is a growing concern that learned conditional generative models ma...
research
07/27/2022

Do Quantum Circuit Born Machines Generalize?

In recent proposals of quantum circuit models for generative tasks, the ...
research
08/22/2023

Hey That's Mine Imperceptible Watermarks are Preserved in Diffusion Generated Outputs

Generative models have seen an explosion in popularity with the release ...
research
12/04/2022

Image Deblurring with Domain Generalizable Diffusion Models

Diffusion Probabilistic Models (DPMs) have recently been employed for im...
research
02/03/2023

Learning End-to-End Channel Coding with Diffusion Models

It is a known problem that deep-learning-based end-to-end (E2E) channel ...
research
02/07/2023

Analyzing the Performance of Deep Encoder-Decoder Networks as Surrogates for a Diffusion Equation

Neural networks (NNs) have proven to be a viable alternative to traditio...

Please sign up or login with your details

Forgot password? Click here to reset