Towards Controllable Diffusion Models via Reward-Guided Exploration

04/14/2023
by   Hengtong Zhang, et al.
0

By formulating data samples' formation as a Markov denoising process, diffusion models achieve state-of-the-art performances in a collection of tasks. Recently, many variants of diffusion models have been proposed to enable controlled sample generation. Most of these existing methods either formulate the controlling information as an input (i.e.,: conditional representation) for the noise approximator, or introduce a pre-trained classifier in the test-phase to guide the Langevin dynamic towards the conditional goal. However, the former line of methods only work when the controlling information can be formulated as conditional representations, while the latter requires the pre-trained guidance classifier to be differentiable. In this paper, we propose a novel framework named RGDM (Reward-Guided Diffusion Model) that guides the training-phase of diffusion models via reinforcement learning (RL). The proposed training framework bridges the objective of weighted log-likelihood and maximum entropy RL, which enables calculating policy gradients via samples from a pay-off distribution proportional to exponential scaled rewards, rather than from policies themselves. Such a framework alleviates the high gradient variances and enables diffusion models to explore for highly rewarded samples in the reverse process. Experiments on 3D shape and molecule generation tasks show significant improvements over existing conditional diffusion models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/06/2022

On Distillation of Guided Diffusion Models

Classifier-free guided diffusion models have recently been shown to be h...
research
02/05/2023

ShiftDDPMs: Exploring Conditional Diffusion Models by Shifting Diffusion Trajectories

Diffusion models have recently exhibited remarkable abilities to synthes...
research
03/17/2023

FreeDoM: Training-Free Energy-Guided Conditional Diffusion Model

Recently, conditional diffusion models have gained popularity in numerou...
research
05/22/2023

Training Diffusion Models with Reinforcement Learning

Diffusion models are a class of flexible generative models trained with ...
research
06/23/2022

Entropy-driven Sampling and Training Scheme for Conditional Diffusion Generation

Denoising Diffusion Probabilistic Model (DDPM) is able to make flexible ...
research
06/02/2023

PolyDiffuse: Polygonal Shape Reconstruction via Guided Set Diffusion Models

This paper presents PolyDiffuse, a novel structured reconstruction algor...
research
06/15/2023

Training Diffusion Classifiers with Denoising Assistance

Score-matching and diffusion models have emerged as state-of-the-art gen...

Please sign up or login with your details

Forgot password? Click here to reset