Training Energy-Based Models with Diffusion Contrastive Divergences

07/04/2023
by Weijian Luo, et al.

Energy-Based Models (EBMs) have been widely used for generative modeling. Contrastive Divergence (CD), a prevailing training objective for EBMs, requires sampling from the EBM with Markov chain Monte Carlo (MCMC) methods, which leads to an irreconcilable trade-off between the computational burden and the validity of the CD. Running MCMC until convergence is computationally intensive. On the other hand, short-run MCMC introduces an extra non-negligible parameter gradient term that is difficult to handle. In this paper, we provide a general interpretation of CD, viewing it as a special instance of our proposed Diffusion Contrastive Divergence (DCD) family. By replacing the Langevin dynamics used in CD with other EBM-parameter-free diffusion processes, we propose a more efficient divergence. We show that the proposed DCDs are more computationally efficient than CD and are not subject to the non-negligible gradient term. We conduct extensive experiments, including both synthetic data modeling and high-dimensional image denoising and generation, to show the advantages of the proposed DCDs. In the synthetic data learning and image denoising experiments, our proposed DCD outperforms CD by a large margin. In the image generation experiments, the proposed DCD is capable of training an energy-based model to generate the CelebA 32×32 dataset, with results comparable to existing EBMs.
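To make the baseline concrete, the following is a minimal sketch of standard CD training with short-run Langevin dynamics, the setting the abstract identifies as problematic. It is not the authors' DCD method. The toy 1-D energy E_theta(x) = 0.5 * (x - theta)^2, the function names, and all hyperparameters are assumptions chosen for illustration only.

```python
import numpy as np

# Toy EBM (an assumption for illustration): E_theta(x) = 0.5 * (x - theta)^2,
# i.e. an unnormalised unit-variance Gaussian centred at theta.

def energy_grad_x(x, theta):
    # dE/dx, used to drive the Langevin sampler
    return x - theta

def energy_grad_theta(x, theta):
    # dE/dtheta, used in the CD parameter gradient
    return theta - x

def short_run_langevin(x0, theta, n_steps, step_size, rng):
    # x <- x - step * dE/dx + sqrt(2 * step) * noise, for a few steps only
    x = x0.copy()
    for _ in range(n_steps):
        noise = rng.standard_normal(x.shape)
        x = x - step_size * energy_grad_x(x, theta) + np.sqrt(2.0 * step_size) * noise
    return x

def cd_gradient(data, theta, n_steps, step_size, rng):
    # Negative samples come from a short chain initialised at the data.
    negatives = short_run_langevin(data, theta, n_steps, step_size, rng)
    # Standard CD estimate: E_data[dE/dtheta] - E_negatives[dE/dtheta].
    # The negatives themselves depend on theta through the chain; that
    # dependence is ignored here, which is the extra gradient term the
    # abstract refers to.
    return energy_grad_theta(data, theta).mean() - energy_grad_theta(negatives, theta).mean()

def train(data, theta0=0.0, n_iters=200, lr=0.1, n_steps=10, step_size=0.1, seed=0):
    rng = np.random.default_rng(seed)
    theta = theta0
    for _ in range(n_iters):
        theta -= lr * cd_gradient(data, theta, n_steps, step_size, rng)
    return theta

# Synthetic data with true mean 2.0
data = np.random.default_rng(1).normal(2.0, 1.0, 2000)
theta_hat = train(data)
```

In this toy case the learned `theta_hat` should land near the data mean. Increasing `n_steps` brings the negatives closer to true model samples at higher cost per update, which is the trade-off between computation and validity described above.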

