Efficient Training of Energy-Based Models Using Jarzynski Equality

05/30/2023
by   Davide Carbone, et al.
0

Energy-based models (EBMs) are generative models inspired by statistical physics with a wide range of applications in unsupervised learning. Their performance is best measured by the cross-entropy (CE) of the model distribution relative to the data distribution. Using the CE as the objective for training is however challenging because the computation of its gradient with respect to the model parameters requires sampling the model distribution. Here we show how results for nonequilibrium thermodynamics based on Jarzynski equality together with tools from sequential Monte-Carlo sampling can be used to perform this computation efficiently and avoid the uncontrolled approximations made using the standard contrastive divergence algorithm. Specifically, we introduce a modification of the unadjusted Langevin algorithm (ULA) in which each walker acquires a weight that enables the estimation of the gradient of the cross-entropy at any step during GD, thereby bypassing sampling biases induced by slow mixing of ULA. We illustrate these results with numerical experiments on Gaussian mixture distributions as well as the MNIST dataset. We show that the proposed approach outperforms methods based on the contrastive divergence algorithm in all the considered situations.

READ FULL TEXT

page 9

page 22

page 24

research
12/02/2020

Improved Contrastive Divergence Training of Energy Based Models

We propose several different techniques to improve contrastive divergenc...
research
10/12/2021

Rare Events via Cross-Entropy Population Monte Carlo

We present a Cross-Entropy based population Monte Carlo algorithm. This ...
research
03/06/2020

Training Deep Energy-Based Models with f-Divergence Minimization

Deep energy-based models (EBMs) are very flexible in distribution parame...
research
11/03/2022

Can RBMs be trained with zero step contrastive divergence?

Restricted Boltzmann Machines (RBMs) are probabilistic generative models...
research
05/29/2023

Cross-Entropy Estimators for Sequential Experiment Design with Reinforcement Learning

Reinforcement learning can effectively learn amortised design policies f...
research
10/03/2019

Efficient training of energy-based models via spin-glass control

We present an efficient method for unsupervised learning using Boltzmann...
research
10/19/2022

Gaussian-Bernoulli RBMs Without Tears

We revisit the challenging problem of training Gaussian-Bernoulli restri...

Please sign up or login with your details

Forgot password? Click here to reset