Learning Multi-grid Generative ConvNets by Minimal Contrastive Divergence

09/26/2017
by   Ruiqi Gao, et al.
0

This paper proposes a minimal contrastive divergence method for learning energy-based generative ConvNet models of images at multiple grids (or scales) simultaneously. For each grid, we learn an energy-based probabilistic model where the energy function is defined by a bottom-up convolutional neural network (ConvNet or CNN). Learning such a model requires generating synthesized examples from the model. Within each iteration of our learning algorithm, for each observed training image, we generate synthesized images at multiple grids by initializing the finite-step MCMC sampling from a minimal 1 x 1 version of the training image. The synthesized image at each subsequent grid is obtained by a finite-step MCMC initialized from the synthesized image generated at the previous coarser grid. After obtaining the synthesized examples, the parameters of the models at multiple grids are updated separately and simultaneously based on the differences between synthesized and observed examples. We call this learning method the multi-grid minimal contrastive divergence. We show that this method can learn realistic energy-based generative ConvNet models, and it outperforms the original contrastive divergence (CD) and persistent CD.

READ FULL TEXT

page 1

page 7

page 8

page 9

research
02/24/2022

Clarifying MCMC-based training of modern EBMs : Contrastive Divergence versus Maximum Likelihood

The Energy-Based Model (EBM) framework is a very general approach to gen...
research
04/07/2023

Conservative objective models are a special kind of contrastive divergence-based energy model

In this work we theoretically show that conservative objective models (C...
research
07/04/2017

Learning Deep Energy Models: Contrastive Divergence vs. Amortized MLE

We propose a number of new algorithms for learning deep energy models an...
research
12/28/2018

Divergence Triangle for Joint Training of Generator Model, Energy-based Model, and Inference Model

This paper proposes the divergence triangle as a framework for joint tra...
research
04/22/2019

On Learning Non-Convergent Short-Run MCMC Toward Energy-Based Model

This paper studies a curious phenomenon in learning energy-based model (...
research
08/10/2015

Feature-based Decipherment for Large Vocabulary Machine Translation

Orthographic similarities across languages provide a strong signal for p...
research
05/03/2014

Why (and When and How) Contrastive Divergence Works

Contrastive divergence (CD) is a promising method of inference in high d...

Please sign up or login with your details

Forgot password? Click here to reset