Guiding Energy-based Models via Contrastive Latent Variables

03/06/2023
by   Hankook Lee, et al.
0

An energy-based model (EBM) is a popular generative framework that offers both explicit density and architectural flexibility, but training them is difficult since it is often unstable and time-consuming. In recent years, various training techniques have been developed, e.g., better divergence measures or stabilization in MCMC sampling, but there often exists a large gap between EBMs and other generative frameworks like GANs in terms of generation quality. In this paper, we propose a novel and effective framework for improving EBMs via contrastive representation learning (CRL). To be specific, we consider representations learned by contrastive methods as the true underlying latent variable. This contrastive latent variable could guide EBMs to understand the data structure better, so it can improve and accelerate EBM training significantly. To enable the joint training of EBM and CRL, we also design a new class of latent-variable EBMs for learning the joint density of data and the contrastive latent variable. Our experimental results demonstrate that our scheme achieves lower FID scores, compared to prior-art EBM methods (e.g., additionally using variational autoencoders or diffusion techniques), even with significantly faster and more memory-efficient training. We also show conditional and compositional generation abilities of our latent-variable EBMs as their additional benefits, even without explicit conditional training. The code is available at https://github.com/hankook/CLEL.

READ FULL TEXT

page 7

page 8

research
04/12/2018

Variational Composite Autoencoders

Learning in the latent variable model is challenging in the presence of ...
research
03/17/2021

Training GANs with Stronger Augmentations via Contrastive Discriminator

Recent works in Generative Adversarial Networks (GANs) are actively revi...
research
12/12/2017

GibbsNet: Iterative Adversarial Inference for Deep Graphical Models

Directed latent variable models that formulate the joint distribution as...
research
06/25/2021

NP-DRAW: A Non-Parametric Structured Latent Variable Modelfor Image Generation

In this paper, we present a non-parametric structured latent variable mo...
research
05/12/2018

Gaussian Mixture Latent Vector Grammars

We introduce Latent Vector Grammars (LVeGs), a new framework that extend...
research
02/26/2020

ICE-BeeM: Identifiable Conditional Energy-Based Deep Models

Despite the growing popularity of energy-based models, their identifiabi...
research
04/04/2023

Fully Variational Noise-Contrastive Estimation

By using the underlying theory of proper scoring rules, we design a fami...

Please sign up or login with your details

Forgot password? Click here to reset