JEM++: Improved Techniques for Training JEM

09/19/2021
by   Xiulong Yang, et al.
1

Joint Energy-based Model (JEM) is a recently proposed hybrid model that retains strong discriminative power of modern CNN classifiers, while generating samples rivaling the quality of GAN-based approaches. In this paper, we propose a variety of new training procedures and architecture features to improve JEM's accuracy, training stability, and speed altogether. 1) We propose a proximal SGLD to generate samples in the proximity of samples from the previous step, which improves the stability. 2) We further treat the approximate maximum likelihood learning of EBM as a multi-step differential game, and extend the YOPO framework to cut out redundant calculations during backpropagation, which accelerates the training substantially. 3) Rather than initializing SGLD chain from random noise, we introduce a new informative initialization that samples from a distribution estimated from training data. 4) This informative initialization allows us to enable batch normalization in JEM, which further releases the power of modern CNN architectures for hybrid modeling. Code: https://github.com/sndnyang/JEMPP

READ FULL TEXT

page 7

page 11

page 13

page 14

page 15

page 16

page 17

research
12/06/2019

Your Classifier is Secretly an Energy Based Model and You Should Treat it Like One

We propose to reinterpret a standard discriminative classifier of p(y|x)...
research
04/13/2017

On the Effects of Batch and Weight Normalization in Generative Adversarial Networks

Generative adversarial networks (GANs) are highly effective unsupervised...
research
03/29/2019

On the Anatomy of MCMC-based Maximum Likelihood Learning of Energy-Based Models

This study investigates the effects Markov Chain Monte Carlo (MCMC) samp...
research
03/08/2023

M-EBM: Towards Understanding the Manifolds of Energy-Based Models

Energy-based models (EBMs) exhibit a variety of desirable properties in ...
research
08/21/2023

CoNe: Contrast Your Neighbours for Supervised Image Classification

Image classification is a longstanding problem in computer vision and ma...
research
03/13/2023

Adaptive Data-Free Quantization

Data-free quantization (DFQ) recovers the performance of quantized netwo...
research
09/14/2022

Improved proteasomal cleavage prediction with positive-unlabeled learning

Accurate in silico modeling of the antigen processing pathway is crucial...

Please sign up or login with your details

Forgot password? Click here to reset