Mitigating Out-of-Distribution Data Density Overestimation in Energy-Based Models

05/30/2022
by Beomsu Kim, et al.

Deep energy-based models (EBMs), which use deep neural networks (DNNs) as energy functions, are receiving increasing attention due to their ability to learn complex distributions. Deep EBMs are often trained by maximum likelihood estimation (MLE) with short-run Langevin Monte Carlo (LMC). While MLE with short-run LMC is computationally cheaper than MLE with full Markov chain Monte Carlo (MCMC), it often assigns high density to out-of-distribution (OOD) data. To address this issue, we systematically investigate why MLE with short-run LMC can converge to EBMs with wrong density estimates, and show that the heuristic modifications to LMC introduced by previous works are the main cause. We then propose a Uniform Support Partitioning (USP) scheme, which optimizes a set of points to evenly partition the support of the EBM and then uses the resulting points to approximate the EBM-MLE loss gradient. We empirically demonstrate that USP avoids the pitfalls of short-run LMC, leading to significantly improved OOD data detection performance on Fashion-MNIST.
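
For readers unfamiliar with the baseline the abstract refers to, the following is a minimal sketch of MLE with short-run LMC on a toy problem. It is not the paper's USP scheme and does not reproduce the heuristic modifications the abstract criticizes. It assumes a hypothetical quadratic energy E(x) = 0.5 * ||x - mu||^2 with a single learnable parameter mu, the standard unadjusted Langevin update, and the standard EBM-MLE gradient (data expectation of the energy gradient minus model expectation of the energy gradient). All function names, step sizes, and chain lengths are illustrative assumptions.

```python
import numpy as np

def energy_grad_x(x, mu):
    # Gradient w.r.t. x of the toy energy E(x) = 0.5 * ||x - mu||^2 (assumed, not from the paper).
    return x - mu

def short_run_lmc(x0, mu, n_steps=100, step_size=0.1, rng=None):
    # Unadjusted Langevin update: x <- x - (step/2) * grad E(x) + sqrt(step) * noise.
    rng = np.random.default_rng() if rng is None else rng
    x = np.array(x0, dtype=float)
    for _ in range(n_steps):
        noise = rng.standard_normal(x.shape)
        x = x - 0.5 * step_size * energy_grad_x(x, mu) + np.sqrt(step_size) * noise
    return x

def mle_grad_mu(data, model_samples):
    # EBM-MLE loss gradient: E_data[dE/dmu] - E_model[dE/dmu].
    # For the toy energy dE/dmu = mu - x, so mu cancels and this
    # reduces to mean(model samples) - mean(data).
    return model_samples.mean(axis=0) - data.mean(axis=0)

# Toy usage: data centred at 2.0, model initialised at mu = 0.0.
rng = np.random.default_rng(0)
data = 2.0 + rng.standard_normal((5000, 2))
mu = np.zeros(2)
for _ in range(200):
    # Short chains are restarted from noise at every parameter update.
    x0 = rng.uniform(-1.0, 1.0, size=(500, 2))
    samples = short_run_lmc(x0, mu, n_steps=100, step_size=0.1, rng=rng)
    mu = mu - 0.1 * mle_grad_mu(data, samples)
```

In this toy setting the chains mix quickly, so the short-run estimate of the model expectation is close to exact. The abstract's concern is with deep EBMs, where short chains may not reach the model distribution.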


