Learning Proposals for Practical Energy-Based Regression

10/22/2021
by   Fredrik K. Gustafsson, et al.
0

Energy-based models (EBMs) have experienced a resurgence within machine learning in recent years, including as a promising alternative for probabilistic regression. However, energy-based regression requires a proposal distribution to be manually designed for training, and an initial estimate has to be provided at test-time. We address both of these issues by introducing a conceptually simple method to automatically learn an effective proposal distribution, which is parameterized by a separate network head. To this end, we derive a surprising result, leading to a unified training objective that jointly minimizes the KL divergence from the proposal to the EBM, and the negative log-likelihood of the EBM. At test-time, we can then employ importance sampling with the trained proposal to efficiently evaluate the learned EBM and produce stand-alone predictions. Furthermore, we utilize our derived training objective to learn mixture density networks (MDNs) with a jointly trained energy-based teacher, consistently outperforming conventional MDN training on four real-world regression tasks within computer vision. Code is available at https://github.com/fregu856/ebms_proposals.

READ FULL TEXT
research
07/23/2021

Human Pose Regression with Residual Log-likelihood Estimation

Heatmap-based methods dominate in the field of human pose estimation by ...
research
01/15/2022

Parameter-free Online Test-time Adaptation

Training state-of-the-art vision models has become prohibitively expensi...
research
04/13/2022

Distributionally Robust Models with Parametric Likelihood Ratios

As machine learning models are deployed ever more broadly, it becomes in...
research
05/04/2020

How to Train Your Energy-Based Model for Regression

Energy-based models (EBMs) have become increasingly popular within compu...
research
10/18/2022

Optimizing Hierarchical Image VAEs for Sample Quality

While hierarchical variational autoencoders (VAEs) have achieved great d...
research
09/30/2022

Learning with MISELBO: The Mixture Cookbook

Mixture models in variational inference (VI) is an active field of resea...
research
02/28/2023

M-L2O: Towards Generalizable Learning-to-Optimize by Test-Time Fast Self-Adaptation

Learning to Optimize (L2O) has drawn increasing attention as it often re...

Please sign up or login with your details

Forgot password? Click here to reset