On the Pitfalls of Heteroscedastic Uncertainty Estimation with Probabilistic Neural Networks

03/17/2022
by Maximilian Seitzer, et al.

Capturing aleatoric uncertainty is a critical part of many machine learning systems. In deep learning, a common approach is to train a neural network to estimate the parameters of a heteroscedastic Gaussian distribution by maximizing the log-likelihood of the observed data. In this work, we examine this approach and identify potential hazards associated with the use of log-likelihood in conjunction with gradient-based optimizers. First, we present a synthetic example illustrating how this approach can lead to very poor but stable parameter estimates. Second, we identify the culprit to be the log-likelihood loss, along with certain conditions that exacerbate the issue. Third, we present an alternative formulation, termed β-NLL, in which each data point's contribution to the loss is weighted by the β-exponentiated variance estimate. We show that using an appropriate β largely mitigates the issue in our illustrative example. Fourth, we evaluate this approach on a range of domains and tasks and show that it achieves considerable improvements and is more robust to hyperparameter choices, in terms of both predictive RMSE and log-likelihood.
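As a rough illustration of the weighting described in the abstract, the following minimal sketch computes a β-weighted Gaussian negative log-likelihood in plain Python. The function names, the averaging over data points, and the default `beta=0.5` are illustrative assumptions and are not taken from the abstract. The sketch only evaluates the loss value; in a gradient-based framework the weight would presumably be treated as a constant (a stop-gradient), which the abstract does not spell out.

```python
import math


def gaussian_nll(y, mean, var):
    """Negative log-likelihood of y under a Gaussian N(mean, var)."""
    return 0.5 * (math.log(2.0 * math.pi * var) + (y - mean) ** 2 / var)


def beta_nll(ys, means, variances, beta=0.5):
    """Average per-point NLL, each term weighted by var ** beta.

    Hypothetical helper: beta=0.5 is an illustrative default, and the
    weight is assumed to be held constant when taking gradients.
    With beta=0 every weight is 1 and the standard NLL is recovered.
    """
    terms = [
        (var ** beta) * gaussian_nll(y, mean, var)
        for y, mean, var in zip(ys, means, variances)
    ]
    return sum(terms) / len(terms)


# Two points with the same residual but different predicted variances.
targets = [1.0, 1.0]
pred_means = [0.0, 0.0]
pred_vars = [0.25, 4.0]

plain = beta_nll(targets, pred_means, pred_vars, beta=0.0)
weighted = beta_nll(targets, pred_means, pred_vars, beta=0.5)
```

With `beta=0.0` the result is the ordinary averaged NLL; larger values of `beta` scale down the contribution of points with small predicted variance relative to points with large predicted variance.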


