A comparison of learning rate selection methods in generalized Bayesian inference

by   Pei-Shien Wu, et al.

Generalized Bayes posterior distributions are formed by putting a fractional power on the likelihood before combining with the prior via Bayes's formula. This fractional power, which is often viewed as a remedy for potential model misspecification bias, is called the learning rate, and a number of data-driven learning rate selection methods have been proposed in the recent literature. Each of these proposals has a different focus, a different target they aim to achieve, which makes them difficult to compare. In this paper, we provide a direct head-to-head comparison of these learning rate selection methods in various misspecified model scenarios, in terms of several relevant metrics, in particular, coverage probability of the generalized Bayes credible regions. In some examples all the methods perform well, while in others the misspecification is too severe to be overcome, but we find that the so-called generalized posterior calibration algorithm tends to outperform the others in terms of credible region coverage probability.


page 1

page 2

page 3

page 4


Safe-Bayesian Generalized Linear Regression

We study generalized Bayesian inference under misspecification, i.e. whe...

Evolving Learning Rate Optimizers for Deep Neural Networks

Artificial Neural Networks (ANNs) became popular due to their successful...

Bayes-optimal prediction with frequentist coverage control

This article illustrates how indirect or prior information can be optima...

Practical calibration of the temperature parameter in Gibbs posteriors

PAC-Bayesian algorithms and Gibbs posteriors are gaining popularity due ...

Calibrating generalized predictive distributions

In prediction problems, it is common to model the data-generating proces...

Generalized Bayes inference on a linear personalized minimum clinically important difference

Inference on the minimum clinically important difference, or MCID, is an...

Diagnosis of Multiple Faults: A Sensitivity Analysis

We compare the diagnostic accuracy of three diagnostic inference models:...