Bayesian Coreset Construction via Greedy Iterative Geodesic Ascent

02/05/2018
by   Trevor Campbell, et al.
0

Coherent uncertainty quantification is a key strength of Bayesian methods. But modern algorithms for approximate Bayesian posterior inference often sacrifice accurate posterior uncertainty estimation in the pursuit of scalability. This work shows that previous Bayesian coreset construction algorithms---which build a small, weighted subset of the data that approximates the full dataset---are no exception. We demonstrate that these algorithms scale the coreset log-likelihood suboptimally, resulting in underestimated posterior uncertainty. To address this shortcoming, we develop greedy iterative geodesic ascent (GIGA), a novel algorithm for Bayesian coreset construction that scales the coreset log-likelihood optimally. GIGA provides geometric decay in posterior approximation error as a function of coreset size, and maintains the fast running time of its predecessors. The paper concludes with validation of GIGA on both synthetic and real datasets, demonstrating that it reduces posterior approximation error by orders of magnitude compared with previous coreset constructions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/24/2019

Quality of Uncertainty Quantification for Bayesian Neural Network Inference

Bayesian Neural Networks (BNNs) place priors over the parameters in a ne...
research
10/13/2017

Automated Scalable Bayesian Inference via Hilbert Coresets

The automation of posterior inference in Bayesian data analysis has enab...
research
12/01/2022

Are you using test log-likelihood correctly?

Test log-likelihood is commonly used to compare different models of the ...
research
01/14/2022

A generalized likelihood based Bayesian approach for scalable joint regression and covariance selection in high dimensions

The paper addresses joint sparsity selection in the regression coefficie...
research
06/07/2019

Sparse Variational Inference: Bayesian Coresets from Scratch

The proliferation of automated inference algorithms in Bayesian statisti...
research
06/09/2020

A generalized Bayes framework for probabilistic clustering

Loss-based clustering methods, such as k-means and its variants, are sta...
research
03/18/2022

Fast Bayesian Coresets via Subsampling and Quasi-Newton Refinement

Bayesian coresets approximate a posterior distribution by building a sma...

Please sign up or login with your details

Forgot password? Click here to reset