Speeding Up MCMC by Efficient Data Subsampling

04/16/2014
by   Matias Quiroz, et al.
0

We propose Subsampling MCMC, a Markov Chain Monte Carlo (MCMC) framework where the likelihood function for n observations is estimated from a random subset of m observations. We introduce a highly efficient unbiased estimator of the log-likelihood based on control variates, such that the computing cost is much smaller than that of the full log-likelihood in standard MCMC. The likelihood estimate is bias-corrected and used in two dependent pseudo-marginal algorithms to sample from a perturbed posterior, for which we derive the asymptotic error with respect to n and m, respectively. Our analysis allows the number of variables p to grow with n. We propose a practical estimator of the error and show that the error is negligible even for a very small m in our applications. We demonstrate that Subsampling MCMC is substantially more efficient than standard MCMC in terms of sampling efficiency for a given computational budget, and that it outperforms other subsampling methods for MCMC proposed in the literature.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/23/2020

Most Likely Optimal Subsampled Markov Chain Monte Carlo

Markov Chain Monte Carlo (MCMC) requires to evaluate the full data likel...
research
03/27/2016

Exact Subsampling MCMC

Speeding up Markov Chain Monte Carlo (MCMC) for data sets with many obse...
research
04/19/2013

Austerity in MCMC Land: Cutting the Metropolis-Hastings Budget

Can we make Bayesian posterior MCMC sampling more efficient when faced w...
research
10/16/2020

Minimax Quasi-Bayesian estimation in sparse canonical correlation analysis via a Rayleigh quotient function

Canonical correlation analysis (CCA) is a popular statistical technique ...
research
10/06/2015

Population-Contrastive-Divergence: Does Consistency help with RBM training?

Estimating the log-likelihood gradient with respect to the parameters of...
research
07/23/2018

Subsampling MCMC - A review for the survey statistician

The rapid development of computing power and efficient Markov Chain Mont...
research
07/09/2021

Fast compression of MCMC output

We propose cube thinning, a novel method for compressing the output of a...

Please sign up or login with your details

Forgot password? Click here to reset