Centroid Approximation for Bootstrap

10/17/2021
by   Mao Ye, et al.
5

Bootstrap is a principled and powerful frequentist statistical tool for uncertainty quantification. Unfortunately, standard bootstrap methods are computationally intensive due to the need of drawing a large i.i.d. bootstrap sample to approximate the ideal bootstrap distribution; this largely hinders their application in large-scale machine learning, especially deep learning problems. In this work, we propose an efficient method to explicitly optimize a small set of high quality "centroid" points to better approximate the ideal bootstrap distribution. We achieve this by minimizing a simple objective function that is asymptotically equivalent to the Wasserstein distance to the ideal bootstrap distribution. This allows us to provide an accurate estimation of uncertainty with a small number of bootstrap centroids, outperforming the naive i.i.d. sampling approach. Empirically, we show that our method can boost the performance of bootstrap in a variety of applications.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/24/2021

Multiplier bootstrap for Bures-Wasserstein barycenters

Bures-Wasserstein barycenter is a popular and promising tool in analysis...
research
12/14/2021

The Importance of Discussing Assumptions when Teaching Bootstrapping

Bootstrapping and other resampling methods are progressively appearing i...
research
07/04/2021

A Comparison of the Delta Method and the Bootstrap in Deep Learning Classification

We validate the recently introduced deep learning classification adapted...
research
05/04/2022

Multivariate Prediction Intervals for Random Forests

Accurate uncertainty estimates can significantly improve the performance...
research
03/12/2018

Weighted Bayesian Bootstrap for Scalable Bayes

We develop a weighted Bayesian Bootstrap (WBB) for machine learning and ...
research
06/01/2020

Scalable Uncertainty Quantification via GenerativeBootstrap Sampler

It has been believed that the virtue of using statistical procedures is ...
research
02/18/2016

What is the distribution of the number of unique original items in a bootstrap sample?

Sampling with replacement occurs in many settings in machine learning, n...

Please sign up or login with your details

Forgot password? Click here to reset