Coupled Bootstrap Test Error Estimation for Poisson Variables

12/04/2022
by   Natalia L. Oliveira, et al.

Test error estimation is a fundamental problem in statistics and machine learning. Correctly assessing the future performance of an algorithm is an essential task, especially with the development of complex predictive algorithms that require data-driven parameter tuning. We propose a new coupled bootstrap estimator for the test error of algorithms with Poisson-distributed responses. The Poisson distribution is a fundamental model for count data, with applications such as signal processing, density estimation, and queueing theory. The idea behind our estimator is to generate two carefully designed new random vectors from the original data, where one acts as a training sample and the other as a test set. The estimator is unbiased for an intuitive parameter: the out-of-sample error of a Poisson random vector whose mean has been shrunken by a small factor. Moreover, in a limiting regime, the coupled bootstrap estimator recovers an exactly unbiased estimator for test error. Our framework is applicable to loss functions of the Bregman divergence family, and our analysis and examples focus on two important cases: Poisson likelihood deviance and squared loss. Through a bias-variance decomposition, we analyze the effect of the number of bootstrap samples and of the added noise due to the two auxiliary variables. We then apply our method to different scenarios with both simulated and real data.
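The abstract does not spell out how the two auxiliary vectors are built, so the following is only a minimal sketch under an assumption: it uses binomial thinning, a standard construction that splits a Poisson count with mean mu into two independent Poisson counts with means (1 - p) * mu and p * mu. The names `thin_poisson`, `coupled_bootstrap_sq_error`, `fit`, `p`, and `n_boot` are hypothetical and not taken from the paper, and the squared-loss correction term is derived in the comments rather than quoted from the authors.

```python
import numpy as np


def thin_poisson(y, p, rng):
    """Split integer counts y into (train, test) by binomial thinning.

    If y ~ Poisson(mu), then test ~ Poisson(p * mu) and
    train = y - test ~ Poisson((1 - p) * mu), and the two are independent.
    """
    y_test = rng.binomial(y, p)
    y_train = y - y_test
    return y_train, y_test


def coupled_bootstrap_sq_error(y, fit, p=0.1, n_boot=200, seed=0):
    """Sketch of a squared-loss test error estimate for the shrunken mean.

    Target: E||Y_new - fit(Y_train)||^2 with Y_new ~ Poisson((1 - p) * mu)
    independent of Y_train. This is an illustrative assumption, not
    necessarily the exact estimator defined in the paper.
    """
    rng = np.random.default_rng(seed)
    c = (1.0 - p) / p  # rescales the test vector so its mean is (1 - p) * mu
    estimates = np.empty(n_boot)
    for b in range(n_boot):
        y_train, y_test = thin_poisson(y, p, rng)
        pred = fit(y_train)
        raw = np.sum((c * y_test - pred) ** 2)
        # Var(c * y_test) = c^2 * p * mu, but Var(Y_new) = (1 - p) * mu.
        # Since E[y_test] = p * mu, c^2 * y_test and c * y_test are unbiased
        # for those two variances, so swap one for the other.
        correction = (c - c**2) * np.sum(y_test)
        estimates[b] = raw + correction
    return estimates.mean()


# Usage: a toy "algorithm" that predicts the training mean everywhere.
rng = np.random.default_rng(42)
y = rng.poisson(5.0, size=50)
mean_fit = lambda y_train: np.full(y_train.shape, y_train.mean())
print(coupled_bootstrap_sq_error(y, mean_fit, p=0.1, n_boot=200))
```

A smaller `p` moves the target closer to the error at the original mean but inflates the variance of the rescaled test vector, and averaging over more bootstrap draws reduces only the noise introduced by the random split. This is the trade-off the abstract refers to in its bias-variance discussion.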


