Empirical Bayes large-scale multiple testing for high-dimensional sparse binary sequences

07/12/2023
by Bo Y.-C. Ning, et al.

This paper investigates multiple testing for high-dimensional sparse binary sequences, a problem motivated by crowdsourcing in machine learning. We adopt an empirical Bayes approach to estimate possibly sparse sequences observed with Bernoulli noise. Surprisingly, we find that the hard thresholding rule derived from the spike-and-slab posterior is not optimal, even when a uniform prior is used. We then propose two approaches that calibrate the posterior to achieve the optimal signal detection boundary, and we construct two multiple testing procedures based on these calibrated posteriors. We obtain sharp frequentist theoretical results for these procedures, showing that both effectively control the false discovery rate uniformly over signals satisfying a sparsity assumption. Numerical experiments validate our theory in finite samples.


research
08/29/2018

On spike and slab empirical Bayes multiple testing

This paper explores a connection between empirical Bayes posterior distr...
research
09/28/2021

Sharp multiple testing boundary for sparse sequences

This work investigates multiple testing from the point of view of minima...
research
06/16/2021

Nonparametric Empirical Bayes Estimation and Testing for Sparse and Heteroscedastic Signals

Large-scale modern data often involves estimation and testing for high-d...
research
12/18/2018

Solving the Empirical Bayes Normal Means Problem with Correlated Noise

The Normal Means problem plays a fundamental role in many areas of moder...
research
02/01/2021

Empirical Bayes cumulative ℓ-value multiple testing procedure for sparse sequences

In the sparse sequence model, we consider a popular Bayesian multiple te...
research
01/05/2018

Empirical Bayes analysis of spike and slab posterior distributions

In the sparse normal means model, convergence of the Bayesian posterior ...
research
08/13/2022

Machine learning meets false discovery rate

Classical false discovery rate (FDR) controlling procedures offer strong...
