Adaptive Monte Carlo Multiple Testing via Multi-Armed Bandits

02/01/2019
by   Martin J. Zhang, et al.
0

Monte Carlo (MC) permutation testing is considered the gold standard for statistical hypothesis testing, especially when standard parametric assumptions are not clear or likely to fail. However, in modern data science settings where a large number of hypothesis tests need to be performed simultaneously, it is rarely used due to its prohibitive computational cost. In genome-wide association studies, for example, the number of hypothesis tests m is around 10^6 while the number of MC samples n for each test could be greater than 10^8, totaling more than nm=10^14 samples. In this paper, we propose Adaptive MC Testing (AMT) to estimate MC p-values and control false discovery rate in multiple testing. The algorithm outputs the same result as the standard full MC approach with high probability while requiring only O(√(n)m) samples. This sample complexity is shown to be optimal. On a Parkinson GWAS dataset, the algorithm reduces the running time from 2 months for full MC to an hour. The AMT algorithm is derived based on the theory of multi-armed bandits.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/09/2023

Efficient Propagation of Uncertainty via Reordering Monte Carlo Samples

Uncertainty analysis in the outcomes of model predictions is a key eleme...
research
03/04/2019

Statistical approach to detection of signals by Monte Carlo singular spectrum analysis: Multiple testing

The statistical approach to detection of a signal in noisy series is con...
research
01/31/2023

Improving Monte Carlo Evaluation with Offline Data

Monte Carlo (MC) methods are the most widely used methods to estimate th...
research
05/27/2021

Nested sampling for frequentist computation: fast estimation of small p-values

We propose a novel method for computing p-values based on nested samplin...
research
10/13/2017

Parsimonious Adaptive Rejection Sampling

Monte Carlo (MC) methods have become very popular in signal processing d...
research
06/02/2020

Improved q-values for discrete uniform and homogeneous tests: a comparative study

Large scale discrete uniform and homogeneous P-values often arise in app...
research
02/19/2022

Graph Reparameterizations for Enabling 1000+ Monte Carlo Iterations in Bayesian Deep Neural Networks

Uncertainty estimation in deep models is essential in many real-world ap...

Please sign up or login with your details

Forgot password? Click here to reset