On the expected runtime of multiple testing algorithms with bounded error

07/27/2018
by   Georg Hahn, et al.
0

Consider the testing of multiple hypotheses in the setting where the p-values of all hypotheses are unknown and thus have to be approximated using Monte Carlo simulations. One class of algorithms published in the literature for this scenario ensures guarantees on the correctness of their testing result (for instance, a bound on the resampling risk) through the computation of confidence statements on all approximated p-values. This article focuses on the expected runtime of those algorithms and shows the following four main results. Computing a decision on a single hypothesis tested at a fixed threshold requires an infinite expected runtime. In applications relying on the decisions of multiple hypotheses computed with a Bonferroni-type threshold, all but two hypotheses can be decided in finite expected runtime. This result does not extend to applications which require full knowledge of all individual decisions (for instance, step-up or step-down procedures), in which case no algorithm can guarantee even a single decision in finite expected runtime. Nevertheless simulations show that in practice, the number of pending decisions typically remains low.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/26/2022

E-values as unnormalized weights in multiple testing

Most standard weighted multiple testing methods require the weights to d...
research
09/19/2018

Online control of the false discovery rate in biomedical research

Modern biomedical research frequently involves testing multiple related ...
research
09/25/2020

Theoretical Justification of the Bi Error Method

Incorrect usage of p-values, particularly within the context of signific...
research
04/20/2023

Adaptive Greedy Rejection Sampling

We consider channel simulation protocols between two communicating parti...
research
04/19/2023

Asymptotically Optimal Sequential Multiple Testing with Asynchronous Decisions

The problem of simultaneously testing the marginal distributions of sequ...
research
07/06/2021

When to adjust alpha during multiple testing: A consideration of disjunction, conjunction, and individual testing

Scientists often adjust their significance threshold (alpha level) durin...
research
08/21/2023

Harnessing The Collective Wisdom: Fusion Learning Using Decision Sequences From Diverse Sources

Learning from the collective wisdom of crowds enhances the transparency ...

Please sign up or login with your details

Forgot password? Click here to reset