Online multiple hypothesis testing for reproducible research

08/24/2022
by   David S. Robertson, et al.
0

Modern data analysis frequently involves large-scale hypothesis testing, which naturally gives rise to the problem of maintaining control of a suitable type I error rate, such as the false discovery rate (FDR). In many biomedical and technological applications, an additional complexity is that hypotheses are tested in an online manner, one-by-one over time. However, traditional procedures that control the FDR, such as the Benjamini-Hochberg procedure, assume that all p-values are available to be tested at a single time point. To address these challenges, a new field of methodology has developed over the past 15 years showing how to control error rates for online multiple hypothesis testing. In this framework, hypotheses arrive in a stream, and at each time point the analyst decides whether to reject the current hypothesis based both on the evidence against it, and on the previous rejection decisions. In this paper, we present a comprehensive exposition of the literature on online error rate control, with a review of key theory as well as a focus on applied examples. We also provide simulation results comparing different online testing algorithms and an up-to-date overview of the many methodological extensions that have been proposed.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/19/2018

Online control of the false discovery rate in biomedical research

Modern biomedical research frequently involves testing multiple related ...
research
02/20/2020

Familywise Error Rate Control by Interactive Unmasking

We propose a method for multiple hypothesis testing with familywise erro...
research
09/23/2019

A reckless guide to P-values: local evidence, global errors

This chapter demystifies P-values, hypothesis tests and significance tes...
research
04/29/2021

Querying multiple sets of p-values through composed hypothesis testing

Motivation: Combining the results of different experiments to exhibit co...
research
02/07/2019

Contextual Online False Discovery Rate Control

Multiple hypothesis testing, a situation when we wish to consider many h...
research
10/11/2019

The Power of Batching in Multiple Hypothesis Testing

One important partition of algorithms for controlling the false discover...
research
12/28/2019

Approval policies for modifications to Machine Learning-Based Software as a Medical Device: A study of bio-creep

Successful deployment of machine learning algorithms in healthcare requi...

Please sign up or login with your details

Forgot password? Click here to reset