A Bottom-up Approach to Testing Hypotheses That Have a Branching Tree Dependence Structure, with False Discovery Rate Control

03/16/2019
by   Yunxiao Li, et al.
0

Modern statistical analyses often involve testing large numbers of hypotheses. In many situations, these hypotheses may have an underlying tree structure that not only helps determine the order that tests should be conducted but also imposes a dependency between tests that must be accounted for. Our motivating example comes from testing the association between a trait of interest and groups of microbes that have been organized into operational taxonomic units (OTUs) or amplicon sequence variants (ASVs). Given p-values from association tests for each individual OTU or ASV, we would like to know if we can declare that a certain species, genus, or higher taxonomic grouping can be considered to be associated with the trait. For this problem, a bottom-up testing algorithm that starts at the lowest level of the tree (OTUs or ASVs) and proceeds upward through successively higher taxonomic groupings (species, genus, family etc.) is required. We develop such a bottom-up testing algorithm that controls the error rate of decisions made at higher levels in the tree, conditional on findings at lower levels in the tree. We further show this algorithm controls the false discovery rate based on the global null hypothesis that no taxa are associated with the trait. By simulation, we also show that our approach is better at finding driver taxa, the highest level taxa below which there are dense association signals. We illustrate our approach using data from a study of the microbiome among patients with ulcerative colitis and healthy controls.

READ FULL TEXT
research
08/10/2021

'Too Many, Too Improbable' test statistics: A general method for testing joint hypotheses and controlling the k-FWER

Hypothesis testing is a key part of empirical science and multiple testi...
research
10/04/2021

Online Control of the False Discovery Rate under "Decision Deadlines"

Online testing procedures aim to control the extent of false discoveries...
research
08/11/2021

Controlling the False Split Rate in Tree-Based Aggregation

In many domains, data measurements can naturally be associated with the ...
research
09/06/2018

Controlling FDR while highlighting distinct discoveries

Often modern scientific investigations start by testing a very large num...
research
09/28/2020

Hierarchical correction of p-values via a tree running Ornstein-Uhlenbeck process

Statistical testing is classically used as an exploratory tool to search...
research
05/03/2020

ProgPermute: Progressive permutation for a dynamic representation of the robustness of microbiome discoveries

Identification of significant features is a critical task in microbiome ...
research
12/01/2018

A Family-based Graphical Approach for Testing Hierarchically Ordered Families of Hypotheses

In applications of clinical trials, tested hypotheses are often grouped ...

Please sign up or login with your details

Forgot password? Click here to reset