NeurT-FDR: Controlling FDR by Incorporating Feature Hierarchy

01/24/2021
by   Lin Qiu, et al.
0

Controlling false discovery rate (FDR) while leveraging the side information of multiple hypothesis testing is an emerging research topic in modern data science. Existing methods rely on the test-level covariates while ignoring possible hierarchy among the covariates. This strategy may not be optimal for complex large-scale problems, where hierarchical information often exists among those test-level covariates. We propose NeurT-FDR which boosts statistical power and controls FDR for multiple hypothesis testing while leveraging the hierarchy among test-level covariates. Our method parametrizes the test-level covariates as a neural network and adjusts the feature hierarchy through a regression framework, which enables flexible handling of high-dimensional features as well as efficient end-to-end optimization. We show that NeurT-FDR has strong FDR guarantees and makes substantially more discoveries in synthetic and real datasets compared to competitive baselines.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/06/2022

Probabilistic Model Incorporating Auxiliary Covariates to Control FDR

Controlling False Discovery Rate (FDR) while leveraging the side informa...
research
11/03/2017

NeuralFDR: Learning Discovery Thresholds from Hypothesis Features

As datasets grow richer, an important challenge is to leverage the full ...
research
01/19/2019

Custodes: Auditable Hypothesis Testing

We present Custodes: a new approach to solving the complex issue of prev...
research
04/30/2021

Efficient Multiple Testing Adjustment for Hierarchical Inference

Hierarchical inference in (generalized) regression problems is powerful ...
research
08/11/2021

Controlling the False Split Rate in Tree-Based Aggregation

In many domains, data measurements can naturally be associated with the ...
research
07/13/2021

For high-dimensional hierarchical models, consider exchangeability of effects across covariates instead of across datasets

Hierarchical Bayesian methods enable information sharing across multiple...
research
12/01/2021

Controlling for multiple covariates

A fundamental problem in statistics is to compare the outcomes attained ...

Please sign up or login with your details

Forgot password? Click here to reset