Probabilistic Model Incorporating Auxiliary Covariates to Control FDR

10/06/2022
by   Lin Qiu, et al.
0

Controlling False Discovery Rate (FDR) while leveraging the side information of multiple hypothesis testing is an emerging research topic in modern data science. Existing methods rely on the test-level covariates while ignoring metrics about test-level covariates. This strategy may not be optimal for complex large-scale problems, where indirect relations often exist among test-level covariates and auxiliary metrics or covariates. We incorporate auxiliary covariates among test-level covariates in a deep Black-Box framework controlling FDR (named as NeurT-FDR) which boosts statistical power and controls FDR for multiple-hypothesis testing. Our method parametrizes the test-level covariates as a neural network and adjusts the auxiliary covariates through a regression framework, which enables flexible handling of high-dimensional features as well as efficient end-to-end optimization. We show that NeurT-FDR makes substantially more discoveries in three real datasets compared to competitive baselines.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/24/2021

NeurT-FDR: Controlling FDR by Incorporating Feature Hierarchy

Controlling false discovery rate (FDR) while leveraging the side informa...
research
03/29/2021

Optimal False Discovery Rate Control for Large Scale Multiple Testing with Auxiliary Information

Large-scale multiple testing is a fundamental problem in high dimensiona...
research
12/01/2021

Controlling for multiple covariates

A fundamental problem in statistics is to compare the outcomes attained ...
research
01/25/2022

NAPA: Neighborhood-Assisted and Posterior-Adjusted Two-sample Inference

Two-sample multiple testing problems of sparse spatial data are frequent...
research
11/03/2017

NeuralFDR: Learning Discovery Thresholds from Hypothesis Features

As datasets grow richer, an important challenge is to leverage the full ...
research
04/12/2021

A smoothed and probabilistic PARAFAC model with covariates

Analysis and clustering of multivariate time-series data attract growing...
research
04/27/2023

Learning Absorption Rates in Glucose-Insulin Dynamics from Meal Covariates

Traditional models of glucose-insulin dynamics rely on heuristic paramet...

Please sign up or login with your details

Forgot password? Click here to reset