Controlling FDR in selecting group-level simultaneous signals from multiple data sources with application to the National Covid Collaborative Cohort data

03/02/2023
by   Runqiu Wang, et al.
0

One challenge in exploratory association studies using observational data is that the signals are potentially weak and the features have complex correlation structures. False discovery rate (FDR) controlling procedures can provide important statistical guarantees for replicability in risk factor identification in exploratory research. In the recently established National COVID Collaborative Cohort (N3C), electronic health record (EHR) data on the same set of candidate features are independently collected in multiple different sites, offering opportunities to identify signals by combining information from different sources. This paper presents a general knockoff-based variable selection algorithm to identify mutual signals from unions of group-level conditional independence tests with exact FDR control guarantees under finite sample settings. This algorithm can work with general regression settings, allowing heterogeneity of both the predictors and the outcomes across multiple data sources. We demonstrate the performance of this method with extensive numerical studies and an application to the N3C data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/24/2021

Multiple Testing for Composite Null with FDR Control Guarantee

False discovery rate (FDR) controlling procedures provide important stat...
research
08/24/2021

A Generalized Knockoff Procedure for FDR Control in Structural Change Detection

Controlling false discovery rate (FDR) is crucial for variable selection...
research
04/21/2023

Joint Mirror Procedure: Controlling False Discovery Rate for Identifying Simultaneous Signals

In many applications, identifying a single feature of interest requires ...
research
11/21/2019

Controlling the FDR in variable selection via multiple knockoffs

Barber and Candes recently introduced a feature selection method called ...
research
01/11/2018

Robust inference with knockoffs

We consider the variable selection problem, which seeks to identify impo...
research
02/10/2021

Bayesian Knockoff Filter Using Gibbs Sampler

In many fields, researchers are interested in discovering features with ...
research
09/28/2022

Consensus Knowledge Graph Learning via Multi-view Sparse Low Rank Block Model

Network analysis has been a powerful tool to unveil relationships and in...

Please sign up or login with your details

Forgot password? Click here to reset