Multiple competition based FDR control

07/02/2019
by   Kristen Emery, et al.

Competition-based FDR control has been commonly used for over a decade in the computational mass spectrometry community [7]. The approach gained significant popularity in other fields after Barber and Candès recently laid its theoretical foundation in a more general setting that, importantly, included the feature selection problem [1]. Here we consider competition-based FDR control where we can generate multiple competing null scores rather than a single one. We offer several methods that can take advantage of these multiple null scores, all of which are based on a novel procedure that rigorously controls the FDR in the finite-sample setting, provided its two tuning parameters are set without looking at the data. Because none of our methods clearly dominates the others in terms of power, we also develop a data-driven approach, based on a novel resampling procedure, that tries to select the most appropriate procedure for the problem at hand. Using extensive simulations, as well as real data, we show that all our procedures seem to largely control the FDR and that our data-driven approach offers an arguably optimal overall choice. Moreover, we show using real data that in the peptide detection problem our novel approach can increase the number of discovered peptides by up to 50
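For readers unfamiliar with the baseline, the following is a minimal sketch of the standard single-decoy competition procedure that the abstract builds on, not the multiple-competition method proposed in the paper. It assumes one target score and one null (decoy) score per hypothesis, distinct scores, and the familiar "+1" estimate of the FDR among the top-ranked winners. The function name `tdc_select` and the rule that a tie counts as a decoy win are illustrative assumptions.

```python
import numpy as np

def tdc_select(target_scores, decoy_scores, alpha):
    """Single-decoy competition sketch: return indices of discoveries.

    Assumes higher scores are better and that scores are distinct.
    """
    target_scores = np.asarray(target_scores, dtype=float)
    decoy_scores = np.asarray(decoy_scores, dtype=float)

    # Each hypothesis keeps the larger of its two scores; record who won.
    # Assumption: a tie is counted as a decoy win (conservative).
    win_score = np.maximum(target_scores, decoy_scores)
    is_target = target_scores > decoy_scores

    # Rank hypotheses by winning score, best first.
    order = np.argsort(-win_score, kind="stable")
    labels = is_target[order]

    # Running counts of target and decoy wins down the ranked list.
    n_targets = np.cumsum(labels)
    n_decoys = np.cumsum(~labels)

    # Estimated FDR among the top-k winners, with the +1 correction.
    est_fdr = (n_decoys + 1) / np.maximum(n_targets, 1)

    # Take the longest prefix whose estimated FDR is at most alpha.
    ok = np.nonzero(est_fdr <= alpha)[0]
    if ok.size == 0:
        return np.array([], dtype=int)
    k = ok[-1]
    top = order[: k + 1]
    return np.sort(top[labels[: k + 1]])

# Hypothetical scores, for illustration only.
targets = [10, 9, 8, 7, 1, 6]
decoys = [1, 2, 3, 4, 5, 0.5]
discoveries = tdc_select(targets, decoys, alpha=0.25)
```

With multiple null scores per hypothesis, the paper's procedures replace this one-to-one competition with a rule governed by two tuning parameters; the abstract does not spell out those details, so they are not sketched here.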


