Optimal P-value Weighting with Independent Information

12/19/2017
by Mohamad S. Hasan, et al.

The large-scale multiple testing inherent to high-throughput biological data necessitates very high statistical stringency, so true effects are difficult to detect unless their effect sizes are large. One solution to this problem is to use independent information to prioritize the most promising features of the data and thus increase the power to detect them. Weighted p-values provide a general framework for doing this in a statistically rigorous fashion. However, calculating weights that incorporate the independent information and optimize statistical power remains a challenging problem despite recent advances in this area. Existing methods tend to perform poorly in the common situation where true positive features are rare and of low effect size. We introduce covariate-based weighting methods for calculating optimal weights conditioned on the effect sizes of the tests. This approach uses the probabilistic relationship between the covariate and the test effect size to calculate more informative weights that are not diluted by null effects, as is common with group-based methods. This relationship can be calculated theoretically for normally distributed covariates or estimated empirically in other cases. We show via simulations and applications to data that this method outperforms existing methods by a large margin in the rare/low effect size scenario and has at least comparable performance in all scenarios.
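To make the weighted p-value framework concrete, here is a minimal sketch of a generic weighted Benjamini-Hochberg procedure: each p-value is divided by a non-negative weight, the weights are rescaled to average one, and the usual step-up rule is applied to the weighted p-values. This is not the authors' method for computing optimal weights; the function name `weighted_bh`, its parameters, and the example weights are hypothetical and chosen only for illustration.

```python
import numpy as np


def weighted_bh(pvalues, weights, alpha=0.05):
    """Generic weighted Benjamini-Hochberg step-up procedure (illustrative sketch).

    Returns a boolean array marking which hypotheses are rejected.
    The weights here are supplied by the caller; computing optimal
    weights from a covariate is outside the scope of this sketch.
    """
    p = np.asarray(pvalues, dtype=float)
    w = np.asarray(weights, dtype=float)
    if p.shape != w.shape:
        raise ValueError("pvalues and weights must have the same shape")
    if np.any(w < 0) or w.sum() <= 0:
        raise ValueError("weights must be non-negative with a positive sum")

    m = p.size
    # Rescale so the weights average to one, which the weighted
    # procedure requires for its error-rate guarantee.
    w = w * (m / w.sum())

    # Weighted p-values; a zero weight means the test can never be rejected.
    q = np.full(m, np.inf)
    positive = w > 0
    q[positive] = p[positive] / w[positive]

    # Standard BH step-up rule applied to the weighted p-values.
    order = np.argsort(q)
    thresholds = alpha * np.arange(1, m + 1) / m
    below = q[order] <= thresholds

    reject = np.zeros(m, dtype=bool)
    if below.any():
        k = np.nonzero(below)[0].max()
        reject[order[: k + 1]] = True
    return reject


if __name__ == "__main__":
    pvals = [0.03, 0.04, 0.5, 0.6]
    # Equal weights reduce to the ordinary (unweighted) BH procedure.
    print(weighted_bh(pvals, [1, 1, 1, 1]))
    # Hypothetical weights that up-weight the first two tests.
    print(weighted_bh(pvals, [2, 2, 0, 0]))
```

In this toy input, neither of the two small p-values passes the unweighted thresholds, while concentrating the weight on them lets both be rejected. That is the mechanism by which informative weights can raise power when true effects are rare and weak.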


