Selection Bias Correction and Effect Size Estimation under Dependence

05/16/2014
by   Kean Ming Tan, et al.
0

We consider large-scale studies in which it is of interest to test a very large number of hypotheses, and then to estimate the effect sizes corresponding to the rejected hypotheses. For instance, this setting arises in the analysis of gene expression or DNA sequencing data. However, naive estimates of the effect sizes suffer from selection bias, i.e., some of the largest naive estimates are large due to chance alone. Many authors have proposed methods to reduce the effects of selection bias under the assumption that the naive estimates of the effect sizes are independent. Unfortunately, when the effect size estimates are dependent, these existing techniques can have very poor performance, and in practice there will often be dependence. We propose an estimator that adjusts for selection bias under a recently-proposed frequentist framework, without the independence assumption. We study some properties of the proposed estimator, and illustrate that it outperforms past proposals in a simulation study and on two gene expression data sets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/15/2013

On Estimating Many Means, Selection Bias, and the Bootstrap

With recent advances in high throughput technology, researchers often fi...
research
04/30/2019

Estimating Proportion of True Null Hypotheses based on Sum of p-values and application to microarrays

A new estimator of proportion of true null hypotheses based on sum of al...
research
11/06/2022

An empirical likelihood approach to reduce selection bias in voluntary samples

We address the weighting problem in voluntary samples under a nonignorab...
research
02/17/2020

Estimating the number and effect sizes of non-null hypotheses

We study the problem of estimating the distribution of effect sizes (the...
research
04/10/2023

A Framework for Understanding Selection Bias in Real-World Healthcare Data

Using administrative patient-care data such as Electronic Health Records...
research
02/23/2018

Nonparametric Estimation of a distribution function from doubly truncated data under dependence

The NPMLE of a distribution function from doubly truncated data was intr...
research
01/26/2018

Selection-adjusted inference: an application to confidence intervals for cis-eQTL effect sizes

The goal of eQTL studies is to identify the genetic variants that influe...

Please sign up or login with your details

Forgot password? Click here to reset