Selection-adjusted inference: an application to confidence intervals for cis-eQTL effect sizes

01/26/2018
by   Snigdha Panigrahi, et al.
0

The goal of eQTL studies is to identify the genetic variants that influence the expression levels of the genes in an organism. High throughput technology has made such studies possible: in a given tissue sample, it enables us to quantify the expression levels of approximately 20,000 genes and to record the alleles present at millions of genetic polymorphisms. While obtaining this data is relatively cheap once a specimen is at hand, obtaining human tissue remains a costly endeavor. Thus, eQTL studies continue to be based on relatively small sample sizes, with this limitation particularly serious for tissues of most immediate medical relevance. Given the high dimensional nature of this datasets and the large number of hypotheses tested, the scientific community has adopted early on multiplicity adjustment procedures, which primarily control the false discoveries rate for the identification of genetic variants with influence on the expression levels. In contrast, a problem that has not received much attention to date is that of providing estimates of the effect sizes associated to these variants, in a way that accounts for the considerable amount of selection. We illustrate how the recently developed conditional inference approach can be deployed to obtain confidence intervals for the eQTL effect sizes with reliable coverage. The procedure we propose is based on a randomized hierarchical strategy that both reflects the steps typically adopted in state of the art investigations and introduces the use of randomness instead of data splitting to maximize the use of available data. Analysis of the GTEx Liver dataset (v6) suggests that naively obtained confidence intervals would likely not cover the true values of effect sizes and that the number of local genetic polymorphisms influencing the expression level of genes might be underestimated.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/31/2023

Optimal confidence interval for the difference of proportions

Estimating the probability of the binomial distribution is a basic probl...
research
06/02/2019

Confidence Intervals for Selected Parameters

Practical or scientific considerations often lead to selecting a subset ...
research
07/24/2018

Likelihood-based meta-analysis with few studies: Empirical and simulation studies

Standard random-effects meta-analysis methods perform poorly when applie...
research
05/17/2023

Fast computation of exact confidence intervals for randomized experiments with binary outcomes

Given a randomized experiment with binary outcomes, exact confidence int...
research
11/10/2019

Inference After Selecting Plausibly Valid Instruments with Application to Mendelian Randomization

Mendelian randomization (MR) is a popular method in genetic epidemiology...
research
05/16/2014

Selection Bias Correction and Effect Size Estimation under Dependence

We consider large-scale studies in which it is of interest to test a ver...
research
10/25/2019

Boosting heritability: estimating the genetic component of phenotypic variation with multiple sample splitting

Heritability is a central measure in genetics quantifying how much of th...

Please sign up or login with your details

Forgot password? Click here to reset