Saddlepoint approximations in binary genome-wide association studies

10/08/2021
by   Pål Vegard Johnsen, et al.
0

We investigate saddlepoint approximations applied to the score test statistic in genome-wide association studies with binary phenotypes. The inaccuracy in the normal approximation of the score test statistic increases with increasing sample imbalance and with decreasing minor allele count. Applying saddlepoint approximations to the score test statistic distribution greatly improve the accuracy, even far out in the tail of the distribution. By using exact results for an intercept model and binary covariate model, as well as simulations for models with nuisance parameters, we emphasize the need for continuity corrections in order to achieve valid p-values. The performance of the saddlepoint approximations is evaluated by overall and conditional type I error rate on simulated data. We investigate the methods further by using data from UK Biobank with skin and soft tissue infections as phenotype, using both common and rare variants. The analysis confirms that continuity correction is important particularly for rare variants, and that the normal approximation gives a highly inflated type I error rate for case imbalance.

READ FULL TEXT

page 20

page 21

research
12/18/2017

Fast permutation tests and related methods, for association between rare variants and binary outcomes

In large scale genetic association studies, a primary aim is to test for...
research
02/20/2020

A Bayes Factor Approach with Informative Prior for Rare Genetic Variant Analysis from Next Generation Sequencing Data

The discovery of rare genetic variants through Next Generation Sequencin...
research
09/29/2021

A copula-based set-variant association test for bivariate continuous or mixed phenotypes

In genome wide association studies (GWAS), researchers are often dealing...
research
11/02/2020

Covariate Adaptive Family-wise Error Rate Control for Genome-Wide Association Studies

The family-wise error rate (FWER) has been widely used in genome-wide as...
research
08/10/2018

Genome-Wide Association Studies: Information Theoretic Limits of Reliable Learning

In the problems of Genome-Wide Association Study (GWAS), the objective i...
research
12/03/2021

Bayesian nonparametric strategies for power maximization in rare variants association studies

Rare variants are hypothesized to be largely responsible for heritabilit...
research
03/28/2018

Improving likelihood-based inference in control rate regression

Control rate regression is a diffuse approach to account for heterogenei...

Please sign up or login with your details

Forgot password? Click here to reset