On genetic correlation estimation with summary statistics from genome-wide association studies

03/04/2019
by   Bingxin Zhao, et al.
0

Genome-wide association studies (GWAS) have been widely used to examine the association between single nucleotide polymorphisms (SNPs) and complex traits, where both the sample size n and the number of SNPs p can be very large. Recently, cross-trait polygenic risk score (PRS) method has gained extremely popular for assessing genetic correlation of complex traits based on GWAS summary statistics (e.g., SNP effect size). However, empirical evidence has shown a common bias phenomenon that even highly significant cross-trait PRS can only account for a very small amount of genetic variance (R^2 often <1 aim of this paper is to develop a novel and powerful method to address the bias phenomenon of cross-trait PRS. We theoretically show that the estimated genetic correlation is asymptotically biased towards zero when complex traits are highly polygenic/omnigenic. When all p SNPs are used to construct PRS, we show that the asymptotic bias of PRS estimator is independent of the unknown number of causal SNPs m. We propose a consistent PRS estimator to correct such asymptotic bias. We also develop a novel estimator of genetic correlation which is solely based on two sets of GWAS summary statistics. In addition, we investigate whether or not SNP screening by GWAS p-values can lead to improved estimation and show the effect of overlapping samples among GWAS. Our results may help demystify and tackle the puzzling "missing genetic overlap" phenomenon of cross-trait PRS for dissecting the genetic similarity of closely related heritable traits. We illustrate the finite sample performance of our bias-corrected PRS estimator by using both numerical experiments and the UK Biobank data, in which we assess the genetic correlation between brain white matter tracts and neuropsychiatric disorders.

READ FULL TEXT
research
11/21/2018

Distinguishing correlation from causation using genome-wide association studies

Genome-wide association studies (GWAS) have emerged as a rich source of ...
research
03/23/2022

Estimating trans-ancestry genetic correlation with unbalanced data resources

The aim of this paper is to propose a novel estimation method of using g...
research
04/04/2023

Semiparametric efficient estimation of genetic relatedness with double machine learning

In this paper, we propose double machine learning procedures to estimate...
research
09/19/2017

Accurate Genomic Prediction Of Human Height

We construct genomic predictors for heritable and extremely complex huma...
research
03/29/2022

Power and Sample Size Computation for Genetic Association Studies of Binary Traits: Accounting for Covariate Effects

Power and sample size computation plays an important role in the design ...
research
02/21/2023

Breaking the Winner's Curse in Mendelian Randomization: Rerandomized Inverse Variance Weighted Estimator

Developments in genome-wide association studies and the increasing avail...
research
01/24/2018

Optimal Estimation of Simultaneous Signals Using Absolute Inner Product with Applications to Integrative Genomics

Integrating the summary statistics from genome-wide association study (G...

Please sign up or login with your details

Forgot password? Click here to reset