The Mahalanobis kernel for heritability estimation in genome-wide association studies: fixed-effects and random-effects methods

01/09/2019
by Ruijun Ma, et al.

Linear mixed models (LMMs) are widely used for heritability estimation in genome-wide association studies (GWAS). Standard approaches to heritability estimation with LMMs require a genetic relationship matrix (GRM) to be specified. In GWAS, the GRM is frequently a correlation matrix estimated from the study population's genotypes, which corresponds to a normalized Euclidean distance kernel. In this paper, we show that reliance on the Euclidean distance kernel contributes to several unresolved modeling inconsistencies in heritability estimation for GWAS. These inconsistencies can bias heritability estimates in the presence of linkage disequilibrium (LD), depending on the distribution of causal variants. We show that these biases can be resolved (at least at the modeling level) by adopting a Mahalanobis distance-based GRM for LMM analysis. Additionally, we propose a new definition of partitioned heritability -- the heritability attributable to a subset of genes or single nucleotide polymorphisms (SNPs) -- using the Mahalanobis GRM, and show that it inherits many of the desirable consistency properties identified in our original analysis. Partitioned heritability is a relatively new area of GWAS analysis in which LD-related inconsistencies are known to be especially pernicious.
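To make the contrast between the two kernels concrete, here is a minimal sketch in Python. It is an illustration under stated assumptions, not the authors' implementation: it assumes the Euclidean-type GRM takes the common form Z Zᵀ / m for a standardized genotype matrix Z (n individuals, m SNPs), and that a Mahalanobis-type GRM takes the form Z Σ⁻¹ Zᵀ / m, where Σ is the SNP-by-SNP LD (correlation) matrix. The function names, the toy simulator, the `ridge` parameter, and the use of a sample LD matrix with n > m are all hypothetical choices made for this example; the abstract does not specify them.

```python
import numpy as np


def simulate_genotypes(n, m, rho, rng):
    """Toy simulator (hypothetical): 0/1/2 genotypes with AR(1)-style LD."""
    def haplotype():
        latent = rng.standard_normal((n, m))
        for j in range(1, m):
            # Each SNP's latent value is correlated with its left neighbour.
            latent[:, j] = rho * latent[:, j - 1] + np.sqrt(1 - rho**2) * latent[:, j]
        return (latent > 0).astype(int)
    # A genotype is the sum of two independently drawn haplotypes.
    return haplotype() + haplotype()


def standardize(G):
    """Center and scale each SNP column (assumes no monomorphic SNPs)."""
    G = np.asarray(G, dtype=float)
    return (G - G.mean(axis=0)) / G.std(axis=0)


def euclidean_grm(Z):
    """Correlation-type GRM, Z Z^T / m (assumed form)."""
    m = Z.shape[1]
    return Z @ Z.T / m


def mahalanobis_grm(Z, ridge=1e-6):
    """Mahalanobis-type GRM, Z Sigma^{-1} Z^T / m (assumed form).

    Sigma is the sample LD matrix; a small ridge term (an assumption of
    this sketch) keeps the linear solve numerically stable.
    """
    n, m = Z.shape
    sigma = Z.T @ Z / n + ridge * np.eye(m)
    return Z @ np.linalg.solve(sigma, Z.T) / m


rng = np.random.default_rng(0)
G = simulate_genotypes(n=200, m=20, rho=0.8, rng=rng)
Z = standardize(G)
K_euc = euclidean_grm(Z)
K_mah = mahalanobis_grm(Z)
```

In this toy setting the two matrices have the same average diagonal but differ elsewhere, because the Mahalanobis form down-weights directions in genotype space that are over-represented by correlated SNPs. With more SNPs than individuals the sample LD matrix is singular, so a real analysis would need a different estimate of Σ (for example, from a reference panel); that choice is outside the scope of this sketch.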

research
11/21/2018

Distinguishing correlation from causation using genome-wide association studies

Genome-wide association studies (GWAS) have emerged as a rich source of ...
research
11/05/2021

Tradeoffs of Linear Mixed Models in Genome-wide Association Studies

Motivated by empirical arguments that are well-known from the genome-wid...
research
10/01/2022

Federated Generalized Linear Mixed Models for Collaborative Genome-wide Association Studies

As the sequencing costs are decreasing, there is great incentive to perf...
research
02/24/2022

Analysis of Genotype-Phenotype Association using Fields and Information Theory

We show how field- and information theory can be used to quantify the re...
research
06/27/2023

High-dimensional statistical inference for linkage disequilibrium score regression and its cross-ancestry extensions

Linkage disequilibrium score regression (LDSC) has emerged as an essenti...
research
07/10/2020

High heritability does not imply accurate prediction under the small additive effects hypothesis

Genome-Wide Association Studies (GWAS) explain only a small fraction of ...
