Comparison of REML methods for the study of phenome-wide genetic variation

10/21/2022
by   Damian Pavlyshyn, et al.

It is now well documented that genetic covariance between functionally related traits leads to an uneven distribution of genetic variation across multivariate trait combinations, and possibly to a large part of phenotype space being inaccessible to evolution. How the size of this nearly-null genetic space translates to the broader phenome level is unknown. High-dimensional phenotype data to address these questions are now within reach; however, incorporating these data into genetic analyses remains a challenge. Multi-trait genetic analyses of more than a handful of traits are slow and often fail to converge when fit with REML. This makes it challenging to estimate the genetic covariance (𝐆) underlying thousands of traits, let alone study its properties. We present a previously proposed REML algorithm that is feasible for high-dimensional genetic studies in the specific setting of a balanced nested half-sib design, which is common in quantitative genetics. We show that it substantially outperforms other common approaches when the number of traits is large, and we use it to investigate the bias in estimated eigenvalues of 𝐆 and the size of the nearly-null genetic subspace. We show that the high-dimensional biases observed are qualitatively similar to those substantiated by asymptotic approximation in the simpler setting of a sample covariance matrix based on i.i.d. vector observations, and that interpreting the estimated size of the nearly-null genetic subspace requires considerable caution in high-dimensional studies of genetic variation. Our results provide the foundation for future research characterizing the asymptotic approximation of estimated genetic eigenvalues, and a statistical null distribution for phenome-wide studies of genetic variation.
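The eigenvalue bias in the simpler i.i.d. setting can be illustrated with a small simulation. The sketch below is not the REML algorithm described above and does not model a half-sib design. It draws i.i.d. Gaussian vectors whose true covariance is the identity, so every true eigenvalue equals 1, and shows that the sample eigenvalues spread well above and below 1 when the number of traits is comparable to the number of observations. The function name `sample_cov_eigenvalues`, the sizes `n` and `p`, and the use of the Marchenko-Pastur edges as the asymptotic reference are assumptions made for illustration; the abstract does not specify which approximation is used.

```python
import numpy as np


def sample_cov_eigenvalues(n, p, seed=0):
    """Eigenvalues of the sample covariance of n i.i.d. N(0, I_p) vectors.

    Hypothetical helper for illustration only: the true covariance is the
    identity, so any spread in the returned eigenvalues is estimation error.
    """
    rng = np.random.default_rng(seed)
    X = rng.standard_normal((n, p))      # rows are observations
    S = np.cov(X, rowvar=False)          # p x p sample covariance
    return np.linalg.eigvalsh(S)         # sorted in ascending order


n, p = 200, 100                          # assumed sizes, chosen for speed
eig = sample_cov_eigenvalues(n, p)

# Marchenko-Pastur support edges for aspect ratio gamma = p / n,
# used here as one standard asymptotic reference for this setting.
gamma = p / n
mp_lower = (1 - np.sqrt(gamma)) ** 2
mp_upper = (1 + np.sqrt(gamma)) ** 2

print("smallest and largest sample eigenvalue:", eig.min(), eig.max())
print("Marchenko-Pastur edges:", mp_lower, mp_upper)
print("mean sample eigenvalue:", eig.mean())
```

Although the average eigenvalue stays close to the true value of 1, the smallest sample eigenvalues fall far below it. This is the kind of effect that can make a subspace look nearly null when it is not, which is why the estimated size of such a subspace needs cautious interpretation.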

