Semiparametric efficient estimation of genetic relatedness with double machine learning

04/04/2023
by   Xu Guo, et al.
0

In this paper, we propose double machine learning procedures to estimate genetic relatedness between two traits in a model-free framework. Most existing methods require specifying certain parametric models involving the traits and genetic variants. However, the bias due to model mis-specification may yield misleading statistical results. Moreover, the semiparametric efficient bounds for estimators of genetic relatedness are still lacking. In this paper, we develop semi-parametric efficient and model-free estimators and construct valid confidence intervals for two important measures of genetic relatedness: genetic covariance and genetic correlation, allowing both continuous and discrete responses. Based on the derived efficient influence functions of genetic relatedness, we propose a consistent estimator of the genetic covariance as long as one of genetic values is consistently estimated. The data of two traits may be collected from the same group or different groups of individuals. Various numerical studies are performed to illustrate our introduced procedures. We also apply proposed procedures to analyze Carworth Farms White mice genome-wide association study data.

READ FULL TEXT
research
06/08/2021

A Unified Approach to Robust Inference for Genetic Covariance

Genome-wide association studies (GWAS) have identified thousands of gene...
research
03/04/2019

On genetic correlation estimation with summary statistics from genome-wide association studies

Genome-wide association studies (GWAS) have been widely used to examine ...
research
10/22/2020

Object-Attribute Biclustering for Elimination of Missing Genotypes in Ischemic Stroke Genome-Wide Data

Missing genotypes can affect the efficacy of machine learning approaches...
research
02/10/2014

Genomic Prediction of Quantitative Traits using Sparse and Locally Epistatic Models

In plant and animal breeding studies a distinction is made between the g...
research
10/21/2022

Comparison of REML methods for the study of phenome-wide genetic variation

It is now well documented that genetic covariance between functionally r...
research
03/22/2022

On block-wise and reference panel-based estimators for genetic data prediction in high dimensions

Genetic prediction of complex traits and diseases has attracted enormous...
research
07/02/2020

Floodgate: inference for model-free variable importance

Many modern applications seek to understand the relationship between an ...

Please sign up or login with your details

Forgot password? Click here to reset