A Sparse Graph-Structured Lasso Mixed Model for Genetic Association with Confounding Correction

11/11/2017
by   Wenting Ye, et al.
0

While linear mixed model (LMM) has shown a competitive performance in correcting spurious associations raised by population stratification, family structures, and cryptic relatedness, more challenges are still to be addressed regarding the complex structure of genotypic and phenotypic data. For example, geneticists have discovered that some clusters of phenotypes are more co-expressed than others. Hence, a joint analysis that can utilize such relatedness information in a heterogeneous data set is crucial for genetic modeling. We proposed the sparse graph-structured linear mixed model (sGLMM) that can incorporate the relatedness information from traits in a dataset with confounding correction. Our method is capable of uncovering the genetic associations of a large number of phenotypes together while considering the relatedness of these phenotypes. Through extensive simulation experiments, we show that the proposed model outperforms other existing approaches and can model correlation from both population structure and shared signals. Further, we validate the effectiveness of sGLMM in the real-world genomic dataset on two different species from plants and humans. In Arabidopsis thaliana data, sGLMM behaves better than all other baseline models for 63.4 the potential causal genetic variation of Human Alzheimer's disease discovered by our model and justify some of the most important genetic loci.

READ FULL TEXT

page 3

page 16

research
08/12/2021

Understanding the population structure correction regression

Although genome-wide association studies (GWAS) on complex traits have a...
research
11/13/2008

A Multivariate Regression Approach to Association Analysis of Quantitative Trait Network

Many complex disease syndromes such as asthma consist of a large number ...
research
10/29/2017

A Fast, Accurate Two-Step Linear Mixed Model for Genetic Analysis Applied to Repeat MRI Measurements

Large-scale biobanks are being collected around the world in efforts to ...
research
05/03/2012

A powerful and efficient set test for genetic markers that handles confounders

Approaches for testing sets of variants, such as a set of rare or common...
research
09/12/2017

Identifying Genetic Risk Factors via Sparse Group Lasso with Group Graph Structure

Genome-wide association studies (GWA studies or GWAS) investigate the re...
research
04/26/2013

Supervised Heterogeneous Multiview Learning for Joint Association Study and Disease Diagnosis

Given genetic variations and various phenotypical traits, such as Magnet...
research
02/10/2014

Genomic Prediction of Quantitative Traits using Sparse and Locally Epistatic Models

In plant and animal breeding studies a distinction is made between the g...

Please sign up or login with your details

Forgot password? Click here to reset