Trees Assembling Mann Whitney Approach for Detecting Genome-wide Joint Association among Low Marginal Effect loci

05/05/2015
by   Changshuai Wei, et al.
0

Common complex diseases are likely influenced by the interplay of hundreds, or even thousands, of genetic variants. Converging evidence shows that genetic variants with low marginal effects (LME) play an important role in disease development. Despite their potential significance, discovering LME genetic variants and assessing their joint association on high dimensional data (e.g., genome wide association studies) remain a great challenge. To facilitate joint association analysis among a large ensemble of LME genetic variants, we proposed a computationally efficient and powerful approach, which we call Trees Assembling Mann whitney (TAMW). Through simulation studies and an empirical data application, we found that TAMW outperformed multifactor dimensionality reduction (MDR) and the likelihood ratio based Mann whitney approach (LRMW) when the underlying complex disease involves multiple LME loci and their interactions. For instance, in a simulation with 20 interacting LME loci, TAMW attained a higher power (power=0.931) than both MDR (power=0.599) and LRMW (power=0.704). In an empirical study of 29 known Crohn's disease (CD) loci, TAMW also identified a stronger joint association with CD than those detected by MDR and LRMW. Finally, we applied TAMW to Wellcome Trust CD GWAS to conduct a genome wide analysis. The analysis of 459K single nucleotide polymorphisms was completed in 40 hours using parallel computing, and revealed a joint association predisposing to CD (p-value=2.763e-19). Further analysis of the newly discovered association suggested that 13 genes, such as ATG16L1 and LACC1, may play an important role in CD pathophysiological and etiological processes.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/17/2019

Genome-wide Causation Studies of Complex Diseases

Despite significant progress in dissecting the genetic architecture of c...
research
11/10/2012

Efficient network-guided multi-locus association mapping with graph cuts

As an increasing number of genome-wide association studies reveal the li...
research
03/29/2022

Power and Sample Size Computation for Genetic Association Studies of Binary Traits: Accounting for Covariate Effects

Power and sample size computation plays an important role in the design ...
research
01/15/2013

An Efficient Sufficient Dimension Reduction Method for Identifying Genetic Variants of Clinical Significance

Fast and cheaper next generation sequencing technologies will generate u...
research
10/01/2022

Federated Generalized Linear Mixed Models for Collaborative Genome-wide Association Studies

As the sequencing costs are decreasing, there is great incentive to perf...
research
01/04/2018

Generalized Similarity U: A Non-parametric Test of Association Based on Similarity

Second generation sequencing technologies are being increasingly used fo...

Please sign up or login with your details

Forgot password? Click here to reset