Simultaneous Selection of Multiple Important Single Nucleotide Polymorphisms in Familial Genome Wide Association Studies Data

02/04/2018
by   Subhabrata Majumdar, et al.
0

We propose a resampling-based fast variable selection technique for selecting important Single Nucleotide Polymorphisms (SNP) in multi-marker mixed effect models used in twin studies. Due to computational complexity, current practice includes testing the effect of one SNP at a time, commonly termed as `single SNP association analysis'. Joint modeling of genetic variants within a gene or pathway may have better power to detect the relevant genetic variants, hence we adapt our recently proposed framework of e-values to address this. In this paper, we propose a computationally efficient approach for single SNP detection in families while utilizing information on multiple SNPs simultaneously. We achieve this through improvements in two aspects. First, unlike other model selection techniques, our method only requires training a model with all possible predictors. Second, we utilize a fast and scalable bootstrap procedure that only requires Monte-Carlo sampling to obtain bootstrapped copies of the estimated vector of coefficients. Using this bootstrap sample, we obtain the e-value for each SNP, and select SNPs having e-values below a threshold. We illustrate through numerical studies that our method is more effective in detecting SNPs associated with a trait than either single-marker analysis using family data or model selection methods that ignore the familial dependency structure. We also use the e-values to perform gene-level analysis in nuclear families and detect several SNPs that have been implicated to be associated with alcohol consumption.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/05/2020

Gene-Environment Interaction: A Variable Selection Perspective

Gene-environment interactions have important implications to elucidate t...
research
04/09/2018

Deep Learning Classification of Polygenic Obesity using Genome Wide Association Study SNPs

In this paper, association results from genome-wide association studies ...
research
03/28/2018

BIVAS: A scalable Bayesian method for bi-level variable selection with applications

In this paper, we consider a Bayesian bi-level variable selection proble...
research
11/15/2021

An Approach of Bayesian Variable Selection for Ultrahigh Dimensional Multivariate Regression

In many practices, scientists are particularly interested in detecting w...
research
04/08/2018

eQTL Mapping via Effective SNP Ranking and Screening

Genome-wide eQTL mapping explores the relationship between gene expressi...
research
04/15/2014

Bayesian Neural Networks for Genetic Association Studies of Complex Disease

Discovering causal genetic variants from large genetic association studi...
research
01/06/2018

Utilising Deep Learning and Genome Wide Association Studies for Epistatic-Driven Preterm Birth Classification in African-American Women

Genome Wide Association Studies (GWAS) are used to identify statisticall...

Please sign up or login with your details

Forgot password? Click here to reset