A New Statistical Framework for Genetic Pleiotropic Analysis of High Dimensional Phenotype Data

12/03/2015
by   Panpan Wang, et al.
0

The widely used genetic pleiotropic analysis of multiple phenotypes are often designed for examining the relationship between common variants and a few phenotypes. They are not suited for both high dimensional phenotypes and high dimensional genotype (next-generation sequencing) data. To overcome these limitations, we develop sparse structural equation models (SEMs) as a general framework for a new paradigm of genetic analysis of multiple phenotypes. To incorporate both common and rare variants into the analysis, we extend the traditional multivariate SEMs to sparse functional SEMs. To deal with high dimensional phenotype and genotype data, we employ functional data analysis and the alternative direction methods of multiplier (ADMM) techniques to reduce data dimension and improve computational efficiency. Using large scale simulations we showed that the proposed methods have higher power to detect true causal genetic pleiotropic structure than other existing methods. Simulations also demonstrate that the gene-based pleiotropic analysis has higher power than the single variant-based pleiotropic analysis. The proposed method is applied to exome sequence data from the NHLBI Exome Sequencing Project (ESP) with 11 phenotypes, which identifies a network with 137 genes connected to 11 phenotypes and 341 edges. Among them, 114 genes showed pleiotropic genetic effects and 45 genes were reported to be associated with phenotypes in the analysis or other cardiovascular disease (CVD) related phenotypes in the literature.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/27/2014

A General Statistic Framework for Genome-based Disease Risk Prediction

Advances of modern sensing and sequencing technologies generate a deluge...
research
01/15/2013

An Efficient Sufficient Dimension Reduction Method for Identifying Genetic Variants of Clinical Significance

Fast and cheaper next generation sequencing technologies will generate u...
research
04/08/2018

eQTL Mapping via Effective SNP Ranking and Screening

Genome-wide eQTL mapping explores the relationship between gene expressi...
research
12/19/2017

Optimal P-value Weighting with Independent Information

The large-scale multiple testing inherent to high throughput biological ...
research
06/27/2023

A new classification framework for high-dimensional data

Classification is a classic problem but encounters lots of challenges wh...
research
12/12/2018

Association Analysis of Common and Rare SNVs using Adaptive Fisher Method to Detect Dense and Sparse Signals

The development of next generation sequencing (NGS) technology and genot...

Please sign up or login with your details

Forgot password? Click here to reset