Optimal Estimation of Simultaneous Signals Using Absolute Inner Product with Applications to Integrative Genomics

01/24/2018
by   T. Tony Cai, et al.
0

Integrating the summary statistics from genome-wide association study (GWAS) and expression quantitative trait loci (eQTL) data provides a powerful way of identifying the genes whose expression levels are causally associated with complex diseases. A parameter that quantifies the genetic sharing (colocalisation) between disease phenotype and gene expression of a given gene based on the summary statistics is first introduced based on the mean values of two Gaussian sequences. Specifically, given two independent samples X∼ N(θ, I_n) and Y∼ N(μ, I_n), the parameter of interest is T(θ, μ)=n^-1∑_i=1^n |θ_i|· |μ_i|, a non-smooth functional, which characterizes the degree of shared signals between two absolute normal mean vectors |θ| and |μ|. Using approximation theory and Hermite polynomials, a sparse absolute colocalisation estimator (SpACE) is constructed and shown to be minimax rate optimal over sparse parameter spaces. Our simulation demonstrates that the proposed estimates out-perform other naive methods, resulting in smaller estimation errors. In addition, the methods are robust to the presence of block-wise correlated observations due to linkage equilibrium. The method is applied to an integrative analysis of heart failure genomics data sets and identifies several genes and biological pathways that are possibly causal to human heart failure.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/19/2022

Inference of nonlinear causal effects with GWAS summary data

Large-scale genome-wide association studies (GWAS) have offered an excit...
research
10/06/2017

Set-Based Tests for Genetic Association Using the Generalized Berk-Jones Statistic

Studying the effects of groups of Single Nucleotide Polymorphisms (SNPs)...
research
11/12/2018

Prediction of Alzheimer's disease-associated genes by integration of GWAS summary data and expression data

Alzheimer's disease is the most common cause of dementia. It is the fift...
research
04/30/2018

Joint Analysis of Individual-level and Summary-level GWAS Data by Leveraging Pleiotropy

A large number of recent genome-wide association studies (GWASs) for com...
research
04/29/2019

Genome analysis and pleiotropy assessment using causal networks with loss of function mutation and metabolomics

Background: Many genome-wide association studies have detected genomic r...
research
03/04/2019

On genetic correlation estimation with summary statistics from genome-wide association studies

Genome-wide association studies (GWAS) have been widely used to examine ...

Please sign up or login with your details

Forgot password? Click here to reset