Heterogeneity-aware integrative analyses for ancestry-specific association studies

06/08/2023
by Aaron J. Molstad, et al.

Ancestry-specific proteome-wide association studies (PWAS) based on genetically predicted protein expression can reveal complex disease etiology specific to certain ancestral groups. These studies require ancestry-specific models for protein expression as a function of SNP genotypes. To improve protein expression prediction in ancestral populations historically underrepresented in genomic studies, we propose a new penalized maximum likelihood estimator for fitting ancestry-specific joint protein quantitative trait loci models. Our estimator borrows information across ancestral groups while allowing for heterogeneous error variances and regression coefficients. We propose an alternative parameterization of our model that makes the objective function convex and the penalty scale invariant. To improve computational efficiency, we propose an approximate version of our method and study its theoretical properties. Our method provides a substantial improvement in protein expression prediction accuracy in individuals of African ancestry and, in a downstream PWAS analysis, leads to the discovery of multiple associations between protein expression and blood lipid traits in the African ancestry population.
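
To illustrate why a reparameterization can give both convexity and scale invariance, here is a minimal sketch for a single group and a single response. It assumes the standard Gaussian reparameterization theta = beta / sigma, rho = 1 / sigma, which may differ from the parameterization the authors actually use; the abstract does not specify theirs. All function and variable names are hypothetical, and the penalty and the multi-group structure are omitted.

```python
import numpy as np

def original_nll(beta, sigma, X, y):
    """Gaussian negative log-likelihood (up to a constant) in (beta, sigma).
    This form is not jointly convex in (beta, sigma)."""
    n = y.shape[0]
    resid = y - X @ beta
    return n * np.log(sigma) + 0.5 * (resid @ resid) / sigma**2

def reparam_nll(theta, rho, X, y):
    """Same likelihood written in theta = beta / sigma and rho = 1 / sigma.
    A convex quadratic plus -n * log(rho), so jointly convex for rho > 0."""
    n = y.shape[0]
    resid = rho * y - X @ theta
    return -n * np.log(rho) + 0.5 * (resid @ resid)

def unpenalized_mle(X, y):
    """Unpenalized maximum likelihood estimates of (beta, sigma)."""
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    sigma = np.sqrt(np.mean((y - X @ beta) ** 2))
    return beta, sigma

# Simulated data for illustration only (not from the paper).
rng = np.random.default_rng(0)
n, p = 200, 5
X = rng.normal(size=(n, p))
y = X @ rng.normal(size=p) + 2.0 * rng.normal(size=n)

beta_hat, sigma_hat = unpenalized_mle(X, y)
theta_hat, rho_hat = beta_hat / sigma_hat, 1.0 / sigma_hat

# Rescaling the response (e.g. a change of units) rescales beta and sigma
# by the same factor, so theta = beta / sigma is unchanged.
beta_scaled, sigma_scaled = unpenalized_mle(X, 10.0 * y)
theta_scaled = beta_scaled / sigma_scaled
```

Under this assumed parameterization, a penalty placed on theta rather than beta would not depend on the units of the response, which is one way a penalty can be made scale invariant.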


