A Population-Aware Retrospective Regression to Detect Genome-Wide Variants with Sex Difference in Allele Frequency

12/23/2022
by Zhong Wang, et al.

Sex difference in allele frequency is an emerging topic that is critical to our understanding of ascertainment bias and of data quality, particularly for the largely overlooked X chromosome. Existing methods for detecting sex differences in allele frequency at X chromosomal and autosomal variants are conservative when applied to samples from multiple ancestral populations, such as African and European populations. Additionally, it remains unexplored whether the sex difference in allele frequency differs between populations, which is important for trans-ancestral genetic studies. We therefore developed a novel retrospective regression-based testing framework that provides interpretable and easy-to-implement solutions to these questions. We then applied the proposed methods to the high-coverage whole genome sequence data of the 1000 Genomes Project, robustly analyzing all samples available from the five super-populations. By recognizing and modeling ancestral differences, we obtained 76 novel findings.
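
To make the idea of a population-aware test concrete, here is a minimal sketch for a single autosomal variant. It is not the retrospective regression proposed in the paper. It swaps in a simpler stratified analogue: the female-minus-male allele frequency difference is computed within each population, and the per-population differences are combined with inverse-variance weights. The function names, the input layout, and the binomial variance (which assumes Hardy-Weinberg equilibrium within each sex and population) are all assumptions made for illustration.

```python
from math import erfc, sqrt


def allele_freq(genotypes):
    """Alternate-allele frequency from autosomal genotypes coded 0/1/2."""
    return sum(genotypes) / (2 * len(genotypes))


def stratum_diff(female, male):
    """Female-minus-male allele frequency difference and its variance
    within one population (hypothetical helper for this sketch)."""
    pf, pm = allele_freq(female), allele_freq(male)
    # Binomial variance of each frequency; assumes Hardy-Weinberg equilibrium.
    var = pf * (1 - pf) / (2 * len(female)) + pm * (1 - pm) / (2 * len(male))
    return pf - pm, var


def stratified_sdaf_test(data):
    """Inverse-variance weighted z-test across populations.

    data maps a population label to (female_genotypes, male_genotypes).
    Returns (z, two_sided_p).
    """
    num = den = 0.0
    for female, male in data.values():
        diff, var = stratum_diff(female, male)
        if var == 0:
            continue  # monomorphic stratum carries no information
        num += diff / var
        den += 1 / var
    if den == 0:
        raise ValueError("no informative population")
    z = num / sqrt(den)             # weighted mean difference / its standard error
    p = erfc(abs(z) / sqrt(2))      # two-sided normal p-value
    return z, p
```

Because differences are taken within each population before they are combined, a variant whose frequency differs between populations but not between sexes does not produce a spurious signal merely because the sex ratio varies across populations.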


