A robust statistical method for Genome-wide association analysis of human copy number variation

11/15/2020
by   Han Wang, et al.
0

Conducting genome-wide association studies (GWAS) in copy number variation (CNV) level is a field where few people involves and little statistical progresses have been achieved, traditional methods suffer from many problems such as batch effects, heterogeneity across genome, leading to low power or high false discovery rate. We develop a new robust method to find disease-risking regions related to CNV's disproportionately distributed between case and control samples, even if there are batch effects between them, our test formula is robust to such effects. We propose a new empirical Bayes rule to deal with overfitting when estimating parameters during testing, this rule can be extended to the field of model selection, it can be more efficient compared with traditional methods when there are too much potential models to be specified. We also give solid theoretical guarantees for our proposed method, and demonstrate the effectiveness by simulation and realdata analysis.

READ FULL TEXT

page 5

page 6

page 11

page 16

page 23

page 32

page 34

page 35

research
09/29/2019

A Simple Yet Efficient Parametric Method of Local False Discovery Rate Estimation Designed for Genome-Wide Association Data Analysis

In genome-wide association studies (GWAS), hundreds of thousands of gene...
research
02/24/2022

Analysis of Genotype-Phenotype Association using Fields and Information Theory

We show how field- and information theory can be used to quantify the re...
research
11/02/2018

Brawn and Brains: a Robust and Powerful approach to X-inclusive Whole-genome Association Studies

X-chromosome is often excluded from whole-genome association studies due...
research
06/21/2018

Bayesian hierarchical models for SNP discovery from genome-wide association studies, a semi-supervised machine learning approach

Genome-wide association studies (GWASs) aim to detect genetic risk facto...
research
11/05/2021

Tradeoffs of Linear Mixed Models in Genome-wide Association Studies

Motivated by empirical arguments that are well-known from the genome-wid...
research
07/26/2019

Adjusting for Spatial Effects in Genomic Prediction

This paper investigates the problem of adjusting for spatial effects in ...
research
08/04/2016

Iterative Hard Thresholding for Model Selection in Genome-Wide Association Studies

A genome-wide association study (GWAS) correlates marker variation with ...

Please sign up or login with your details

Forgot password? Click here to reset