Generalized Goodness-Of-Fit Tests for Correlated Data

06/10/2018
by Hong Zhang, et al.

This paper concerns the application of generalized goodness-of-fit (gGOF) tests to correlated data. The gGOF family broadly covers maximum-based testing procedures built on ordered input p-values, such as the false discovery rate procedure, Kolmogorov-Smirnov type statistics, and the ϕ-divergence family. A data analysis framework and a novel p-value calculation approach are developed under the Gaussian mean model and the generalized linear model (GLM). We reveal the influence of data transformations on the signal-to-noise ratio and on statistical power under both sparse and dense signal patterns and various correlation structures. In particular, the innovated transformation (IT), which is shown to be equivalent to marginal model-fitting under the GLM, is often preferred for detecting sparse signals in correlated data. We propose a testing strategy called digGOF, which combines a double-adaptation procedure (i.e., adapting to both the statistic's formula and the truncation scheme of the input p-values) with the IT within the gGOF family. It features efficient computation and robust adaptation to the family-retained advantages for given data. Relevant approaches are assessed by extensive simulations and by genetic studies of Crohn's disease and amyotrophic lateral sclerosis. The computations are implemented in the R package SetTest, available on CRAN.
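To make the idea of a maximum-based statistic on ordered p-values concrete, the following is a minimal sketch of one commonly cited member of such families, a Higher Criticism type statistic, with a simple index truncation. It is not the paper's digGOF procedure and not the SetTest API. The function names `two_sided_pvalues` and `higher_criticism`, the parameters `k0` and `k1`, and the particular normalization are assumptions chosen for illustration. The sketch also treats the input p-values as given and does not perform the innovated transformation or any correlation-aware p-value calculation.

```python
import math


def two_sided_pvalues(z_scores):
    """Two-sided standard normal p-values: P(|Z| >= |z|) = erfc(|z| / sqrt(2))."""
    return [math.erfc(abs(z) / math.sqrt(2.0)) for z in z_scores]


def higher_criticism(pvalues, k0=1, k1=None):
    """Illustrative Higher Criticism type statistic (hypothetical helper).

    Computes max over k0 <= i <= k1 of
        sqrt(n) * (i/n - p_(i)) / sqrt(p_(i) * (1 - p_(i))),
    where p_(1) <= ... <= p_(n) are the ordered input p-values.
    The index range [k0, k1] plays the role of a simple truncation scheme.
    """
    n = len(pvalues)
    if k1 is None:
        k1 = n
    ordered = sorted(pvalues)
    best = -math.inf
    for i in range(k0, k1 + 1):
        p = ordered[i - 1]
        if not 0.0 < p < 1.0:
            continue  # skip degenerate p-values to avoid division by zero
        value = math.sqrt(n) * (i / n - p) / math.sqrt(p * (1.0 - p))
        best = max(best, value)
    return best


# Ten p-values on an evenly spaced grid (no signal), then the same set
# with the smallest p-value replaced by a very small one (one sparse signal).
null_like = [(i - 0.5) / 10 for i in range(1, 11)]
with_signal = [1e-6] + null_like[1:]

hc_null = higher_criticism(null_like)
hc_signal = higher_criticism(with_signal)
hc_truncated = higher_criticism(with_signal, k0=2)  # truncation drops the smallest p-value
```

In this toy input, a single very small p-value dominates the maximum, and truncating it away with `k0=2` removes that contribution. This is only meant to illustrate why the choice of statistic and truncation scheme matters; how the paper adapts over both is not reproduced here.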


