Testing goodness-of-fit and conditional independence with approximate co-sufficient sampling

07/20/2020
by   Rina Foygel Barber, et al.
0

Goodness-of-fit (GoF) testing is ubiquitous in statistics, with direct ties to model selection, confidence interval construction, conditional independence testing, and multiple testing, just to name a few applications. While testing the GoF of a simple (point) null hypothesis provides an analyst great flexibility in the choice of test statistic while still ensuring validity, most GoF tests for composite null hypotheses are far more constrained, as the test statistic must have a tractable distribution over the entire null model space. A notable exception is co-sufficient sampling (CSS): resampling the data conditional on a sufficient statistic for the null model guarantees valid GoF testing using any test statistic the analyst chooses. But CSS testing requires the null model to have a compact (in an information-theoretic sense) sufficient statistic, which only holds for a very limited class of models; even for a null model as simple as logistic regression, CSS testing is powerless. In this paper, we leverage the concept of approximate sufficiency to generalize CSS testing to essentially any parametric model with an asymptotically-efficient estimator; we call our extension "approximate CSS" (aCSS) testing. We quantify the finite-sample Type I error inflation of aCSS testing and show that it is vanishing under standard maximum likelihood asymptotics, for any choice of test statistic. We apply our proposed procedure both theoretically and in simulation to a number of models of interest to demonstrate its finite-sample Type I error and power.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/23/2018

A model specification test for the variance function in nonparametric regression

The problem of testing for the parametric form of the conditional varian...
research
09/14/2023

Approximate co-sufficient sampling with regularization

In this work, we consider the problem of goodness-of-fit (GoF) testing f...
research
06/29/2023

Zipper: Addressing degeneracy in algorithm-agnostic inference

The widespread use of black box prediction methods has sparked an increa...
research
06/12/2023

Large-Scale Multiple Testing of Composite Null Hypotheses Under Heteroskedasticity

Heteroskedasticity poses several methodological challenges in designing ...
research
07/02/2020

A New ECDF Two-Sample Test Statistic

Empirical cumulative distribution functions (ECDFs) have been used to te...
research
07/31/2019

Testing for Externalities in Network Formation Using Simulation

We discuss a simplified version of the testing problem considered by Pel...
research
01/08/2019

What is the dimension of a stochastic process? Testing for the rank of a covariance operator

How can we discern whether a mean-square continuous stochastic process i...

Please sign up or login with your details

Forgot password? Click here to reset