Zipper: Addressing degeneracy in algorithm-agnostic inference

06/29/2023
by   Geng Chen, et al.
0

The widespread use of black box prediction methods has sparked an increasing interest in algorithm/model-agnostic approaches for quantifying goodness-of-fit, with direct ties to specification testing, model selection and variable importance assessment. A commonly used framework involves defining a predictiveness criterion, applying a cross-fitting procedure to estimate the predictiveness, and utilizing the difference in estimated predictiveness between two models as the test statistic. However, even after standardization, the test statistic typically fails to converge to a non-degenerate distribution under the null hypothesis of equal goodness, leading to what is known as the degeneracy issue. To addresses this degeneracy issue, we present a simple yet effective device, Zipper. It draws inspiration from the strategy of additional splitting of testing data, but encourages an overlap between two testing data splits in predictiveness evaluation. Zipper binds together the two overlapping splits using a slider parameter that controls the proportion of overlap. Our proposed test statistic follows an asymptotically normal distribution under the null hypothesis for any fixed slider value, guaranteeing valid size control while enhancing power by effective data reuse. Finite-sample experiments demonstrate that our procedure, with a simple choice of the slider, works well across a wide range of settings.

READ FULL TEXT
research
07/20/2020

Testing goodness-of-fit and conditional independence with approximate co-sufficient sampling

Goodness-of-fit (GoF) testing is ubiquitous in statistics, with direct t...
research
10/20/2021

U-statistic based on overlapping sample spacings

For testing goodness of fit, we consider a class of U-statistics of over...
research
05/24/2022

Robust testing to compare regression curves

This paper focuses on the problem of testing the null hypothesis that th...
research
09/05/2022

GRASP: A Goodness-of-Fit Test for Classification Learning

Performance of classifiers is often measured in terms of average accurac...
research
05/12/2023

Distribution free MMD tests for model selection with estimated parameters

Several kernel based testing procedures are proposed to solve the proble...
research
04/07/2020

A unified approach for inference on algorithm-agnostic variable importance

In many applications, it is of interest to assess the relative contribut...
research
10/06/2022

Post-selection Inference in Multiverse Analysis (PIMA): an inferential framework based on the sign flipping score test

When analyzing data researchers make some decisions that are either arbi...

Please sign up or login with your details

Forgot password? Click here to reset