GRASP: A Goodness-of-Fit Test for Classification Learning

09/05/2022
by   Adel Javanmard, et al.
0

Performance of classifiers is often measured in terms of average accuracy on test data. Despite being a standard measure, average accuracy fails in characterizing the fit of the model to the underlying conditional law of labels given the features vector (Y|X), e.g. due to model misspecification, over fitting, and high-dimensionality. In this paper, we consider the fundamental problem of assessing the goodness-of-fit for a general binary classifier. Our framework does not make any parametric assumption on the conditional law Y|X, and treats that as a black box oracle model which can be accessed only through queries. We formulate the goodness-of-fit assessment problem as a tolerance hypothesis testing of the form H_0: 𝔼[D_f( Bern(η(X)) Bern(η̂(X)))]≤τ , where D_f represents an f-divergence function, and η(x), η̂(x) respectively denote the true and an estimate likelihood for a feature vector x admitting a positive label. We propose a novel test, called for testing H_0, which works in finite sample settings, no matter the features (distribution-free). We also propose model-X designed for model-X settings where the joint distribution of the features vector is known. Model-X uses this distributional information to achieve better power. We evaluate the performance of our tests through extensive numerical experiments.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/20/2023

A Model-free Closeness-of-influence Test for Features in Supervised Learning

Understanding the effect of a feature vector x ∈ℝ^d on the response valu...
research
10/31/2017

Asymptotically Distribution-Free Goodness-of-Fit Testing for Copulas

Consider a random sample from a continuous multivariate distribution fun...
research
06/29/2023

Zipper: Addressing degeneracy in algorithm-agnostic inference

The widespread use of black box prediction methods has sparked an increa...
research
12/02/2021

Bayesian supervised predictive classification and hypothesis testing toolkit for partition exchangeability

Bayesian supervised predictive classifiers, hypothesis testing, and para...
research
05/23/2023

Goodness of fit by Neyman-Pearson testing

The Neyman-Pearson strategy for hypothesis testing can be employed for g...
research
02/20/2018

Actively Avoiding Nonsense in Generative Models

A generative model may generate utter nonsense when it is fit to maximiz...
research
01/30/2023

Active Sequential Two-Sample Testing

Two-sample testing tests whether the distributions generating two sample...

Please sign up or login with your details

Forgot password? Click here to reset