Learning Kernel Tests Without Data Splitting

06/03/2020
by   Jonas M. Kübler, et al.
4

Modern large-scale kernel-based tests such as maximum mean discrepancy (MMD) and kernelized Stein discrepancy (KSD) optimize kernel hyperparameters on a held-out sample via data splitting to obtain the most powerful test statistics. While data splitting results in a tractable null distribution, it suffers from a reduction in test power due to smaller test sample size. Inspired by the selective inference framework, we propose an approach that enables learning the hyperparameters and testing on the full sample without data splitting. Our approach can correctly calibrate the test in the presence of such dependency, and yield a test threshold in closed form. At the same significance level, our approach's test power is empirically larger than that of the data-splitting approach, regardless of its split proportion.

READ FULL TEXT
research
02/10/2021

An Optimal Witness Function for Two-Sample Testing

We propose data-dependent test statistics based on a one-dimensional wit...
research
03/02/2021

Significance tests of feature relevance for a blackbox learner

An exciting recent development is the uptake of deep learning in many sc...
research
06/14/2023

MMD-FUSE: Learning and Combining Kernels for Two-Sample Testing Without Data Splitting

We propose novel statistics which maximise the power of a two-sample tes...
research
09/05/2023

Maximum Mean Discrepancy Meets Neural Networks: The Radon-Kolmogorov-Smirnov Test

Maximum mean discrepancy (MMD) refers to a general class of nonparametri...
research
06/14/2019

A/B Testing Measurement Framework for Recommendation Models Based on Expected Revenue

We provide a method to determine whether a new recommendation system imp...
research
12/21/2021

Data blurring: sample splitting a single sample

Suppose we observe a random vector X from some distribution P in a known...
research
10/07/2021

A Fast and Effective Large-Scale Two-Sample Test Based on Kernels

Kernel two-sample tests have been widely used and the development of eff...

Please sign up or login with your details

Forgot password? Click here to reset