MMD Aggregated Two-Sample Test

10/28/2021
by   Antonin Schrab, et al.
3

We propose a novel nonparametric two-sample test based on the Maximum Mean Discrepancy (MMD), which is constructed by aggregating tests with different kernel bandwidths. This aggregation procedure, called MMDAgg, ensures that test power is maximised over the collection of kernels used, without requiring held-out data for kernel selection (which results in a loss of test power), or arbitrary kernel choices such as the median heuristic. We work in the non-asymptotic framework, and prove that our aggregated test is minimax adaptive over Sobolev balls. Our guarantees are not restricted to a specific kernel, but hold for any product of one-dimensional translation invariant characteristic kernels which are absolutely and square integrable. Moreover, our results apply for popular numerical procedures to determine the test threshold, namely permutations and the wild bootstrap. Through numerical experiments on both synthetic and real-world datasets, we demonstrate that MMDAgg outperforms alternative state-of-the-art approaches to MMD kernel adaptation for two-sample testing.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/02/2022

KSD Aggregated Goodness-of-fit Test

We investigate properties of goodness-of-fit tests based on the Kernel S...
research
02/21/2023

Boosting the Power of Kernel Two-Sample Tests

The kernel two-sample test based on the maximum mean discrepancy (MMD) i...
research
08/25/2020

A Kernel Two-Sample Test for Functional Data

We propose a nonparametric two-sample test procedure based on Maximum Me...
research
09/22/2015

Graph Kernels exploiting Weisfeiler-Lehman Graph Isomorphism Test Extensions

In this paper we present a novel graph kernel framework inspired the by ...
research
02/21/2020

Learning Deep Kernels for Non-Parametric Two-Sample Tests

We propose a class of kernel-based two-sample tests, which aim to determ...
research
06/14/2023

MMD-FUSE: Learning and Combining Kernels for Two-Sample Testing Without Data Splitting

We propose novel statistics which maximise the power of a two-sample tes...
research
10/22/2020

Maximum Mean Discrepancy is Aware of Adversarial Attacks

The maximum mean discrepancy (MMD) test, as a representative two-sample ...

Please sign up or login with your details

Forgot password? Click here to reset