Maximum Mean Discrepancy Meets Neural Networks: The Radon-Kolmogorov-Smirnov Test

09/05/2023
by   Seunghoon Paik, et al.
0

Maximum mean discrepancy (MMD) refers to a general class of nonparametric two-sample tests that are based on maximizing the mean difference over samples from one distribution P versus another Q, over all choices of data transformations f living in some function space ℱ. Inspired by recent work that connects what are known as functions of Radon bounded variation (RBV) and neural networks (Parhi and Nowak, 2021, 2023), we study the MMD defined by taking ℱ to be the unit ball in the RBV space of a given smoothness order k ≥ 0. This test, which we refer to as the Radon-Kolmogorov-Smirnov (RKS) test, can be viewed as a generalization of the well-known and classical Kolmogorov-Smirnov (KS) test to multiple dimensions and higher orders of smoothness. It is also intimately connected to neural networks: we prove that the witness in the RKS test – the function f achieving the maximum mean difference – is always a ridge spline of degree k, i.e., a single neuron in a neural network. This allows us to leverage the power of modern deep learning toolkits to (approximately) optimize the criterion that underlies the RKS test. We prove that the RKS test has asymptotically full power at distinguishing any distinct pair P ≠ Q of distributions, derive its asymptotic null distribution, and carry out extensive experiments to elucidate the strengths and weakenesses of the RKS test versus the more traditional kernel MMD test.

READ FULL TEXT

page 2

page 30

research
12/02/2020

Two-sample test based on maximum variance discrepancy

In this article, we introduce a novel discrepancy called the maximum var...
research
05/14/2015

Training generative neural networks via Maximum Mean Discrepancy optimization

We consider training a deep neural network to generate samples from an u...
research
06/03/2020

Learning Kernel Tests Without Data Splitting

Modern large-scale kernel-based tests such as maximum mean discrepancy (...
research
02/21/2023

Boosting the Power of Kernel Two-Sample Tests

The kernel two-sample test based on the maximum mean discrepancy (MMD) i...
research
10/22/2020

Maximum Mean Discrepancy is Aware of Adversarial Attacks

The maximum mean discrepancy (MMD) test, as a representative two-sample ...
research
09/25/2019

Classification Logit Two-sample Testing by Neural Networks

The recent success of generative adversarial networks and variational le...
research
09/20/2018

Exemplar-based synthesis of geology using kernel discrepancies and generative neural networks

We propose a framework for synthesis of geological images based on an ex...

Please sign up or login with your details

Forgot password? Click here to reset