Learning Deep Kernels for Non-Parametric Two-Sample Tests

02/21/2020
by   Feng Liu, et al.
17

We propose a class of kernel-based two-sample tests, which aim to determine whether two sets of samples are drawn from the same distribution. Our tests are constructed from kernels parameterized by deep neural nets, trained to maximize test power. These tests adapt to variations in distribution smoothness and shape over space, and are especially suited to high dimensions and complex data. By contrast, the simpler kernels used in prior kernel testing work are spatially homogeneous, and adaptive only in lengthscale. We explain how this scheme includes popular classifier-based two-sample tests as a special case, but improves on them in general. We provide the first proof of consistency for the proposed adaptation method, which applies both to kernels on deep features and to simpler radial basis kernels or multiple kernel learning. In experiments, we establish the superior performance of our deep kernels in hypothesis testing on benchmark and real-world data. The code of our deep-kernel-based two sample tests is available at https://github.com/fengliu90/DK-for-TST.

READ FULL TEXT

page 24

page 25

page 27

research
09/19/2019

Comparing distributions: ℓ_1 geometry improves kernel two-sample testing

Are two sets of observations drawn from the same distribution? This prob...
research
06/14/2021

Meta Two-Sample Testing: Learning Kernels for Testing with Limited Data

Modern kernel-based two-sample tests have shown great success in disting...
research
10/28/2021

MMD Aggregated Two-Sample Test

We propose a novel nonparametric two-sample test based on the Maximum Me...
research
06/14/2023

MMD-FUSE: Learning and Combining Kernels for Two-Sample Testing Without Data Splitting

We propose novel statistics which maximise the power of a two-sample tes...
research
06/14/2018

The Exact Equivalence of Distance and Kernel Methods for Hypothesis Testing

Distance-based methods, also called "energy statistics", are leading met...
research
08/17/2023

Kernel-Based Tests for Likelihood-Free Hypothesis Testing

Given n observations from two balanced classes, consider the task of lab...
research
06/20/2022

Multiple Testing Framework for Out-of-Distribution Detection

We study the problem of Out-of-Distribution (OOD) detection, that is, de...

Please sign up or login with your details

Forgot password? Click here to reset