Bayesian Kernel Two-Sample Testing

02/13/2020
by Qinyi Zhang, et al.

In modern data analysis, nonparametric measures of discrepancy between random variables are particularly important. The subject is well studied in the frequentist literature, whereas development in the Bayesian setting is limited, and applications there are often restricted to univariate cases. Here, we propose a Bayesian kernel two-sample testing procedure that models the difference between kernel mean embeddings in the reproducing kernel Hilbert space, using the framework established by Flaxman et al. (2016). The use of kernel methods enables its application to random variables in generic domains beyond multivariate Euclidean spaces. The proposed procedure yields a posterior inference scheme that allows automatic selection of the kernel parameters relevant to the problem at hand. In a series of synthetic experiments and two real data experiments (testing network heterogeneity from high-dimensional data and comparing six-membered monocyclic ring conformations), we illustrate the advantages of our approach.
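
As a rough illustration of the quantity the abstract refers to, the sketch below computes the empirical difference between two kernel mean embeddings (the witness function) and its squared RKHS norm, the biased maximum mean discrepancy estimate. This is the standard frequentist plug-in quantity, not the Bayesian procedure proposed in the paper. The Gaussian kernel, the fixed lengthscale, the synthetic data, and all function names are assumptions made for illustration only.

```python
import numpy as np


def gaussian_kernel(a, b, lengthscale=1.0):
    """Gaussian kernel matrix between rows of a (n, d) and rows of b (m, d)."""
    sq_dists = (
        np.sum(a**2, axis=1)[:, None]
        + np.sum(b**2, axis=1)[None, :]
        - 2.0 * a @ b.T
    )
    # Clip tiny negative values caused by floating-point round-off.
    return np.exp(-np.maximum(sq_dists, 0.0) / (2.0 * lengthscale**2))


def witness(x, y, points, lengthscale=1.0):
    """Empirical mean-embedding difference evaluated at each row of `points`."""
    mean_embed_x = gaussian_kernel(points, x, lengthscale).mean(axis=1)
    mean_embed_y = gaussian_kernel(points, y, lengthscale).mean(axis=1)
    return mean_embed_x - mean_embed_y


def mmd2_biased(x, y, lengthscale=1.0):
    """Squared RKHS norm of the empirical mean-embedding difference."""
    kxx = gaussian_kernel(x, x, lengthscale).mean()
    kyy = gaussian_kernel(y, y, lengthscale).mean()
    kxy = gaussian_kernel(x, y, lengthscale).mean()
    return kxx + kyy - 2.0 * kxy


# Synthetic data (assumed for illustration): y is a mean-shifted copy of
# the distribution of x, while x2 is a fresh sample from the same one.
rng = np.random.default_rng(0)
x = rng.normal(0.0, 1.0, size=(200, 2))
y = rng.normal(1.0, 1.0, size=(200, 2))
x2 = rng.normal(0.0, 1.0, size=(200, 2))

mmd_diff = mmd2_biased(x, y)
mmd_same = mmd2_biased(x, x2)
print("different distributions:", mmd_diff)
print("same distribution:      ", mmd_same)
```

The lengthscale is fixed by hand here; the abstract states that the proposed posterior inference scheme selects kernel parameters automatically, which this sketch does not attempt to reproduce.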


