Local Bandwidth Estimation via Mixture of Gaussian Processes

02/27/2019
by Danny Panknin, et al.

Real-world data often exhibit inhomogeneity: the complexity of the target function, the noise level, and other properties are not uniform over the input space. We address the problem of estimating the locally optimal kernel bandwidth as a way to describe this inhomogeneity. Estimated kernel bandwidths can be used not only to improve regression and classification performance, but also for Bayesian optimization and active learning, where more samples are needed in regions of higher function complexity and noise level. Our method, called kernel mixture of kernel experts regression (KMKER), follows the concept of mixture of experts: an ensemble of several complementary inference models, the so-called experts, in which a latent classifier, called the gate, first predicts the best-fitting expert for each test input. For the experts we implement Gaussian process regression models at different (global) bandwidths, and for the gate a multinomial kernel logistic regression model. The basic idea behind mixture of experts is that several distinct ground-truth functions over a joint input space drive the observations, and that one may want to disentangle them. Each expert is meant to model one of these incompatible functions, so each expert needs its own set of hyperparameters. We depart from that idea in that we assume only one ground-truth function, which however exhibits spatially inhomogeneous behavior. Under this assumption we share the hyperparameters among the experts, keeping their number constant. We compare KMKER to previous methods (which cope with inhomogeneity but do not provide the optimal bandwidth estimator) on artificial and benchmark data, and analyze its performance and interpretability on datasets from quantum chemistry. We also demonstrate how KMKER can be applied to automatic adaptive grid selection in fluid dynamics simulations.
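To make the structure described above concrete, the following is a minimal sketch and not the authors' implementation or training procedure. It assumes one-dimensional inputs, a Gaussian kernel, a noise variance shared by all experts, and a simple two-stage fit: each training point is labeled with the bandwidth whose Gaussian process expert has the smallest closed-form leave-one-out residual, and a multinomial kernel logistic regression gate is then fitted to those labels. All function names, the toy data, and the gate-weighted average used as a local bandwidth estimate are illustrative assumptions.

```python
import numpy as np

def rbf(a, b, bw):
    """Gaussian kernel matrix between 1-D input arrays a and b."""
    return np.exp(-0.5 * (a[:, None] - b[None, :]) ** 2 / bw ** 2)

def softmax(z):
    """Row-wise softmax, shifted for numerical stability."""
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def fit_experts(x, y, bandwidths, noise=1e-2):
    """One GP expert per bandwidth; the noise variance is shared (assumption).

    Returns the posterior-mean weights and the leave-one-out residuals,
    each with shape (n_experts, n_samples).
    """
    n = len(x)
    alphas, loo = [], []
    for bw in bandwidths:
        a_inv = np.linalg.inv(rbf(x, x, bw) + noise * np.eye(n))
        alpha = a_inv @ y
        alphas.append(alpha)
        # Closed-form leave-one-out residual of GP / kernel ridge regression.
        loo.append(alpha / np.diag(a_inv))
    return np.array(alphas), np.array(loo)

def fit_gate(x, labels, n_experts, gate_bw, reg=1e-3, lr=1.0, steps=300):
    """Multinomial kernel logistic regression with logits K @ W.

    Uses a functional-gradient step, i.e. the gradient of the regularized
    cross-entropy preconditioned by the inverse kernel matrix.
    """
    K = rbf(x, x, gate_bw)
    Y = np.eye(n_experts)[labels]            # one-hot expert labels
    W = np.zeros((len(x), n_experts))
    for _ in range(steps):
        P = softmax(K @ W)
        W -= lr * ((P - Y) / len(x) + reg * W)
    return W

def predict(x_new, x, alphas, W, bandwidths, gate_bw):
    """Gate-weighted prediction and a gate-weighted local bandwidth."""
    gate = softmax(rbf(x_new, x, gate_bw) @ W)
    means = np.stack(
        [rbf(x_new, x, bw) @ a for bw, a in zip(bandwidths, alphas)], axis=1
    )
    y_hat = (gate * means).sum(axis=1)
    local_bw = gate @ np.asarray(bandwidths)  # illustrative estimate only
    return y_hat, local_bw, gate

# Toy data whose complexity grows with x (hypothetical example).
rng = np.random.default_rng(0)
x = np.sort(rng.uniform(0.0, 1.0, 120))
y = np.sin(30.0 * x ** 3) + 0.1 * rng.standard_normal(x.size)

bandwidths = [0.02, 0.05, 0.15]
alphas, loo = fit_experts(x, y, bandwidths)
labels = np.argmin(loo ** 2, axis=0)          # best expert per training point
W = fit_gate(x, labels, len(bandwidths), gate_bw=0.1)
y_hat, local_bw, gate = predict(
    np.linspace(0.0, 1.0, 50), x, alphas, W, bandwidths, gate_bw=0.1
)
```

In this sketch `local_bw` is always a convex combination of the candidate bandwidths, so it can only interpolate between them. A joint training of gate and experts, as the mixture-of-experts framing suggests, would replace the two-stage labeling used here.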

