Gaussian Processes with Errors in Variables: Theory and Computation

by   Shuang Zhou, et al.

Covariate measurement error in nonparametric regression is a common problem in nutritional epidemiology and geostatistics, and other fields. Over the last two decades, this problem has received substantial attention in the frequentist literature. Bayesian approaches for handling measurement error have only been explored recently and are surprisingly successful, although the lack of a proper theoretical justification regarding the asymptotic performance of the estimators. By specifying a Gaussian process prior on the regression function and a Dirichlet process Gaussian mixture prior on the unknown distribution of the unobserved covariates, we show that the posterior distribution of the regression function and the unknown covariates density attain optimal rates of contraction adaptively over a range of Hölder classes, up to logarithmic terms. This improves upon the existing classical frequentist results which require knowledge of the smoothness of the underlying function to deliver optimal risk bounds. We also develop a novel surrogate prior for approximating the Gaussian process prior that leads to efficient computation and preserves the covariance structure, thereby facilitating easy prior elicitation. We demonstrate the empirical performance of our approach and compare it with competitors in a wide range of simulation experiments and a real data example.



page 14

page 17

page 35

page 36

page 37

page 38


Convergence of Gaussian Process Regression with Estimated Hyper-parameters and Applications in Bayesian Inverse Problems

This work is concerned with the convergence of Gaussian process regressi...

Propriety of the reference posterior distribution in Gaussian Process regression

In a seminal article, Berger, De Oliveira and Sansó (2001) compare sever...

Dimension adaptability of Gaussian process models with variable selection and projection

It is now known that an extended Gaussian process model equipped with re...

Regression modelling with I-priors

We introduce the I-prior methodology as a unifying framework for estimat...

Frequentist coverage and sup-norm convergence rate in Gaussian process regression

Gaussian process (GP) regression is a powerful interpolation technique d...

Mixtures of Gaussian Processes for regression under multiple prior distributions

When constructing a Bayesian Machine Learning model, we might be faced w...

Posterior Convergence of Gaussian and General Stochastic Process Regression Under Possible Misspecifications

In this article, we investigate posterior convergence in nonparametric r...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.