Multi-source Deep Gaussian Process Kernel Learning

02/07/2020
by   Chi-Ken Lu, et al.
56

For many problems, relevant data are plentiful but explicit knowledge is not. Predictions about target variables may be informed by data sources that are noisy but plentiful, or data which the target variable is merely some function of. Intrepretable and flexible machine learning methods capable of fusing data across sources are lacking. We generalize the Deep Gaussian Processes so that GPs in intermediate layers can represent the posterior distribution summarizing the data from a related source. We model the prior-posterior stacking DGP with a single GP. The exact second moment of DGP is calculated analytically, and is taken as the kernel function for GP. The result is a kernel that captures effective correlation through function composition, reflects the structure of the observations from other data sources, and can be used to inform prediction based on limited direct observations. Therefore, the approximation of prior-posterior DGP can be considered a novel kernel composition which blends the kernels in different layers and have explicit dependence on the data. We consider two synthetic multi-source prediction problems: a) predicting a target variable that is merely a function of the source data and b) predicting noise-free data using a kernel trained on noisy data. Our method produces better prediction and tighter uncertainty on the synthetic data when comparing with standard GP and other DGP method, suggesting that our data-informed approximate DGPs are a powerful tool for integrating data across sources.

READ FULL TEXT
research
12/21/2020

Learning Compositional Sparse Gaussian Processes with a Shrinkage Prior

Choosing a proper set of kernel functions is an important problem in lea...
research
07/01/2013

Dimensionality Detection and Integration of Multiple Data Sources via the GP-LVM

The Gaussian Process Latent Variable Model (GP-LVM) is a non-linear prob...
research
05/12/2021

A new framework for experimental design using Bayesian Evidential Learning: the case of wellhead protection area

In this contribution, we predict the wellhead protection area (WHPA, tar...
research
10/01/2021

Conditional Deep Gaussian Processes: empirical Bayes hyperdata learning

It is desirable to combine the expressive power of deep learning with Ga...
research
06/11/2021

Measuring the sensitivity of Gaussian processes to kernel choice

Gaussian processes (GPs) are used to make medical and scientific decisio...
research
12/08/2022

GenSyn: A Multi-stage Framework for Generating Synthetic Microdata using Macro Data Sources

Individual-level data (microdata) that characterizes a population, is es...
research
09/21/2023

Stochastic stiffness identification and response estimation of Timoshenko beams via physics-informed Gaussian processes

Machine learning models trained with structural health monitoring data h...

Please sign up or login with your details

Forgot password? Click here to reset