The Promises and Pitfalls of Deep Kernel Learning

02/24/2021
by   Sebastian W. Ober, et al.
0

Deep kernel learning and related techniques promise to combine the representational power of neural networks with the reliable uncertainty estimates of Gaussian processes. One crucial aspect of these models is an expectation that, because they are treated as Gaussian process models optimized using the marginal likelihood, they are protected from overfitting. However, we identify pathological behavior, including overfitting, on a simple toy example. We explore this pathology, explaining its origins and considering how it applies to real datasets. Through careful experimentation on UCI datasets, CIFAR-10, and the UTKFace dataset, we find that the overfitting from overparameterized deep kernel learning, in which the model is "somewhat Bayesian", can in certain scenarios be worse than that from not being Bayesian at all. However, we find that a fully Bayesian treatment of deep kernel learning can rectify this overfitting and obtain the desired performance improvements over standard neural networks and Gaussian processes.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/19/2023

Guided Deep Kernel Learning

Combining Gaussian processes with the expressive power of deep neural ne...
research
10/01/2021

Conditional Deep Gaussian Processes: empirical Bayes hyperdata learning

It is desirable to combine the expressive power of deep learning with Ga...
research
06/29/2018

Bayesian Deep Learning on a Quantum Computer

Bayesian methods in machine learning, such as Gaussian processes, have g...
research
06/10/2018

Building Bayesian Neural Networks with Blocks: On Structure, Interpretability and Uncertainty

We provide simple schemes to build Bayesian Neural Networks (BNNs), bloc...
research
11/11/2015

Training Deep Gaussian Processes using Stochastic Expectation Propagation and Probabilistic Backpropagation

Deep Gaussian processes (DGPs) are multi-layer hierarchical generalisati...
research
08/11/2022

Gaussian process surrogate models for neural networks

The lack of insight into deep learning systems hinders their systematic ...
research
10/04/2020

Deep kernel processes

We define deep kernel processes in which positive definite Gram matrices...

Please sign up or login with your details

Forgot password? Click here to reset