End-to-End Learning of Deep Kernel Acquisition Functions for Bayesian Optimization

11/01/2021
by   Tomoharu Iwata, et al.

For Bayesian optimization (BO) on high-dimensional data with complex structure, neural network-based kernels for Gaussian processes (GPs) have been used to learn flexible surrogate functions by exploiting the high representational power of deep learning. However, existing methods train the neural networks by maximizing the marginal likelihood, which does not directly improve BO performance. In this paper, we propose a meta-learning method for BO with neural network-based kernels that minimizes the expected gap between the true optimum value and the best value found by BO. We model the policy, which takes the currently evaluated data points as input and outputs the next data point to be evaluated, as a neural network whose layers are a neural network-based kernel, a GP, and a mutual information-based acquisition function. With our model, the neural network-based kernel is trained to suit the acquisition function by backpropagating the gap through the acquisition function and the GP. The model is trained within a reinforcement learning framework on multiple tasks. Because the neural network is shared across tasks, knowledge about BO can be gathered from multiple training tasks and applied to unseen test tasks. In experiments on three text document datasets, we demonstrate that the proposed method achieves better BO performance than existing methods.
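The pipeline described in the abstract (kernel on learned features, GP posterior, acquisition function, gap to the true optimum) can be illustrated with a minimal forward-pass sketch. This is not the authors' implementation. It assumes a finite candidate pool, fixed random network weights instead of trained ones, and an upper confidence bound acquisition swapped in for the mutual information-based one. All function names and parameters (`features`, `deep_kernel`, `gp_posterior`, `run_bo`, `beta`, `noise`) are hypothetical.

```python
import numpy as np

def features(x, w1, w2):
    # Tiny two-layer network mapping inputs to an embedding.
    # The weights are illustrative; the paper trains them end to end.
    return np.tanh(np.tanh(x @ w1) @ w2)

def deep_kernel(a, b, w1, w2, lengthscale=1.0):
    # RBF kernel evaluated on network embeddings instead of raw inputs.
    fa, fb = features(a, w1, w2), features(b, w1, w2)
    sq_dist = ((fa[:, None, :] - fb[None, :, :]) ** 2).sum(-1)
    return np.exp(-0.5 * sq_dist / lengthscale**2)

def gp_posterior(x_obs, y_obs, x_cand, w1, w2, noise=1e-3):
    # Standard zero-mean GP regression equations with the kernel above.
    k_oo = deep_kernel(x_obs, x_obs, w1, w2) + noise * np.eye(len(x_obs))
    k_co = deep_kernel(x_cand, x_obs, w1, w2)
    mean = k_co @ np.linalg.solve(k_oo, y_obs)
    # Prior variance is 1 because the RBF kernel has unit diagonal.
    var = 1.0 - np.einsum("ij,ji->i", k_co, np.linalg.solve(k_oo, k_co.T))
    return mean, np.maximum(var, 0.0)

def run_bo(x_pool, y_pool, w1, w2, n_steps=10, beta=2.0, seed=0):
    # Pool-based BO loop (assumes n_steps <= len(x_pool)).
    rng = np.random.default_rng(seed)
    chosen = [int(rng.integers(len(x_pool)))]  # random first evaluation
    for _ in range(n_steps - 1):
        mean, var = gp_posterior(x_pool[chosen], y_pool[chosen], x_pool, w1, w2)
        # UCB acquisition, used here as a stand-in (assumption).
        score = mean + beta * np.sqrt(var)
        score[chosen] = -np.inf  # never re-evaluate a point
        chosen.append(int(np.argmax(score)))
    # Gap between the true optimum in the pool and the best value found.
    gap = y_pool.max() - y_pool[chosen].max()
    return chosen, gap
```

In the proposed method this gap is the training signal: it is propagated back through the acquisition function and the GP to the kernel network, across many tasks. The sketch only computes the gap for one task and performs no training.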


