Scalable Meta-Learning for Bayesian Optimization

02/06/2018
by   Matthias Feurer, et al.
0

Bayesian optimization has become a standard technique for hyperparameter optimization, including data-intensive models such as deep neural networks that may take days or weeks to train. We consider the setting where previous optimization runs are available, and we wish to use their results to warm-start a new optimization run. We develop an ensemble model that can incorporate the results of past optimization runs, while avoiding the poor scaling that comes with putting all results into a single Gaussian process model. The ensemble combines models from past runs according to estimates of their generalization performance on the current optimization. Results from a large collection of hyperparameter optimization benchmark problems and from optimization of a production computer vision platform at Facebook show that the ensemble can substantially reduce the time it takes to obtain near-optimal configurations, and is useful for warm-starting expensive searches or running quick re-optimizations.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/11/2019

Bayesian Hyperparameter Optimization with BoTorch, GPyTorch and Ax

Deep learning models are full of hyperparameters, which are set manually...
research
01/05/2018

Combination of Hyperband and Bayesian Optimization for Hyperparameter Optimization in Deep Learning

Deep learning has achieved impressive results on many problems. However,...
research
11/06/2018

Fast Hyperparameter Optimization of Deep Neural Networks via Ensembling Multiple Surrogates

The performance of deep neural networks crucially depends on good hyperp...
research
05/23/2016

Fast Bayesian Optimization of Machine Learning Hyperparameters on Large Datasets

Bayesian optimization has become a successful tool for hyperparameter op...
research
04/04/2022

Deep-Ensemble-Based Uncertainty Quantification in Spatiotemporal Graph Neural Networks for Traffic Forecasting

Deep-learning-based data-driven forecasting methods have produced impres...
research
04/25/2023

Bayesian Optimization Meets Self-Distillation

Bayesian optimization (BO) has contributed greatly to improving model pe...
research
01/11/2012

Distance-Based Bias in Model-Directed Optimization of Additively Decomposable Problems

For many optimization problems it is possible to define a distance metri...

Please sign up or login with your details

Forgot password? Click here to reset