Latency-Aware Neural Architecture Search with Multi-Objective Bayesian Optimization

06/22/2021
by   David Eriksson, et al.

When tuning the architecture and hyperparameters of large machine learning models for on-device deployment, it is desirable to understand the optimal trade-offs between on-device latency and model accuracy. In this work, we leverage recent methodological advances in Bayesian optimization over high-dimensional search spaces and multi-objective Bayesian optimization to efficiently explore these trade-offs for a production-scale on-device natural language understanding model at Facebook.
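To make "optimal trade-offs" concrete: a configuration is on the Pareto front if no other configuration is both at least as fast and at least as accurate, with one of the two strictly better. The sketch below is not the authors' method and involves no Bayesian optimization; it only filters a fixed set of already-measured candidates. The function name `pareto_front` and the measurement values are hypothetical, chosen for illustration.

```python
from typing import List, Tuple


def pareto_front(points: List[Tuple[float, float]]) -> List[Tuple[float, float]]:
    """Return the (latency, accuracy) points not dominated by any other point.

    Lower latency is better; higher accuracy is better.
    """
    # Sort by latency ascending; break latency ties by accuracy descending.
    ordered = sorted(points, key=lambda p: (p[0], -p[1]))
    front: List[Tuple[float, float]] = []
    best_accuracy = float("-inf")
    for latency, accuracy in ordered:
        # Keep a point only if it is more accurate than every point
        # that is at least as fast.
        if accuracy > best_accuracy:
            front.append((latency, accuracy))
            best_accuracy = accuracy
    return front


# Hypothetical measurements: (latency in ms, accuracy). Not from the paper.
candidates = [(12.0, 0.91), (8.0, 0.88), (15.0, 0.90), (8.5, 0.86), (20.0, 0.93)]
front = pareto_front(candidates)
print(front)
```

Here the points (8.5, 0.86) and (15.0, 0.90) are dropped because a faster candidate is also more accurate. A multi-objective optimizer aims to find such a front with few evaluations rather than by measuring every candidate.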


