A Scalable and Cloud-Native Hyperparameter Tuning System

06/03/2020
by   Johnu George, et al.
0

In this paper, we introduce Katib: a scalable, cloud-native, and production-ready hyperparameter tuning system that is agnostic of the underlying machine learning framework. Though there are multiple hyperparameter tuning systems available, this is the first one that caters to the needs of both users and administrators of the system. We present the motivation and design of the system and contrast it with existing hyperparameter tuning systems, especially in terms of multi-tenancy, scalability, fault-tolerance, and extensibility. It can be deployed on local machines, or hosted as a service in on-premise data centers, or in private/public clouds. We demonstrate the advantage of our system using experimental results as well as real-world, production use cases. Katib has active contributors from multiple companies and is open-sourced at https://github.com/kubeflow/katib under the Apache 2.0 license.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/06/2019

Auptimizer – an Extensible, Open-Source Framework for Hyperparameter Tuning

Tuning machine learning models at scale, especially finding the right hy...
research
05/22/2020

MANGO: A Python Library for Parallel Hyperparameter Tuning

Tuning hyperparameters for machine learning algorithms is a tedious task...
research
07/25/2019

Optuna: A Next-generation Hyperparameter Optimization Framework

The purpose of this study is to introduce new design-criteria for next-g...
research
10/10/2022

PyHopper – Hyperparameter optimization

Hyperparameter tuning is a fundamental aspect of machine learning resear...
research
01/13/2023

Hyperparameter Optimization as a Service on INFN Cloud

The simplest and often most effective way of parallelizing the training ...
research
01/02/2018

M2: Malleable Metal as a Service

Existing bare-metal cloud services that provide users with physical node...
research
03/24/2021

A Multi-Tenant Framework for Cloud Container Services

Container technologies have been evolving rapidly in the cloud-native er...

Please sign up or login with your details

Forgot password? Click here to reset