Transfer Automatic Machine Learning

03/07/2018
by   Catherine Wong, et al.
0

Building effective neural networks requires many design choices. These include the network topology, optimization procedure, regularization, stability methods, and choice of pre-trained parameters. This design is time consuming and requires expert input. Automatic Machine Learning aims automate this process using hyperparameter optimization. However, automatic model building frameworks optimize performance on each task independently, whereas human experts leverage prior knowledge when designing a new network. We propose Transfer Automatic Machine Learning, a method to accelerate network design using knowledge of prior tasks. For this, we build upon reinforcement learning architecture design methods to support parallel training on multiple tasks and transfer the search strategy to new tasks. Tested on NLP and Image classification tasks, Transfer Automatic Machine Learning reduces convergence time over single-task methods by almost an order of magnitude on 13 out of 14 tasks. It achieves better test set accuracy on 10 out of 13 tasks NLP tasks and improves performance on CIFAR-10 image recognition from 95.3

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/30/2017

Transfer Learning to Learn with Multitask Neural Model Search

Deep learning models require extensive architecture design exploration a...
research
05/25/2021

Transfer Learning and Curriculum Learning in Sokoban

Transfer learning can speed up training in machine learning and is regul...
research
05/24/2019

Automatic Machine Learning by Pipeline Synthesis using Model-Based Reinforcement Learning and a Grammar

Automatic machine learning is an important problem in the forefront of m...
research
12/05/2018

Learning to Design Circuits

Analog IC design relies on human experts to search for parameters that s...
research
11/14/2020

TDAsweep: A Novel Dimensionality Reduction Method for Image Classification Tasks

One of the most celebrated achievements of modern machine learning techn...
research
11/03/2021

AlphaD3M: Machine Learning Pipeline Synthesis

We introduce AlphaD3M, an automatic machine learning (AutoML) system bas...
research
12/13/2020

Warm Starting CMA-ES for Hyperparameter Optimization

Hyperparameter optimization (HPO), formulated as black-box optimization ...

Please sign up or login with your details

Forgot password? Click here to reset