Layered TPOT: Speeding up Tree-based Pipeline Optimization

01/18/2018
by   Pieter Gijsbers, et al.
0

With the demand for machine learning increasing, so does the demand for tools which make it easier to use. Automated machine learning (AutoML) tools have been developed to address this need, such as the Tree-Based Pipeline Optimization Tool (TPOT) which uses genetic programming to build optimal pipelines. We introduce Layered TPOT, a modification to TPOT which aims to create pipelines equally good as the original, but in significantly less time. This approach evaluates candidate pipelines on increasingly large subsets of the data according to their fitness, using a modified evolutionary algorithm to allow for separate competition between pipelines trained on different sample sizes. Empirical evaluation shows that, on sufficiently large datasets, Layered TPOT indeed finds better models faster.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/20/2016

Evaluation of a Tree-based Pipeline Optimization Tool for Automating Data Science

As the field of data science continues to grow, there will be an ever-in...
research
02/01/2023

Faster Convergence with Lexicase Selection in Tree-based Automated Machine Learning

In many evolutionary computation systems, parent selection methods can a...
research
01/28/2016

Automating biomedical data science through tree-based pipeline optimization

Over the past decade, data science and machine learning has grown from a...
research
02/18/2022

SapientML: Synthesizing Machine Learning Pipelines by Learning from Human-Written Solutions

Automatic machine learning, or AutoML, holds the promise of truly democr...
research
03/02/2018

Autostacker: A Compositional Evolutionary Learning System

We introduce an automatic machine learning (AutoML) modeling architectur...
research
06/02/2023

Automating Pipelines of A/B Tests with Population Split Using Self-Adaptation and Machine Learning

A/B testing is a common approach used in industry to facilitate innovati...
research
02/06/2017

Toward the automated analysis of complex diseases in genome-wide association studies using genetic programming

Machine learning has been gaining traction in recent years to meet the d...

Please sign up or login with your details

Forgot password? Click here to reset