Faster Convergence with Lexicase Selection in Tree-based Automated Machine Learning

02/01/2023
by Nicholas Matsumoto, et al.
In many evolutionary computation systems, the parent selection method can affect, among other things, convergence to a solution. In this paper, we compare two commonly used parent selection methods, lexicase selection and NSGA-II, for evolving machine learning pipelines in an automated machine learning system called the Tree-based Pipeline Optimization Tool (TPOT). Specifically, we demonstrate, using experiments on multiple datasets, that lexicase selection leads to significantly faster convergence than NSGA-II in TPOT. We also compare how these selection methods explore different parts of the search space, using a trie data structure that contains information about the pipelines explored in a particular run.
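
For readers unfamiliar with lexicase selection, the following is a minimal sketch of the generic algorithm, not TPOT's implementation. It assumes each individual has a per-case error vector where lower is better; the function name `lexicase_select` and the `errors` layout are hypothetical choices made for illustration, and how the paper defines "cases" for pipelines is not stated in the abstract.

```python
import random


def lexicase_select(errors, rng=random):
    """Return the index of one parent chosen by lexicase selection.

    errors[i][j] is the error of individual i on case j (lower is better).
    This layout is an assumption made for illustration.
    """
    candidates = list(range(len(errors)))
    cases = list(range(len(errors[0])))
    rng.shuffle(cases)  # a fresh random case ordering for each selection event

    for case in cases:
        # Keep only the candidates that are best on the current case.
        best = min(errors[i][case] for i in candidates)
        candidates = [i for i in candidates if errors[i][case] == best]
        if len(candidates) == 1:
            break

    # Any candidates still tied after all cases are equally good; pick one at random.
    return rng.choice(candidates)


if __name__ == "__main__":
    # Three individuals, three cases. Individuals 0 and 1 are each a
    # specialist on one case; individual 2 is never strictly best.
    errors = [
        [0, 1, 1],
        [1, 0, 1],
        [1, 1, 1],
    ]
    picks = [lexicase_select(errors, random.Random(seed)) for seed in range(10)]
    print(picks)
```

Because each selection event filters on cases in a new random order, individuals that excel on different subsets of cases can all be chosen as parents, whereas an individual that is never strictly best on any case (index 2 above) is only selected when it ties with everyone who remains.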
