Problem-solving benefits of down-sampled lexicase selection

06/10/2021
by Thomas Helmuth, et al.

In genetic programming, an evolutionary method for producing computer programs that solve specified computational problems, parent selection is ordinarily based on aggregate measures of performance across an entire training set. Lexicase selection, by contrast, selects on the basis of performance on random sequences of training cases; this has been shown to enhance problem-solving power in many circumstances. Lexicase selection can also be seen as better reflecting biological evolution, by modeling sequences of challenges that organisms face over their lifetimes. Recent work has demonstrated that the advantages of lexicase selection can be amplified by down-sampling, meaning that only a random subsample of the training cases is used each generation. This can be seen as modeling the fact that individual organisms encounter only subsets of the possible environments, and that environments change over time. Here we provide the most extensive benchmarking of down-sampled lexicase selection to date, showing that its benefits hold up to increased scrutiny. The reasons that down-sampling helps, however, are not yet fully understood. Hypotheses include that down-sampling allows for more generations to be processed with the same budget of program evaluations; that the variation of training data across generations acts as a changing environment, encouraging adaptation; or that it reduces overfitting, leading to more general solutions. We systematically evaluate these hypotheses, finding evidence against all three, and instead draw the conclusion that down-sampled lexicase selection's main benefit stems from the fact that it allows the evolutionary process to examine more individuals within the same computational budget, even though each individual is examined less completely.
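The selection mechanism described above can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the error-matrix layout (`errors[i][c]` is individual `i`'s error on case `c`, lower is better), and the `rate` parameter are all assumptions made for illustration. For brevity the sketch indexes into a full error matrix, whereas a real down-sampled system would evaluate each individual only on the sampled cases, which is where the savings in program evaluations come from.

```python
import random


def lexicase_select(population, errors, case_ids, rng):
    """Select one parent by filtering on training cases in random order.

    errors[i][c] is the (assumed) error of individual i on case c.
    """
    candidates = list(range(len(population)))
    order = list(case_ids)
    rng.shuffle(order)  # a fresh random sequence of cases per selection event
    for case in order:
        best = min(errors[i][case] for i in candidates)
        # Keep only the individuals that are best on this case.
        candidates = [i for i in candidates if errors[i][case] == best]
        if len(candidates) == 1:
            break
    # Break any remaining ties at random.
    return population[rng.choice(candidates)]


def down_sample(num_cases, rate, rng):
    """Draw the random subsample of case indices used for one generation."""
    k = max(1, round(num_cases * rate))
    return rng.sample(range(num_cases), k)


def evaluations_per_generation(pop_size, num_cases, rate):
    """Program evaluations spent in one generation at a given sampling rate."""
    return pop_size * max(1, round(num_cases * rate))


if __name__ == "__main__":
    rng = random.Random(0)
    population = ["prog_a", "prog_b", "prog_c"]  # hypothetical individuals
    errors = [[0, 1, 1, 0], [1, 0, 1, 0], [1, 1, 0, 0]]
    sample = down_sample(num_cases=4, rate=0.5, rng=rng)
    parent = lexicase_select(population, errors, sample, rng)
    print(sample, parent)
```

Under these assumptions, a population of 1,000 evaluated on 100 cases costs 100,000 program evaluations per generation, while a sampling rate of 0.1 costs 10,000. The same budget therefore covers ten times as many individuals, each examined on fewer cases, which is the trade-off the abstract identifies as the main benefit.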
