Analyzing the Interaction Between Down-Sampling and Selection

04/14/2023
by Ryan Boldi, et al.
Genetic programming systems often use large training sets to evaluate the quality of candidate solutions for selection. However, evaluating populations on large training sets can be computationally expensive. Down-sampling training sets has long been used to decrease the computational cost of evaluation in a wide range of application domains. Indeed, recent studies have shown that both random and informed down-sampling can substantially improve problem-solving success for GP systems that use the lexicase parent selection algorithm. We use the PushGP framework to experimentally test whether these down-sampling techniques can also improve problem-solving success in the context of two other commonly used selection methods, fitness-proportionate and tournament selection, across eight GP problems (four program synthesis and four symbolic regression). We verified that down-sampling can benefit the problem-solving success of both fitness-proportionate and tournament selection. However, the number of problems on which down-sampling improved problem-solving success varied by selection scheme, suggesting that the impact of down-sampling depends on both the problem and the choice of selection scheme. Surprisingly, we found that down-sampling was more consistently beneficial when combined with lexicase selection than with tournament or fitness-proportionate selection. Overall, our results suggest that down-sampling should be considered more often when solving test-based GP problems.
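To make the setup in the abstract concrete, the following is a minimal Python sketch of random down-sampling combined with the three selection methods it names. It is not the paper's PushGP implementation: the function names, the toy population of callables, the 25% sampling rate, and the `1 / (1 + total error)` weighting used for fitness-proportionate selection are all assumptions chosen for illustration, and informed down-sampling is not shown.

```python
import random

def random_down_sample(cases, rate, rng):
    """Return a random subset of the training cases (at least one case)."""
    k = max(1, round(len(cases) * rate))
    return rng.sample(cases, k)

def errors_on(individual, cases):
    """Absolute error of a candidate (here a plain callable) on each case."""
    return [abs(individual(x) - y) for x, y in cases]

def tournament_select(population, error_vectors, size, rng):
    """Draw `size` distinct candidates; return the one with lowest total error."""
    entrants = rng.sample(range(len(population)), size)
    best = min(entrants, key=lambda i: sum(error_vectors[i]))
    return population[best]

def fitness_proportionate_select(population, error_vectors, rng):
    """Sample one parent with probability proportional to an assumed fitness.

    The mapping 1 / (1 + total error) is an illustrative assumption; it only
    needs to give lower-error candidates larger weights.
    """
    weights = [1.0 / (1.0 + sum(errs)) for errs in error_vectors]
    return rng.choices(population, weights=weights, k=1)[0]

def lexicase_select(population, error_vectors, rng):
    """Filter candidates case by case, in random order, keeping the best each time."""
    pool = list(range(len(population)))
    order = list(range(len(error_vectors[0])))
    rng.shuffle(order)
    for case in order:
        best = min(error_vectors[i][case] for i in pool)
        pool = [i for i in pool if error_vectors[i][case] == best]
        if len(pool) == 1:
            break
    return population[rng.choice(pool)]

# Toy symbolic-regression setting (hypothetical): the target is y = 2x + 1.
rng = random.Random(0)
cases = [(x, 2 * x + 1) for x in range(1, 21)]
population = [lambda x: x, lambda x: 2 * x, lambda x: 2 * x + 1, lambda x: 3 * x + 1]

# Each generation, only the down-sampled cases are evaluated.
sample = random_down_sample(cases, 0.25, rng)
error_vectors = [errors_on(ind, sample) for ind in population]

parent_t = tournament_select(population, error_vectors, size=len(population), rng=rng)
parent_f = fitness_proportionate_select(population, error_vectors, rng)
parent_l = lexicase_select(population, error_vectors, rng)
```

In this sketch each candidate is evaluated on 5 of the 20 cases per generation, so evaluation costs roughly a quarter of a full pass. A real system would draw a fresh sample every generation and could spend the saved evaluations on more generations or a larger population.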


research
01/04/2023

Informed Down-Sampled Lexicase Selection: Identifying productive training cases for efficient problem solving

Genetic Programming (GP) often uses large training sets and requires all...
research
07/07/2015

Developing Postfix-GP Framework for Symbolic Regression Problems

This paper describes Postfix-GP system, postfix notation based Genetic P...
research
04/04/2023

A Static Analysis of Informed Down-Samples

We present an analysis of the loss of population-level test coverage ind...
research
06/10/2021

Problem-solving benefits of down-sampled lexicase selection

In genetic programming, an evolutionary method for producing computer pr...
research
05/31/2022

The Environmental Discontinuity Hypothesis for Down-Sampled Lexicase Selection

Down-sampling training data has long been shown to improve the generaliz...
research
06/29/2020

Applying Dynamic Training-Subset Selection Methods Using Genetic Programming for Forecasting Implied Volatility

Volatility is a key variable in option pricing, trading and hedging stra...
research
05/25/2021

Speed Benchmarking of Genetic Programming Frameworks

Genetic Programming (GP) is known to suffer from the burden of being com...
