Simple Simultaneous Ensemble Learning in Genetic Programming

09/13/2020
by   Marco Virgolin, et al.
0

Learning ensembles by bagging can substantially improve the generalization performance of low-bias high-variance estimators, including those evolved by Genetic Programming (GP). Yet, the best way to learn ensembles in GP remains to be determined. This work attempts to fill the gap between existing GP ensemble learning algorithms, which are often either simple but expensive, or efficient but complex. We propose a new algorithm that is both simple and efficient, named Simple Simultaneous Ensemble Genetic Programming (2SEGP). 2SEGP is obtained by relatively minor modifications to fitness evaluation and selection of a classic GP algorithm, and its only drawback is an (arguably small) increase of the fitness evaluation cost from the classic 𝒪(n ℓ) to 𝒪(n(ℓ + β)), with n the number of observations and ℓ/β the estimator/ensemble size. Experimental comparisons on real-world datasets between supervised classification and regression show that, despite its simplicity, 2SEGP fares very well against state-of-the-art (ensemble and not) GP algorithms. We further provide insights into what matters in 2SEGP by (i) scaling β, (ii) ablating the proposed selection method, (iii) observing the evolvability induced by traditional subtree variation.

READ FULL TEXT
research
01/21/2020

Ensemble Genetic Programming

Ensemble learning is a powerful paradigm that has been usedin the top st...
research
01/23/2018

Pruning Techniques for Mixed Ensembles of Genetic Programming Models

The objective of this paper is to define an effective strategy for build...
research
04/03/2019

Model-based Genetic Programming with GOMEA for Symbolic Regression of Small Expressions

The Gene-pool Optimal Mixing Evolutionary Algorithm (GOMEA) has been sho...
research
04/03/2019

A Model-based Genetic Programming Approach for Symbolic Regression of Small Expressions

The Gene-pool Optimal Mixing Evolutionary Algorithm (GOMEA) is a model-b...
research
04/11/2017

Improving Fitness Functions in Genetic Programming for Classification on Unbalanced Credit Card Datasets

Credit card fraud detection based on machine learning has recently attra...
research
09/19/2018

Exploiting Tournament Selection for Efficient Parallel Genetic Programming

Genetic Programming (GP) is a computationally intensive technique which ...
research
06/29/2020

Applying Dynamic Training-Subset Selection Methods Using Genetic Programming for Forecasting Implied Volatility

Volatility is a key variable in option pricing, trading and hedging stra...

Please sign up or login with your details

Forgot password? Click here to reset