Pruning Techniques for Mixed Ensembles of Genetic Programming Models

01/23/2018
by   Mauro Castelli, et al.
0

The objective of this paper is to define an effective strategy for building an ensemble of Genetic Programming (GP) models. Ensemble methods are widely used in machine learning due to their features: they average out biases, they reduce the variance and they usually generalize better than single models. Despite these advantages, building ensemble of GP models is not a well-developed topic in the evolutionary computation community. To fill this gap, we propose a strategy that blends individuals produced by standard syntax-based GP and individuals produced by geometric semantic genetic programming, one of the newest semantics-based method developed in GP. In fact, recent literature showed that combining syntax and semantics could improve the generalization ability of a GP model. Additionally, to improve the diversity of the GP models used to build up the ensemble, we propose different pruning criteria that are based on correlation and entropy, a commonly used measure in information theory. Experimental results,obtained over different complex problems, suggest that the pruning criteria based on correlation and entropy could be effective in improving the generalization ability of the ensemble model and in reducing the computational burden required to build it.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/06/2012

A Survey on Techniques of Improving Generalization Ability of Genetic Programming Solutions

In the field of empirical modeling using Genetic Programming (GP), it is...
research
01/21/2020

Ensemble Genetic Programming

Ensemble learning is a powerful paradigm that has been usedin the top st...
research
04/13/2013

Improving Generalization Ability of Genetic Programming: Comparative Study

In the field of empirical modeling using Genetic Programming (GP), it is...
research
09/13/2020

Simple Simultaneous Ensemble Learning in Genetic Programming

Learning ensembles by bagging can substantially improve the generalizati...
research
03/06/2021

Machine Learning versus Mathematical Model to Estimate the Transverse Shear Stress Distribution in a Rectangular Channel

One of the most important subjects of hydraulic engineering is the relia...
research
06/08/2021

GSGP-CUDA – a CUDA framework for Geometric Semantic Genetic Programming

Geometric Semantic Genetic Programming (GSGP) is a state-of-the-art mach...
research
09/25/2020

Semantic-based Distance Approaches in Multi-objective Genetic Programming

Semantics in the context of Genetic Program (GP) can be understood as th...

Please sign up or login with your details

Forgot password? Click here to reset