
Pruning Techniques for Mixed Ensembles of Genetic Programming Models

by Mauro Castelli, et al.
Università degli Studi di Milano-Bicocca

The objective of this paper is to define an effective strategy for building an ensemble of Genetic Programming (GP) models. Ensemble methods are widely used in machine learning because they average out biases, reduce variance, and usually generalize better than single models. Despite these advantages, building ensembles of GP models is not a well-developed topic in the evolutionary computation community. To fill this gap, we propose a strategy that blends individuals produced by standard syntax-based GP with individuals produced by geometric semantic genetic programming, one of the newest semantics-based methods developed in GP. Recent literature has shown that combining syntax and semantics could improve the generalization ability of a GP model. Additionally, to improve the diversity of the GP models used to build the ensemble, we propose different pruning criteria based on correlation and entropy, a commonly used measure in information theory. Experimental results, obtained over different complex problems, suggest that the pruning criteria based on correlation and entropy could be effective in improving the generalization ability of the ensemble model and in reducing the computational burden required to build it.
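To make the idea of correlation-based pruning concrete, the following is a minimal sketch and not the procedure from the paper, whose exact criteria are not given in the abstract. It assumes each candidate model is represented only by its vector of predictions on a shared set of training cases, that a greedy pass keeps a model only if its absolute Pearson correlation with every already-kept model stays below a threshold, and that the pruned ensemble outputs the plain average of the kept models. The names `pearson`, `prune_by_correlation`, `ensemble_mean`, and the threshold value 0.95 are all hypothetical choices made for illustration.

```python
from math import sqrt


def pearson(a, b):
    """Pearson correlation of two equal-length prediction vectors."""
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        # Assumption: a constant predictor is treated as uncorrelated.
        return 0.0
    return cov / sqrt(var_a * var_b)


def prune_by_correlation(predictions, threshold=0.95):
    """Greedy pruning (illustrative, not the paper's criterion).

    Visits models in order and keeps a model only if its absolute
    correlation with every previously kept model is below `threshold`.
    Returns the indices of the kept models.
    """
    kept = []
    for i, candidate in enumerate(predictions):
        if all(abs(pearson(candidate, predictions[j])) < threshold for j in kept):
            kept.append(i)
    return kept


def ensemble_mean(predictions, kept):
    """Average the kept models' predictions case by case."""
    n_cases = len(predictions[0])
    return [sum(predictions[j][k] for j in kept) / len(kept) for k in range(n_cases)]


# Toy data: model 1 is a shifted copy of model 0, so it adds no diversity.
preds = [
    [1.0, 2.0, 3.0, 4.0],
    [1.1, 2.1, 3.1, 4.1],
    [4.0, 1.0, 3.0, 2.0],
]
kept = prune_by_correlation(preds)
combined = ensemble_mean(preds, kept)
```

In this toy run the redundant second model is dropped, so the ensemble averages fewer members, which is the kind of reduction in computational burden the abstract refers to. An entropy-based criterion would replace `pearson` with a different diversity measure; its definition would have to come from the full paper.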




A Survey on Techniques of Improving Generalization Ability of Genetic Programming Solutions

In the field of empirical modeling using Genetic Programming (GP), it is...

Ensemble Genetic Programming

Ensemble learning is a powerful paradigm that has been used in the top st...

Improving Generalization Ability of Genetic Programming: Comparative Study

In the field of empirical modeling using Genetic Programming (GP), it is...

Simple Simultaneous Ensemble Learning in Genetic Programming

Learning ensembles by bagging can substantially improve the generalizati...

Semantic-based Distance Approaches in Multi-objective Genetic Programming

Semantics in the context of Genetic Program (GP) can be understood as th...

How Noisy Data Affects Geometric Semantic Genetic Programming

Noise is a consequence of acquiring and pre-processing data from the env...