Evolvability Degeneration in Multi-Objective Genetic Programming for Symbolic Regression

02/14/2022
by   Dazhuang Liu, et al.
0

Genetic programming (GP) is one of the best approaches today to discover symbolic regression models. To find models that trade off accuracy and complexity, the non-dominated sorting genetic algorithm II (NSGA-II) is widely used. Unfortunately, it has been shown that NSGA-II can be inefficient: in early generations, low-complexity models over-replicate and take over most of the population. Consequently, studies have proposed different approaches to promote diversity. Here, we study the root of this problem, in order to design a superior approach. We find that the over-replication of low complexity-models is due to a lack of evolvability, i.e., the inability to produce offspring with improved accuracy. We therefore extend NSGA-II to track, over time, the evolvability of models of different levels of complexity. With this information, we limit how many models of each complexity level are allowed to survive the generation. We compare this new version of NSGA-II, evoNSGA-II, with the use of seven existing multi-objective GP approaches on ten widely-used data sets, and find that evoNSGA-II is equal or superior to using these approaches in almost all comparisons. Furthermore, our results confirm that evoNSGA-II behaves as intended: models that are more evolvable form the majority of the population.

READ FULL TEXT

page 13

page 14

research
06/10/2022

Highlights of Semantics in Multi-objective Genetic Programming

Semantics is a growing area of research in Genetic programming (GP) and ...
research
01/03/2019

An Improved multi-objective genetic algorithm based on orthogonal design and adaptive clustering pruning strategy

Two important characteristics of multi-objective evolutionary algorithms...
research
09/01/2021

Complexity Measures for Multi-objective Symbolic Regression

Multi-objective symbolic regression has the advantage that while the acc...
research
07/20/2021

Using Shape Constraints for Improving Symbolic Regression Models

We describe and analyze algorithms for shape-constrained symbolic regres...
research
11/24/2011

A GP-MOEA/D Approach for Modelling Total Electron Content over Cyprus

Vertical Total Electron Content (vTEC) is an ionospheric characteristic ...
research
05/06/2021

Semantics in Multi-objective Genetic Programming

Semantics has become a key topic of research in Genetic Programming (GP)...
research
03/24/2022

Multi-modal multi-objective model-based genetic programming to find multiple diverse high-quality models

Explainable artificial intelligence (XAI) is an important and rapidly ex...

Please sign up or login with your details

Forgot password? Click here to reset