How Noisy Data Affects Geometric Semantic Genetic Programming

07/04/2017
by   Luis F. Miranda, et al.
0

Noise is a consequence of acquiring and pre-processing data from the environment, and shows fluctuations from different sources---e.g., from sensors, signal processing technology or even human error. As a machine learning technique, Genetic Programming (GP) is not immune to this problem, which the field has frequently addressed. Recently, Geometric Semantic Genetic Programming (GSGP), a semantic-aware branch of GP, has shown robustness and high generalization capability. Researchers believe these characteristics may be associated with a lower sensibility to noisy data. However, there is no systematic study on this matter. This paper performs a deep analysis of the GSGP performance over the presence of noise. Using 15 synthetic datasets where noise can be controlled, we added different ratios of noise to the data and compared the results obtained with those of a canonical GP. The results show that, as we increase the percentage of noisy instances, the generalization performance degradation is more pronounced in GSGP than GP. However, in general, GSGP is more robust to noise than GP in the presence of up to 10 noise, and presents no statistical difference for values higher than that in the test bed.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/07/2021

Solving classification problems using Traceless Genetic Programming

Traceless Genetic Programming (TGP) is a new Genetic Programming (GP) th...
research
06/08/2021

GSGP-CUDA – a CUDA framework for Geometric Semantic Genetic Programming

Geometric Semantic Genetic Programming (GSGP) is a state-of-the-art mach...
research
04/13/2013

Improving Generalization Ability of Genetic Programming: Comparative Study

In the field of empirical modeling using Genetic Programming (GP), it is...
research
08/31/2023

TurboGP: A flexible and advanced python based GP library

We introduce TurboGP, a Genetic Programming (GP) library fully written i...
research
01/30/2020

SGP-DT: Semantic Genetic Programming Based on Dynamic Targets

Semantic GP is a promising approach that introduces semantic awareness d...
research
06/29/2020

Dynamic Hedging using Generated Genetic Programming Implied Volatility Models

The purpose of this paper is to improve the accuracy of dynamic hedging ...
research
03/06/2021

Machine Learning versus Mathematical Model to Estimate the Transverse Shear Stress Distribution in a Rectangular Channel

One of the most important subjects of hydraulic engineering is the relia...

Please sign up or login with your details

Forgot password? Click here to reset