Active Learning in Genetic Programming: Guiding Efficient Data Collection for Symbolic Regression

07/31/2023
by   Nathan Haut, et al.
0

This paper examines various methods of computing uncertainty and diversity for active learning in genetic programming. We found that the model population in genetic programming can be exploited to select informative training data points by using a model ensemble combined with an uncertainty metric. We explored several uncertainty metrics and found that differential entropy performed the best. We also compared two data diversity metrics and found that correlation as a diversity metric performs better than minimum Euclidean distance, although there are some drawbacks that prevent correlation from being used on all problems. Finally, we combined uncertainty and diversity using a Pareto optimization approach to allow both to be considered in a balanced way to guide the selection of informative and unique data points for training.

READ FULL TEXT

page 13

page 17

page 19

research
09/08/2021

Active Learning by Acquiring Contrastive Examples

Common acquisition functions for active learning use either uncertainty ...
research
02/09/2022

Active Learning Improves Performance on Symbolic RegressionTasks in StackGP

In this paper we introduce an active learning method for symbolic regres...
research
01/07/2021

Diminishing Uncertainty within the Training Pool: Active Learning for Medical Image Segmentation

Active learning is a unique abstraction of machine learning techniques w...
research
04/12/2013

Modified Soft Brood Crossover in Genetic Programming

Premature convergence is one of the important issues while using Genetic...
research
05/16/2017

Ensemble of heterogeneous flexible neural trees using multiobjective genetic programming

Machine learning algorithms are inherently multiobjective in nature, whe...
research
02/03/2019

Online Diversity Control in Symbolic Regression via a Fast Hash-based Tree Similarity Measure

Diversity represents an important aspect of genetic programming, being d...
research
03/28/2018

Active Metric Learning for Supervised Classification

Clustering and classification critically rely on distance metrics that p...

Please sign up or login with your details

Forgot password? Click here to reset