Ensemble representation learning: an analysis of fitness and survival for wrapper-based genetic programming methods

03/20/2017
by   William La Cava, et al.
0

Recently we proposed a general, ensemble-based feature engineering wrapper (FEW) that was paired with a number of machine learning methods to solve regression problems. Here, we adapt FEW for supervised classification and perform a thorough analysis of fitness and survival methods within this framework. Our tests demonstrate that two fitness metrics, one introduced as an adaptation of the silhouette score, outperform the more commonly used Fisher criterion. We analyze survival methods and demonstrate that ϵ-lexicase survival works best across our test problems, followed by random survival which outperforms both tournament and deterministic crowding. We conduct a benchmark comparison to several classification methods using a large set of problems and show that FEW can improve the best classifier performance in several cases. We show that FEW generates consistent, meaningful features for a biomedical problem with different ML pairings.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/28/2021

Survival stacking: casting survival analysis as a classification problem

While there are many well-developed data science methods for classificat...
research
02/23/2023

A Statistical Learning Take on the Concordance Index for Survival Analysis

The introduction of machine learning (ML) techniques to the field of sur...
research
05/18/2020

Optimal survival trees ensemble

Recent studies have adopted an approach of selecting accurate and divers...
research
07/25/2023

Reinterpreting survival analysis in the universal approximator age

Survival analysis is an integral part of the statistical toolbox. Howeve...
research
04/17/2020

"Perchance to dream?": Assessing effect of dispersal strategies on the fitness of expanding populations

Unraveling patterns of animals' movements is important for understanding...
research
05/31/2013

Wavelet feature extraction and genetic algorithm for biomarker detection in colorectal cancer data

Biomarkers which predict patient's survival can play an important role i...
research
10/21/2022

Integrated Brier Score based Survival Cobra – A regression based approach

Recently Goswami et al. <cit.> introduced two novel implementations of c...

Please sign up or login with your details

Forgot password? Click here to reset