On Explaining Machine Learning Models by Evolving Crucial and Compact Features

07/04/2019
by   Marco Virgolin, et al.
0

Feature construction can substantially improve the accuracy of Machine Learning (ML) algorithms. Genetic Programming (GP) has been proven to be effective at this task by evolving non-linear combinations of input features. GP additionally has the potential to improve ML explainability since explicit expressions are evolved. Yet, in most GP works the complexity of evolved features is not explicitly bound or minimized though this is arguably key for explainability. In this article, we assess to what extent GP still performs favorably at feature construction when constructing features that are (1) Of small-enough number, to enable visualization of the behavior of the ML model; (2) Of small-enough size, to enable interpretability of the features themselves; (3) Of sufficient informative power, to retain or even improve the performance of the ML algorithm. We consider a simple feature construction scheme using three different GP algorithms, as well as random search, to evolve features for four ML algorithms, including support vector machines and random forest. Our results on 20 datasets pertaining to classification and regression problems show that constructing only two compact features can be sufficient to rival the use of the entire original feature set. We further find that a modern GP algorithm, GP-GOMEA, performs best overall. These results, combined with examples that we provide of readable constructed features and of 2D visualizations of ML behavior, lead us to positively conclude that GP-based feature construction still works well when explicitly searching for compact features, making it extremely helpful to explain ML models.

READ FULL TEXT
research
03/24/2022

Multi-modal multi-objective model-based genetic programming to find multiple diverse high-quality models

Explainable artificial intelligence (XAI) is an important and rapidly ex...
research
05/18/2020

Applying Genetic Programming to Improve Interpretability in Machine Learning Models

Explainable Artificial Intelligence (or xAI) has become an important res...
research
09/09/2019

Machine learning for automatic construction of pseudo-realistic pediatric abdominal phantoms

Machine Learning (ML) is proving extremely beneficial in many healthcare...
research
04/03/2019

Model-based Genetic Programming with GOMEA for Symbolic Regression of Small Expressions

The Gene-pool Optimal Mixing Evolutionary Algorithm (GOMEA) has been sho...
research
08/28/2023

Buy when? Survival machine learning model comparison for purchase timing

The value of raw data is unlocked by converting it into information and ...
research
02/16/2021

An Implementation of Vector Quantization using the Genetic Algorithm Approach

The application of machine learning(ML) and genetic programming(GP) to t...

Please sign up or login with your details

Forgot password? Click here to reset