Multi-Objective Optimization of Performance and Interpretability of Tabular Supervised Machine Learning Models

07/17/2023
by   Lennart Schneider, et al.
0

We present a model-agnostic framework for jointly optimizing the predictive performance and interpretability of supervised machine learning models for tabular data. Interpretability is quantified via three measures: feature sparsity, interaction sparsity of features, and sparsity of non-monotone feature effects. By treating hyperparameter optimization of a machine learning algorithm as a multi-objective optimization problem, our framework allows for generating diverse models that trade off high performance and ease of interpretability in a single optimization run. Efficient optimization is achieved via augmentation of the search space of the learning algorithm by incorporating feature selection, interaction and monotonicity constraints into the hyperparameter search space. We demonstrate that the optimization problem effectively translates to finding the Pareto optimal set of groups of selected features that are allowed to interact in a model, along with finding their optimal monotonicity constraints and optimal hyperparameters of the learning algorithm itself. We then introduce a novel evolutionary algorithm that can operate efficiently on this augmented search space. In benchmark experiments, we show that our framework is capable of finding diverse models that are highly competitive or outperform state-of-the-art XGBoost or Explainable Boosting Machine models, both with respect to performance and interpretability.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/30/2019

Model-Agnostic Approaches to Multi-Objective Simultaneous Hyperparameter Tuning and Feature Selection

Highly non-linear machine learning algorithms have the capacity to handl...
research
06/15/2022

Multi-Objective Hyperparameter Optimization – An Overview

Hyperparameter optimization constitutes a large part of typical modern m...
research
07/17/2023

Cross Feature Selection to Eliminate Spurious Interactions and Single Feature Dominance Explainable Boosting Machines

Interpretability is a crucial aspect of machine learning models that ena...
research
11/10/2022

A metaheuristic multi-objective interaction-aware feature selection method

Multi-objective feature selection is one of the most significant issues ...
research
11/20/2020

A Dynamic Locality Multi-Objective Salp Swarm Algorithm for Feature Selection

Developing intelligent analytical tools requires pre-processing data and...
research
03/20/2023

A fuzzy adaptive evolutionary-based feature selection and machine learning framework for single and multi-objective body fat prediction

Predicting body fat can provide medical practitioners and users with ess...
research
10/25/2017

Feature learning in feature-sample networks using multi-objective optimization

Data and knowledge representation are fundamental concepts in machine le...

Please sign up or login with your details

Forgot password? Click here to reset