Interactive Decision Tree Creation and Enhancement with Complete Visualization for Explainable Modeling

To increase the interpretability and prediction accuracy of the Machine Learning (ML) models, visualization of ML models is a key part of the ML process. Decision Trees (DTs) are essential in machine learning (ML) because they are used to understand many black box ML models including Deep Learning models. In this research, two new methods for creation and enhancement with complete visualizing Decision Trees as understandable models are suggested. These methods use two versions of General Line Coordinates (GLC): Bended Coordinates (BC) and Shifted Paired Coordinates (SPC). The Bended Coordinates are a set of line coordinates, where each coordinate is bended in a threshold point of the respective DT node. In SPC, each n-D point is visualized in a set of shifted pairs of 2-D Cartesian coordinates as a directed graph. These new methods expand and complement the capabilities of existing methods to visualize DT models more completely. These capabilities allow us to observe and analyze: (1) relations between attributes, (2) individual cases relative to the DT structure, (3) data flow in the DT, (4) sensitivity of each split threshold in the DT nodes, and (5) density of cases in parts of the n-D space. These features are critical for DT models' performance evaluation and improvement by domain experts and end users as they help to prevent overgeneralization and overfitting of the models. The advantages of this methodology are illustrated in the case studies on benchmark real-world datasets. The paper also demonstrates how to generalize them for decision tree visualizations in different General Line Coordinates.

READ FULL TEXT

page 12

page 14

page 18

page 19

page 20

page 22

page 23

page 25

research
09/19/2022

TimberTrek: Exploring and Curating Sparse Decision Trees with Interactive Visualization

Given thousands of equally accurate machine learning (ML) models, how ca...
research
06/14/2021

Discovering Interpretable Machine Learning Models in Parallel Coordinates

This paper contributes to interpretable machine learning via visual know...
research
06/14/2021

Full interpretable machine learning in 2D with inline coordinates

This paper proposed a new methodology for machine learning in 2-dimensio...
research
12/01/2021

VisRuler: Visual Analytics for Extracting Decision Rules from Bagged and Boosted Decision Trees

Bagging and boosting are two popular ensemble methods in machine learnin...
research
03/31/2023

DeforestVis: Behavior Analysis of Machine Learning Models with Surrogate Decision Stumps

As the complexity of machine learning (ML) models increases and the appl...
research
06/10/2023

Interpretable Differencing of Machine Learning Models

Understanding the differences between machine learning (ML) models is of...
research
04/24/2023

Incorporating Experts' Judgment into Machine Learning Models

Machine learning (ML) models have been quite successful in predicting ou...

Please sign up or login with your details

Forgot password? Click here to reset