ConfusionFlow: A model-agnostic visualization for temporal analysis of classifier confusion

10/02/2019
by   Andreas Hinterreiter, et al.
0

Classifiers are among the most widely used supervised machine learning algorithms. Many classification models exist, and choosing the right one for a given task is difficult. During model selection and debugging, data scientists need to asses classifier performance, evaluate the training behavior over time, and compare different models. Typically, this analysis is based on single-number performance measures such as accuracy. A more detailed evaluation of classifiers is possible by inspecting class errors. The confusion matrix is an established way for visualizing these class errors, but it was not designed with temporal or comparative analysis in mind. More generally, established performance analysis systems do not allow a combined temporal and comparative analysis of class-level information. To address this issue, we propose ConfusionFlow, an interactive, comparative visualization tool that combines the benefits of class confusion matrices with the visualization of performance characteristics over time. ConfusionFlow is model-agnostic and can be used to compare performances for different model types, model architectures, and/or training and test datasets. We demonstrate the usefulness of ConfusionFlow in the context of two practical problems: an analysis of the influence of network pruning on model errors, and a case study on instance selection strategies in active learning.

READ FULL TEXT

page 4

page 13

research
07/22/2020

InstanceFlow: Visualizing the Evolution of Classifier Confusion on the Instance Level

Classification is one of the most important supervised machine learning ...
research
08/09/2021

Probabilistic Active Learning for Active Class Selection

In machine learning, active class selection (ACS) algorithms aim to acti...
research
11/09/2021

An Interactive Visualization Tool for Understanding Active Learning

Despite recent progress in artificial intelligence and machine learning,...
research
04/16/2020

Boxer: Interactive Comparison of Classifier Results

Machine learning practitioners often compare the results of different cl...
research
12/31/2021

DeepVisualInsight: Time-Travelling Visualization for Spatio-Temporal Causality of Deep Classification Training

Understanding how the predictions of deep learning models are formed dur...
research
11/14/2022

Model Evaluation in Medical Datasets Over Time

Machine learning models deployed in healthcare systems face data drawn f...

Please sign up or login with your details

Forgot password? Click here to reset