Neo: Generalizing Confusion Matrix Visualization to Hierarchical and Multi-Output Labels

10/24/2021
by   Jochen Görtler, et al.
13

The confusion matrix, a ubiquitous visualization for helping people evaluate machine learning models, is a tabular layout that compares predicted class labels against actual class labels over all data instances. We conduct formative research with machine learning practitioners at a large technology company and find that conventional confusion matrices do not support more complex data-structures found in modern-day applications, such as hierarchical and multi-output labels. To express such variations of confusion matrices, we design an algebra that models confusion matrices as probability distributions. Based on this algebra, we develop Neo, a visual analytics system that enables practitioners to flexibly author and interact with hierarchical and multi-output confusion matrices, visualize derived metrics, renormalize confusions, and share matrix specifications. Finally, we demonstrate Neo's utility with three case studies that help people better understand model performance and reveal hidden confusions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/17/2019

Explaining Vulnerabilities to Adversarial Machine Learning through Visual Analytics

Machine learning models are currently being deployed in a variety of rea...
research
07/27/2022

Calibrate: Interactive Analysis of Probabilistic Model Output

Analyzing classification model performance is a crucial task for machine...
research
02/22/2018

The State of the Art in Integrating Machine Learning into Visual Analytics

Visual analytics systems combine machine learning or other analytic tech...
research
02/08/2021

Learning with Density Matrices and Random Features

A density matrix describes the statistical state of a quantum system. It...
research
08/07/2023

My Model is Unfair, Do People Even Care? Visual Design Affects Trust and Perceived Bias in Machine Learning

Machine learning technology has become ubiquitous, but, unfortunately, o...
research
01/30/2022

ClassSPLOM – A Scatterplot Matrix to Visualize Separation of Multiclass Multidimensional Data

In multiclass classification of multidimensional data, the user wants to...

Please sign up or login with your details

Forgot password? Click here to reset