Class-constrained t-SNE: Combining Data Features and Class Probabilities

08/26/2023
by   Linhao Meng, et al.
0

Data features and class probabilities are two main perspectives when, e.g., evaluating model results and identifying problematic items. Class probabilities represent the likelihood that each instance belongs to a particular class, which can be produced by probabilistic classifiers or even human labeling with uncertainty. Since both perspectives are multi-dimensional data, dimensionality reduction (DR) techniques are commonly used to extract informative characteristics from them. However, existing methods either focus solely on the data feature perspective or rely on class probability estimates to guide the DR process. In contrast to previous work where separate views are linked to conduct the analysis, we propose a novel approach, class-constrained t-SNE, that combines data features and class probabilities in the same DR result. Specifically, we combine them by balancing two corresponding components in a cost function to optimize the positions of data points and iconic representation of classes – class landmarks. Furthermore, an interactive user-adjustable parameter balances these two components so that users can focus on the weighted perspectives of interest and also empowers a smooth visual transition between varying perspectives to preserve the mental map. We illustrate its application potential in model evaluation and visual-interactive labeling. A comparative analysis is performed to evaluate the DR results.

READ FULL TEXT

page 2

page 3

page 4

page 6

page 7

page 8

page 9

page 12

research
06/29/2021

Interactive Dimensionality Reduction for Comparative Analysis

Finding the similarities and differences between groups of datasets is a...
research
05/10/2019

An Incremental Dimensionality Reduction Method for Visualizing Streaming Multidimensional Data

Dimensionality reduction (DR) methods are commonly used for analyzing an...
research
03/09/2021

Explaining dimensionality reduction results using Shapley values

Dimensionality reduction (DR) techniques have been consistently supporti...
research
01/15/2021

Multi-point dimensionality reduction to improve projection layout reliability

In ordinary Dimensionality Reduction (DR), each data instance in an m-di...
research
08/01/2023

Classes are not Clusters: Improving Label-based Evaluation of Dimensionality Reduction

A common way to evaluate the reliability of dimensionality reduction (DR...
research
08/30/2020

ChemVA: Interactive Visual Analysis of Chemical Compound Similarity in Virtual Screening

In the modern drug discovery process, medicinal chemists deal with the c...
research
10/31/2018

Dimensionality Reduction has Quantifiable Imperfections: Two Geometric Bounds

In this paper, we investigate Dimensionality reduction (DR) maps in an i...

Please sign up or login with your details

Forgot password? Click here to reset