DumbleDR: Predicting User Preferences of Dimensionality Reduction Projection Quality

05/19/2021
by   Cristina Morariu, et al.
0

A plethora of dimensionality reduction techniques have emerged over the past decades, leaving researchers and analysts with a wide variety of choices for reducing their data, all the more so given some techniques come with additional parametrization (e.g. t-SNE, UMAP, etc.). Recent studies are showing that people often use dimensionality reduction as a black-box regardless of the specific properties the method itself preserves. Hence, evaluating and comparing 2D projections is usually qualitatively decided, by setting projections side-by-side and letting human judgment decide which projection is the best. In this work, we propose a quantitative way of evaluating projections, that nonetheless places human perception at the center. We run a comparative study, where we ask people to select 'good' and 'misleading' views between scatterplots of low-level projections of image datasets, simulating the way people usually select projections. We use the study data as labels for a set of quality metrics whose purpose is to discover and quantify what exactly people are looking for when deciding between projections. With this proxy for human judgments, we use it to rank projections on new datasets, explain why they are relevant, and quantify the degree of subjectivity in projections selected.

READ FULL TEXT

page 6

page 7

page 9

page 11

page 14

research
02/22/2019

A Review, Framework and R toolkit for Exploring, Evaluating, and Comparing Visualizations

This paper gives a review and synthesis of methods of evaluating dimensi...
research
06/07/2021

HoroPCA: Hyperbolic Dimensionality Reduction via Horospherical Projections

This paper studies Principal Component Analysis (PCA) for data lying in ...
research
02/18/2020

Quantitative Evaluation of Time-Dependent Multidimensional Projection Techniques

Dimensionality reduction methods are an essential tool for multidimensio...
research
06/01/2023

ShaRP: Shape-Regularized Multidimensional Projections

Projections, or dimensionality reduction methods, are techniques of choi...
research
01/16/2013

Experiments with Random Projection

Recent theoretical work has identified random projection as a promising ...
research
07/28/2023

SAP-sLDA: An Interpretable Interface for Exploring Unstructured Text

A common way to explore text corpora is through low-dimensional projecti...
research
07/08/2022

Evaluating Systemic Error Detection Methods using Synthetic Images

We introduce SpotCheck, a framework for generating synthetic datasets to...

Please sign up or login with your details

Forgot password? Click here to reset