DumbleDR: Predicting User Preferences of Dimensionality Reduction Projection Quality

by Cristina Morariu, et al.

A plethora of dimensionality reduction techniques have emerged over the past decades, leaving researchers and analysts with a wide variety of choices for reducing their data, all the more so given that some techniques come with additional parametrization (e.g., t-SNE, UMAP). Recent studies show that people often use dimensionality reduction as a black box, regardless of the specific properties the method itself preserves. Hence, 2D projections are usually evaluated and compared qualitatively, by setting projections side by side and letting human judgment decide which projection is the best. In this work, we propose a quantitative way of evaluating projections that nonetheless places human perception at the center. We run a comparative study in which we ask people to select 'good' and 'misleading' views between scatterplots of low-level projections of image datasets, simulating the way people usually select projections. We use the study data as labels for a set of quality metrics whose purpose is to discover and quantify what exactly people are looking for when deciding between projections. We then use this proxy for human judgments to rank projections on new datasets, explain why they are relevant, and quantify the degree of subjectivity in the projections selected.
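The idea of using study choices as labels over quality metrics, then ranking new projections, can be illustrated with a minimal sketch. Everything specific here is an assumption for illustration and is not taken from the paper: the two metric columns (a hypothetical neighborhood-preservation score and a hypothetical clutter score), the toy numbers, and the choice of a Bradley-Terry-style logistic model fitted on pairwise metric differences. The abstract does not state which model or metrics the authors actually use.

```python
import math


def fit_preference_weights(pairs, n_features, lr=0.5, epochs=500):
    """Fit weights w so that sigmoid(w . (a - b)) models P(a is preferred over b).

    pairs: list of (metrics_of_preferred, metrics_of_other) tuples.
    This is a plain gradient-ascent logistic fit, used here only as a stand-in.
    """
    w = [0.0] * n_features
    for _ in range(epochs):
        grad = [0.0] * n_features
        for preferred, other in pairs:
            diff = [p - o for p, o in zip(preferred, other)]
            z = sum(wi * di for wi, di in zip(w, diff))
            prob = 1.0 / (1.0 + math.exp(-z))
            # Gradient of the log-likelihood when the first item was chosen.
            for j, dj in enumerate(diff):
                grad[j] += (1.0 - prob) * dj
        w = [wi + lr * g / len(pairs) for wi, g in zip(w, grad)]
    return w


def rank_projections(candidates, w):
    """Return candidate names sorted by predicted preference score, best first."""
    scores = {
        name: sum(wi * mi for wi, mi in zip(w, metrics))
        for name, metrics in candidates.items()
    }
    return sorted(scores, key=scores.get, reverse=True)


# Hypothetical study data: each pair is (metrics of the projection picked as
# 'good', metrics of the projection it was compared against).
# Columns: [neighborhood score, clutter score] -- both invented for this sketch.
study_pairs = [
    ([0.9, 0.2], [0.6, 0.5]),
    ([0.8, 0.1], [0.5, 0.4]),
    ([0.7, 0.3], [0.4, 0.6]),
]
weights = fit_preference_weights(study_pairs, n_features=2)

# Hypothetical projections of a new dataset, described by the same metrics.
new_projections = {
    "proj_a": [0.85, 0.15],
    "proj_b": [0.55, 0.45],
    "proj_c": [0.70, 0.30],
}
ranking = rank_projections(new_projections, weights)
print(weights, ranking)
```

In a sketch like this, the sign and size of each learned weight give a rough reading of which metric the toy "participants" rewarded or penalized, which mirrors the abstract's goal of explaining why a projection ranks where it does.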




