Contrastive analysis for scatter plot-based representations of dimensionality reduction

01/26/2021
by   Wilson E. Marcílio-Jr, et al.
7

Exploring multidimensional datasets is a ubiquitous part of the ones working with data, where interpreting clusters is one of the main tasks. These multidimensional datasets are usually encoded using scatter-plots representations, where spatial proximity encodes similarity among data samples. In the literature, techniques try to understand the scatter plot organization by visualizing the importance of the features for clusters definition with interaction and layout enrichment strategies. However, the approaches used to interpret dimensionality reduction usually do not differentiate clusters well, which hampers analysis where the focus is to understand the differences among clusters. This paper introduces a methodology to visually explore multidimensional datasets and interpret clusters' formation based on the contrastive analysis. We also introduce a bipartite graph to visually interpret and explore the relationship between the statistical variables used to understand how the attributes influenced cluster formation. Our methodology is validated through case studies. We explore a multivariate dataset of patients with vertebral problems and two document collections, one related to news articles and other related to tweets about COVID-19 symptoms. Finally, we also validate our approach through quantitative results to demonstrate how it can be robust enough to support multidimensional analysis.

READ FULL TEXT

page 6

page 7

page 8

page 10

page 12

page 13

page 14

page 15

research
03/09/2021

Explaining dimensionality reduction results using Shapley values

Dimensionality reduction (DR) techniques have been consistently supporti...
research
05/10/2019

Supporting Analysis of Dimensionality Reduction Results with Contrastive Learning

Dimensionality reduction (DR) is frequently used for analyzing and visua...
research
06/20/2021

ExplorerTree: a focus+context exploration approach for 2D embeddings

In exploratory tasks involving high-dimensional datasets, dimensionality...
research
02/21/2019

Deep Learning Multidimensional Projections

Dimensionality reduction methods, also known as projections, are frequen...
research
08/29/2023

Dimensionality Reduction Using pseudo-Boolean polynomials For Cluster Analysis

We introduce usage of a reduction property of penalty-based formulation ...
research
02/18/2020

Quantitative Evaluation of Time-Dependent Multidimensional Projection Techniques

Dimensionality reduction methods are an essential tool for multidimensio...
research
09/18/2023

Traffic Scene Similarity: a Graph-based Contrastive Learning Approach

Ensuring validation for highly automated driving poses significant obsta...

Please sign up or login with your details

Forgot password? Click here to reset