Modeling the Influence of Visual Density on Cluster Perception in Scatterplots Using Topology

07/27/2020
by   Ghulam Jilani Quadri, et al.
0

Scatterplots are used for a variety of visual analytics tasks, including cluster identification, and the visual encodings used on a scatterplot play a deciding role on the level of visual separation of clusters. For visualization designers, optimizing the visual encodings is crucial to maximizing the clarity of data. This requires accurately modeling human perception of cluster separation, which remains challenging. We present a multi-stage user study focusing on 4 factors—distribution size of clusters, number of points, size of points, and opacity of points—that influence cluster identification in scatterplots. From these parameters, we have constructed 2 models, a distance-based model, and a density-based model, using the merge tree data structure from Topological Data Analysis. Our analysis demonstrates that these factors play an important role in the number of clusters perceived, and it verifies that the distance-based and density-based models can reasonably estimate the number of clusters a user observes. Finally, we demonstrate how these models can be used to optimize visual encodings on real-world data.

READ FULL TEXT

page 1

page 9

research
07/07/2022

Automatic Scatterplot Design Optimization for Clustering Identification

Scatterplots are among the most widely used visualization techniques. Co...
research
08/01/2023

CLAMS: A Cluster Ambiguity Measure for Estimating Perceptual Variability in Visual Clustering

Visual clustering is a common perceptual task in scatterplots that suppo...
research
07/20/2015

A Parameter-free Affinity Based Clustering

Several methods have been proposed to estimate the number of clusters in...
research
08/26/2016

Estimating the Number of Clusters via Normalized Cluster Instability

We improve existing instability-based methods for the selection of the n...
research
04/27/2023

ClusterNet: A Perception-Based Clustering Model for Scattered Data

Cluster separation in scatterplots is a task that is typically tackled b...
research
04/17/2018

Classifying Antimicrobial and Multifunctional Peptides with Bayesian Network Models

Bayesian network models are finding success in characterizing enzyme-cat...
research
10/22/2020

Efficient design of geographically-defined clusters with spatial autocorrelation

Clusters form the basis of a number of research study designs including ...

Please sign up or login with your details

Forgot password? Click here to reset