CLAMS: A Cluster Ambiguity Measure for Estimating Perceptual Variability in Visual Clustering

08/01/2023
by   Hyeon Jeon, et al.
0

Visual clustering is a common perceptual task in scatterplots that supports diverse analytics tasks (e.g., cluster identification). However, even with the same scatterplot, the ways of perceiving clusters (i.e., conducting visual clustering) can differ due to the differences among individuals and ambiguous cluster boundaries. Although such perceptual variability casts doubt on the reliability of data analysis based on visual clustering, we lack a systematic way to efficiently assess this variability. In this research, we study perceptual variability in conducting visual clustering, which we call Cluster Ambiguity. To this end, we introduce CLAMS, a data-driven visual quality measure for automatically predicting cluster ambiguity in monochrome scatterplots. We first conduct a qualitative study to identify key factors that affect the visual separation of clusters (e.g., proximity or size difference between clusters). Based on study findings, we deploy a regression module that estimates the human-judged separability of two clusters. Then, CLAMS predicts cluster ambiguity by analyzing the aggregated results of all pairwise separability between clusters that are generated by the module. CLAMS outperforms widely-used clustering techniques in predicting ground truth cluster ambiguity. Meanwhile, CLAMS exhibits performance on par with human annotators. We conclude our work by presenting two applications for optimizing and benchmarking data mining techniques using CLAMS. The interactive demo of CLAMS is available at clusterambiguity.dev.

READ FULL TEXT
research
04/27/2023

ClusterNet: A Perception-Based Clustering Model for Scattered Data

Cluster separation in scatterplots is a task that is typically tackled b...
research
11/28/2017

A fatal point concept and a low-sensitivity quantitative measure for traffic safety analytics

The variability of the clusters generated by clustering techniques in th...
research
07/27/2020

Modeling the Influence of Visual Density on Cluster Perception in Scatterplots Using Topology

Scatterplots are used for a variety of visual analytics tasks, including...
research
11/03/2019

Geono-Cluster: Interactive Visual Cluster Analysis for Biologists

Biologists often perform clustering analysis to derive meaningful patter...
research
09/20/2022

A Framework for Benchmarking Clustering Algorithms

The evaluation of clustering algorithms can be performed by running them...
research
09/06/2020

Designing for Ambiguity: Visual Analytics in Avalanche Forecasting

Ambiguity, an information state where multiple interpretations are plaus...
research
06/01/2021

ClustRank: a Visual Quality Measure Trained on Perceptual Data for Sorting Scatterplots by Cluster Patterns

Visual quality measures (VQMs) are designed to support analysts by autom...

Please sign up or login with your details

Forgot password? Click here to reset