ClustRank: a Visual Quality Measure Trained on Perceptual Data for Sorting Scatterplots by Cluster Patterns

06/01/2021
by   Mostafa Abbas, et al.

Visual quality measures (VQMs) are designed to support analysts by automatically detecting and quantifying patterns in visualizations. We propose a new data-driven technique called ClustRank that ranks scatterplots according to their visible grouping patterns. Our model first encodes each scatterplot in the parametric space of a Gaussian Mixture Model, and then uses a classifier trained on human judgment data to estimate the perceptual complexity of the grouping patterns. The number of initial mixture components and the number of final combined groups determine the rank of the scatterplot. ClustRank improves on existing VQM techniques by mimicking human judgments on two-Gaussian cluster patterns, and it is more accurate when ranking general cluster patterns in scatterplots. We demonstrate its benefit by analyzing kinship data for genome-wide association studies, a domain in which experts rely on the visual analysis of large sets of scatterplots. We make the three benchmark datasets and the ClustRank VQM available for practical use and further improvement.
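The merge-then-rank idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes that each scatterplot has already been summarized as a list of mixture components, written here as hypothetical `(x, y, spread)` tuples. The trained perceptual classifier is replaced by a simple distance rule (`toy_same_group`). The ordering by `(final groups, initial components)` is also an assumption, since the abstract only says that both numbers determine the rank.

```python
from itertools import combinations
from math import hypot


def count_merged_groups(components, same_group):
    """Merge components pairwise with union-find; return the number of groups left."""
    parent = list(range(len(components)))

    def find(i):
        # Follow parent links to the root, compressing the path on the way.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, j in combinations(range(len(components)), 2):
        if same_group(components[i], components[j]):
            parent[find(i)] = find(j)
    return len({find(i) for i in range(len(components))})


def toy_same_group(a, b):
    """Hypothetical stand-in for the trained classifier (NOT the paper's model):
    two components count as one group when their centers are closer than
    twice the sum of their spreads."""
    (ax, ay, a_spread), (bx, by, b_spread) = a, b
    return hypot(ax - bx, ay - by) < 2.0 * (a_spread + b_spread)


def rank_scatterplots(plots, same_group=toy_same_group):
    """Sort plot names by (final groups, initial components), highest first.
    This ordering is an assumption made for illustration."""
    scored = []
    for name, components in plots.items():
        groups = count_merged_groups(components, same_group)
        scored.append((groups, len(components), name))
    scored.sort(reverse=True)
    return [name for _, _, name in scored]


# Made-up example data: three scatterplots, each given as (x, y, spread) components.
example_plots = {
    "overlapping_pair": [(0.0, 0.0, 1.0), (1.0, 0.0, 1.0)],
    "separated_pair": [(0.0, 0.0, 1.0), (10.0, 0.0, 1.0)],
    "three_clusters": [(0.0, 0.0, 1.0), (10.0, 0.0, 1.0), (20.0, 0.0, 1.0)],
}
ranking = rank_scatterplots(example_plots)
```

In a real pipeline, the components would come from fitting a Gaussian Mixture Model to each scatterplot, and `same_group` would be a classifier trained on human judgments of two-Gaussian patterns, as the abstract describes.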

research
10/08/2020

Mixture-based estimation of entropy

The entropy is a measure of uncertainty that plays a central role in inf...
research
11/18/2020

Skewed Distributions or Transformations? Modelling Skewness for a Cluster Analysis

Because of its mathematical tractability, the Gaussian mixture model hol...
research
12/17/2020

Smoothed Gaussian Mixture Models for Video Classification and Recommendation

Cluster-and-aggregate techniques such as Vector of Locally Aggregated De...
research
07/11/2006

Interactive Hatching and Stippling by Example

We describe a system that lets a designer interactively draw patterns of...
research
11/03/2019

Geono-Cluster: Interactive Visual Cluster Analysis for Biologists

Biologists often perform clustering analysis to derive meaningful patter...
research
08/01/2023

CLAMS: A Cluster Ambiguity Measure for Estimating Perceptual Variability in Visual Clustering

Visual clustering is a common perceptual task in scatterplots that suppo...
research
12/24/2015

Measuring pattern retention in anonymized data -- where one measure is not enough

In this paper, we explore how modifying data to preserve privacy affects...
