Gatherplot: A Non-Overlapping Scatterplot

01/25/2023
by Deokgun Park, et al.

Scatterplots are a common tool for exploring multidimensional datasets, especially in the form of scatterplot matrices (SPLOMs). However, scatterplots suffer from overplotting when categorical variables are mapped to one or both axes, or when the same continuous variable is used for both axes. Previous methods such as histograms or violin plots rely on aggregation, which makes brushing and linking difficult. To address this, we propose gatherplots, an extension of scatterplots that manages the overplotting problem. Gatherplots are a form of unit visualization, which avoids aggregation and maintains the identity of individual objects to ease visual perception. In gatherplots, all visual marks that map to the same position coalesce into a packed entity, making it easier to see an overview of the data groupings. The size and aspect ratio of marks can also be changed dynamically to make it easier to compare the composition of different groups. For the case of a categorical variable plotted against another categorical variable, we propose a heuristic that decides bin sizes for optimal space usage. To validate our work, we conducted a crowdsourced user study, which shows that gatherplots enable people to assess data distributions more quickly and more accurately than jittered scatterplots do.
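
To make the idea of marks "coalescing into a packed entity" concrete, the following is a minimal sketch and not the authors' implementation. It assumes both axes are categorical, gives every category a cell of equal size (the paper's bin-size heuristic is not reproduced here), and packs the marks of each cell into a near-square grid. The function name `gather_layout` and its parameters are hypothetical.

```python
from collections import defaultdict
from math import ceil, sqrt


def gather_layout(records, x_key, y_key, cell_size=1.0):
    """Return one (x, y) position per record, packed in a grid inside its cell.

    Hypothetical helper: equal-sized cells are an assumption of this sketch.
    """
    # Group record indices by the (x category, y category) pair they map to.
    groups = defaultdict(list)
    for i, rec in enumerate(records):
        groups[(rec[x_key], rec[y_key])].append(i)

    # Give each category a column/row index in order of first appearance.
    x_cats = list(dict.fromkeys(rec[x_key] for rec in records))
    y_cats = list(dict.fromkeys(rec[y_key] for rec in records))

    positions = [None] * len(records)
    for (xc, yc), members in groups.items():
        # A near-square grid keeps every mark in the cell visible.
        cols = ceil(sqrt(len(members)))
        rows = ceil(len(members) / cols)
        step = cell_size / max(cols, rows)
        x0 = x_cats.index(xc) * cell_size
        y0 = y_cats.index(yc) * cell_size
        for k, i in enumerate(members):
            r, c = divmod(k, cols)
            # Place the mark at the centre of its grid slot.
            positions[i] = (x0 + (c + 0.5) * step, y0 + (r + 0.5) * step)
    return positions
```

In a plain scatterplot, all records sharing the same pair of categories would be drawn on a single point. Here each record keeps its own mark, so it remains individually selectable for brushing and linking, and the size of each packed block reflects the group's count.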


