VizNet: Towards A Large-Scale Visualization Learning and Benchmarking Repository

05/12/2019
by   Kevin Hu, et al.
0

Researchers currently rely on ad hoc datasets to train automated visualization tools and evaluate the effectiveness of visualization designs. These exemplars often lack the characteristics of real-world datasets, and their one-off nature makes it difficult to compare different techniques. In this paper, we present VizNet: a large-scale corpus of over 31 million datasets compiled from open data repositories and online visualization galleries. On average, these datasets comprise 17 records over 3 dimensions and across the corpus, we find 51 quantitative, and only 5 baseline for comparing visualization design techniques, and developing benchmark models and algorithms for automating visual analysis. To demonstrate VizNet's utility as a platform for conducting online crowdsourced experiments at scale, we replicate a prior study assessing the influence of user task and data distribution on visual encoding effectiveness, and extend it by considering an additional task: outlier detection. To contend with running such studies at scale, we demonstrate how a metric of perceptual effectiveness can be learned from experimental results, and show its predictive power across test datasets.

READ FULL TEXT

page 2

page 3

page 4

page 6

page 9

page 10

page 11

page 12

research
08/14/2018

VizML: A Machine Learning Approach to Visualization Recommendation

Data visualization should be accessible for all analysts with data, not ...
research
09/07/2020

VisCode: Embedding Information in Visualization Images using Encoder-Decoder Network

We present an approach called VisCode for embedding information into vis...
research
12/20/2022

The Risks of Ranking: Revisiting Graphical Perception to Model Individual Differences in Visualization Performance

Graphical perception studies typically measure visualization encoding ef...
research
01/23/2020

Phoenixmap: An Abstract Approach to Visualize 2D Spatial Distributions

The multidimensional nature of spatial data poses a challenge for visual...
research
09/08/2020

Improving Engagement of Animated Visualization with Visual Foreshadowing

Animated visualization is becoming increasingly popular as a compelling ...
research
11/30/2020

Toward a Benchmark Repository for Software Maintenance Tool Evaluations with Humans

To evaluate software maintenance techniques and tools in controlled expe...
research
06/18/2019

Selection Bias Tracking and Detailed Subset Comparison for High-Dimensional Data

The collection of large, complex datasets has become common across a wid...

Please sign up or login with your details

Forgot password? Click here to reset