VizML: A Machine Learning Approach to Visualization Recommendation

08/14/2018
by Kevin Z. Hu, et al.

Data visualization should be accessible to all analysts with data, not just the few with technical expertise. Visualization recommender systems aim to lower the barrier to exploring basic visualizations by automatically generating results for analysts to search and select, rather than requiring them to specify visualizations manually. Here, we demonstrate a novel machine learning-based approach to visualization recommendation that learns visualization design choices from a large corpus of datasets and associated visualizations. First, we identify five key design choices made by analysts while creating visualizations, such as selecting a visualization type and choosing to encode a column along the X- or Y-axis. We train models to predict these design choices using one million dataset-visualization pairs collected from a popular online visualization platform. Neural networks predict these design choices with high accuracy compared to baseline models. We report and interpret feature importances from one of these baseline models. To evaluate the generalizability and uncertainty of our approach, we benchmark with a crowdsourced test set, and show that the performance of our model is comparable to human performance when predicting the consensus visualization type, and exceeds that of other ML-based systems.
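To make the idea of learning a design choice from dataset-visualization pairs concrete, here is a minimal sketch in Python. It is not the authors' method: a nearest-centroid classifier stands in for the neural networks, and the three per-column features, the six-example toy corpus, and all function names are assumptions invented for illustration only.

```python
from collections import defaultdict
from math import dist


def column_features(values):
    """Map one column to a tiny feature vector (hypothetical features)."""
    numeric = all(isinstance(v, (int, float)) for v in values)
    unique_fraction = len(set(values)) / len(values)
    # A numeric column in non-decreasing order often behaves like a time axis.
    is_sorted = numeric and all(a <= b for a, b in zip(values, values[1:]))
    return [float(numeric), unique_fraction, float(is_sorted)]


def dataset_features(columns):
    """Concatenate per-column features for a two-column dataset."""
    return [f for col in columns for f in column_features(col)]


def fit_centroids(examples):
    """examples: list of (feature_vector, label); returns label -> mean vector."""
    grouped = defaultdict(list)
    for features, label in examples:
        grouped[label].append(features)
    return {
        label: [sum(dim) / len(vectors) for dim in zip(*vectors)]
        for label, vectors in grouped.items()
    }


def predict(centroids, features):
    """Return the label whose centroid is closest in Euclidean distance."""
    return min(centroids, key=lambda label: dist(centroids[label], features))


# Toy corpus (made up): each entry pairs an (x, y) dataset with the
# visualization type an analyst is assumed to have chosen for it.
TOY_CORPUS = [
    (([1, 2, 3, 4], [3.0, 2.5, 4.1, 5.0]), "line"),
    (([2010, 2011, 2012, 2013], [5, 7, 6, 9]), "line"),
    ((["a", "b", "c", "d"], [4, 2, 7, 1]), "bar"),
    ((["north", "south", "east", "west"], [10, 12, 9, 11]), "bar"),
    (([3.2, 1.1, 4.8, 2.0], [0.5, 2.2, 1.9, 3.3]), "scatter"),
    (([9, 2, 7, 4], [1, 8, 3, 6]), "scatter"),
]

centroids = fit_centroids(
    [(dataset_features(cols), label) for cols, label in TOY_CORPUS]
)

new_dataset = (["q1", "q2", "q3", "q4"], [100, 120, 90, 130])
print(predict(centroids, dataset_features(new_dataset)))  # prints "bar"
```

The sketch keeps the same overall shape as the approach described above: summarize each dataset as features, fit a model on the design choices analysts actually made, then predict that choice for an unseen dataset. A realistic system would need far richer features, a much larger corpus, and a held-out evaluation.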


