Lux: Always-on Visualization Recommendations for Exploratory Data Science

04/30/2021
by   Doris Jung-Lin Lee, et al.
0

Exploratory data science largely happens in computational notebooks with dataframe API, such as pandas, that support flexible means to transform, clean, and analyze data. Yet, visually exploring data in dataframes remains tedious, requiring substantial programming effort for visualization and mental effort to determine what analysis to perform next. We propose Lux, an always-on framework for accelerating visual insight discovery in data science workflows. When users print a dataframe in their notebooks, Lux recommends visualizations to provide a quick overview of the patterns and trends and suggests promising analysis directions. Lux features a high-level language for generating visualizations on-demand to encourage rapid visual experimentation with data. We demonstrate that through the use of a careful design and three system optimizations, Lux adds no more than two seconds of overhead on top of pandas for over 98 datasets in the UCI repository. We evaluate Lux in terms of usability via a controlled first-use study and interviews with early adopters, finding that Lux helps fulfill the needs of data scientists for visualization support within their dataframe workflows. Lux has already been embraced by data science practitioners, with over 1.9k stars on Github within its first 15 months.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/16/2022

Lodestar: Supporting Independent Learning and Rapid Experimentation Through Data-Driven Analysis Recommendations

Keeping abreast of current trends, technologies, and best practices in v...
research
04/11/2023

Mining the Characteristics of Jupyter Notebooks in Data Science Projects

Nowadays, numerous industries have exceptional demand for skills in data...
research
07/30/2019

Decoding a Complex Visualization in a Science Museum -- An Empirical Study

This study describes a detailed analysis of museum visitors' decoding pr...
research
03/29/2021

Meeting in the notebook: a notebook-based environment for micro-submissions in data science collaborations

Developers in data science and other domains frequently use computationa...
research
08/07/2023

Notably Inaccessible – Data Driven Understanding of Data Science Notebook (In)Accessibility

Computational notebooks, tools that facilitate storytelling through expl...
research
09/06/2022

Code Code Evolution: Understanding How People Change Data Science Notebooks Over Time

Sensemaking is the iterative process of identifying, extracting, and exp...
research
07/29/2016

3D visualization of astronomy data cubes using immersive displays

We report on an exploratory project aimed at performing immersive 3D vis...

Please sign up or login with your details

Forgot password? Click here to reset