
Lodestar: Supporting Independent Learning and Rapid Experimentation Through Data-Driven Analysis Recommendations

04/16/2022
by Deepthi Raghunandan, et al.
Google
University of Maryland
University of Washington

Keeping abreast of current trends, technologies, and best practices in visualization and data analysis is becoming increasingly difficult, especially for fledgling data scientists. In this paper, we propose Lodestar, an interactive computational notebook that allows users to quickly explore and construct new data science workflows by selecting from a list of automated analysis recommendations. We derive our recommendations from directed graphs of known analysis states, with two input sources: one manually curated from online data science tutorials, and another extracted through semi-automatic analysis of a corpus of over 6,000 Jupyter notebooks. We evaluate Lodestar in a formative study guiding our next set of improvements to the tool. Our results suggest that users find Lodestar useful for rapidly creating data science workflows.
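The idea of recommending next steps from a directed graph of analysis states can be illustrated with a small sketch. This is not Lodestar's implementation: it assumes each workflow is simply an ordered list of step labels, and it ranks candidate next steps by how often they directly follow the current step. The step names (`load_csv`, `describe`, and so on) and the helper functions `build_transition_graph` and `recommend_next` are hypothetical, introduced only for illustration.

```python
from collections import Counter, defaultdict


def build_transition_graph(workflows):
    """Build a directed graph whose edge weights count how often
    one analysis step is directly followed by another."""
    graph = defaultdict(Counter)
    for steps in workflows:
        for current, following in zip(steps, steps[1:]):
            graph[current][following] += 1
    return graph


def recommend_next(graph, current_step, k=3):
    """Return up to k successors of current_step, most frequent first."""
    successors = graph.get(current_step, Counter())
    return [step for step, _ in successors.most_common(k)]


# Hypothetical workflows, standing in for steps mined from tutorials
# or notebooks (assumed data, not taken from the paper).
workflows = [
    ["load_csv", "describe", "drop_na", "plot_histogram"],
    ["load_csv", "describe", "plot_histogram"],
    ["load_csv", "drop_na", "describe", "fit_model"],
]

graph = build_transition_graph(workflows)
print(recommend_next(graph, "load_csv"))
```

In this toy data, `describe` follows `load_csv` twice and `drop_na` once, so `describe` is ranked first. A real system would likely need richer state representations than plain labels; the frequency ranking here is only one plausible choice.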

