Eliciting Best Practices for Collaboration with Computational Notebooks

02/15/2022
by   Luigi Quaranta, et al.
0

Despite the widespread adoption of computational notebooks, little is known about best practices for their usage in collaborative contexts. In this paper, we fill this gap by eliciting a catalog of best practices for collaborative data science with computational notebooks. With this aim, we first look for best practices through a multivocal literature review. Then, we conduct interviews with professional data scientists to assess their awareness of these best practices. Finally, we assess the adoption of best practices through the analysis of 1,380 Jupyter notebooks retrieved from the Kaggle platform. Findings reveal that experts are mostly aware of the best practices and tend to adopt them in their daily work. Nonetheless, they do not consistently follow all the recommendations as, depending on specific contexts, some are deemed unfeasible or counterproductive due to the lack of proper tool support. As such, we envision the design of notebook solutions that allow data scientists not to have to prioritize exploration and rapid prototyping over writing code of quality.

READ FULL TEXT
research
09/05/2019

Human-AI Collaboration in Data Science: Exploring Data Scientists' Perceptions of Automated AI

The rapid advancement of artificial intelligence (AI) is changing our li...
research
08/07/2023

Notably Inaccessible – Data Driven Understanding of Data Science Notebook (In)Accessibility

Computational notebooks, tools that facilitate storytelling through expl...
research
01/28/2021

Best Practices in Scientific Computing

The world is becoming increasingly complex, both in terms of the rich so...
research
01/10/2023

How Data Scientists Review the Scholarly Literature

Keeping up with the research literature plays an important role in the w...
research
02/24/2021

Themisto: Towards Automated Documentation Generation in Computational Notebooks

Computational notebooks allow data scientists to express their ideas thr...
research
11/11/2021

Developing and Publishing Code for Trusted Research Environments: Best Practices and Ways of Working

This report discusses 3 distinct, but overlapping topics. Firstly, it re...
research
05/24/2022

Assessing the Quality of Computational Notebooks for a Frictionless Transition from Exploration to Production

The massive trend of integrating data-driven AI capabilities into tradit...

Please sign up or login with your details

Forgot password? Click here to reset