Pynblint: a Static Analyzer for Python Jupyter Notebooks

05/24/2022
by   Luigi Quaranta, et al.
0

Jupyter Notebook is the tool of choice of many data scientists in the early stages of ML workflows. The notebook format, however, has been criticized for inducing bad programming practices; indeed, researchers have already shown that open-source repositories are inundated by poor-quality notebooks. Low-quality output from the prototypical stages of ML workflows constitutes a clear bottleneck towards the productization of ML models. To foster the creation of better notebooks, we developed Pynblint, a static analyzer for Jupyter notebooks written in Python. The tool checks the compliance of notebooks (and surrounding repositories) with a set of empirically validated best practices and provides targeted recommendations when violations are detected.

READ FULL TEXT

page 1

page 2

research
03/11/2023

NICHE: A Curated Dataset of Engineered Machine Learning Projects in Python

Machine learning (ML) has gained much attention and been incorporated in...
research
08/08/2023

A Dataset and Analysis of Open-Source Machine Learning Products

Machine learning (ML) components are increasingly incorporated into soft...
research
03/30/2020

QRMine: A python package for triangulation in Grounded Theory

Grounded theory (GT) is a qualitative research method for building theor...
research
09/20/2017

Pandapower - an Open Source Python Tool for Convenient Modeling, Analysis and Optimization of Electric Power Systems

Pandapower is a Python based, BSD-licensed power system analysis tool ai...
research
03/01/2021

PHIDL: Python CAD layout and geometry creation for nanolithography

Computer-aided design (CAD) has become a critical element in the creatio...
research
05/19/2022

Fidyll: A Compiler for Cross-Format Data Stories Explorable Explanations

Narrative visualization is a powerful communicative tool that can take o...

Please sign up or login with your details

Forgot password? Click here to reset