On Code Reuse from StackOverflow: An Exploratory Study on Jupyter Notebook

02/23/2023
by   Mingke Yang, et al.
0

Jupyter Notebook is a popular tool among data analysts and scientists for working with data. It provides a way to combine code, documentation, and visualizations in a single, interactive environment, facilitating code reuse. While code reuse can improve programming efficiency, it can also decrease readability, security, and overall performance. We conduct a large-scale exploratory study of code reuse practices in the Jupyter Notebook development community on the Stack Overflow platform to understand the potential negative impacts of code reuse. Our findings identified 1,097,470 Jupyter Notebook clone pairs that reuse Stack Overflow code snippets, and the average code snippet has 7.91 code quality violations. Through our research, we gain insight into the reasons behind Jupyter Notebook developers' decision to reuse code and the potential drawbacks of this practice.

READ FULL TEXT

page 1

page 15

page 16

page 21

page 22

research
04/27/2022

Towards Exploring the Code Reuse from Stack Overflow during Software Development

As one of the most well-known programmer Q A websites, Stack Overflow ...
research
05/27/2020

Code Duplication and Reuse in Jupyter Notebooks

Duplicating one's own code makes it faster to write software. This exped...
research
01/30/2023

Faster or Slower? Performance Mystery of Python Idioms Unveiled with Empirical Evidence

The usage of Python idioms is popular among Python developers in a forma...
research
10/02/2020

Augmenting Machine Learning with Information Retrieval to Recommend Real Cloned Code Methods for Code Completion

Software developers frequently reuse source code from repositories as it...
research
05/23/2020

Find Unique Usages: Helping Developers Understand Common Usages

When working in large and complex codebases, developers face challenges ...
research
03/23/2021

A large-scale study on research code quality and execution

This article presents a study on the quality and execution of research c...
research
02/04/2020

Code Duplication on Stack Overflow

Despite the unarguable importance of Stack Overflow (SO) for the daily w...

Please sign up or login with your details

Forgot password? Click here to reset