The importance of good coding practices for data scientists

10/08/2022
by   Randall Pruim, et al.
0

Many data science students and practitioners are reluctant to adopt good coding practices as long as the code "works". However, code standards are an important part of modern data science practice, and they play an essential role in the development of "data acumen". Good coding practices lead to more reliable code and often save more time than they cost, making them important even for beginners. We believe that principled coding practices are vital for statistics and data science. To install these practices within academic programs, it is important for instructors and programs to begin establishing these practices early, to reinforce them often, and to hold themselves to a higher standard while guiding students. We describe key aspects of coding practices (both good and bad), focusing primarily on the R language, though similar standards are applicable to other software environments. The lessons are organized into a top ten list.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/17/2020

A large-scale comparative analysis of Coding Standard conformance in Open-Source Data Science projects

Background: Meeting the growing industry demand for Data Science require...
research
01/28/2021

Best Practices in Scientific Computing

The world is becoming increasingly complex, both in terms of the rich so...
research
09/17/2021

Opinionated practices for teaching reproducibility: motivation, guided instruction and practice

In the data science courses at the University of British Columbia, we de...
research
01/24/2023

MLinter: Learning Coding Practices from Examples-Dream or Reality?

Coding practices are increasingly used by software companies. Their use ...
research
06/23/2021

A Silicon Valley Love Triangle: Hiring Algorithms, Pseudo-Science, and the Quest for Auditability

In this paper, we suggest a systematic approach for developing socio-tec...
research
02/11/2023

On the Use of Static Analysis to Engage Students with Software Quality Improvement: An Experience with PMD

Static analysis tools are frequently used to scan the source code and de...
research
12/17/2021

Call for establishing benchmark science and engineering

This article investigates the origin and evolution of the benchmark term...

Please sign up or login with your details

Forgot password? Click here to reset