Best Practices in Scientific Computing

01/28/2021
by   Ricardo Sanchez, et al.
0

The world is becoming increasingly complex, both in terms of the rich sources of data we have access to as well as in terms of the statistical and computational methods we can use on those data. These factors create an ever-increasing risk for errors in our code and sensitivity in our findings to data preparation and execution of complex statistical and computing methods. The consequences of coding and data mistakes can be substantial. Openness (e.g., providing others with data code) and transparency (e.g., requiring that data processing and code follow standards) are two key solutions to help alleviate concerns about replicability and errors. In this paper, we describe the key steps for implementing a code quality assurance (QA) process for researchers to follow to improve their coding practices throughout a project to assure the quality of the final data, code, analyses and ultimately the results. These steps include: (i) adherence to principles for code writing and style that follow best practices, (ii) clear written documentation that describes code, workflow and key analytic decisions; (iii) careful version control, (iv) good data management; and (iv) regular testing and review. Following all these steps will greatly improve the ability of a study to assure results are accurate and reproducible. The responsibility for code QA falls not only on individual researchers but institutions, journals, and funding agencies as well.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/08/2022

The importance of good coding practices for data scientists

Many data science students and practitioners are reluctant to adopt good...
research
02/15/2022

Eliciting Best Practices for Collaboration with Computational Notebooks

Despite the widespread adoption of computational notebooks, little is kn...
research
06/12/2020

Workflow environments for advanced cyberinfrastructure platforms

Progress in science is deeply bound to the effective use of high-perform...
research
08/24/2021

Do Comments follow Commenting Conventions? A Case Study in Java and Python

Assessing code comment quality is known to be a difficult problem. A num...
research
06/12/2019

Better Code, Better Sharing:On the Need of Analyzing Jupyter Notebooks

By bringing together code, text, and examples, Jupyter notebooks have be...
research
03/23/2021

A large-scale study on research code quality and execution

This article presents a study on the quality and execution of research c...
research
02/03/2023

Four principles for improved statistical ecology

Increasing attention has been drawn to the misuse of statistical methods...

Please sign up or login with your details

Forgot password? Click here to reset