REFORMS: Reporting Standards for Machine Learning Based Science

08/15/2023
by   Sayash Kapoor, et al.
0

Machine learning (ML) methods are proliferating in scientific research. However, the adoption of these methods has been accompanied by failures of validity, reproducibility, and generalizability. These failures can hinder scientific progress, lead to false consensus around invalid claims, and undermine the credibility of ML-based science. ML methods are often applied and fail in similar ways across disciplines. Motivated by this observation, our goal is to provide clear reporting standards for ML-based science. Drawing from an extensive review of past literature, we present the REFORMS checklist (Reporting Standards For Machine Learning Based Science). It consists of 32 questions and a paired set of guidelines. REFORMS was developed based on a consensus of 19 researchers across computer science, data science, mathematics, social sciences, and biomedical sciences. REFORMS can serve as a resource for researchers when designing and implementing a study, for referees when reviewing papers, and for journals when enforcing standards for transparency and reproducibility.

READ FULL TEXT

page 2

page 4

page 5

page 6

research
07/14/2022

Leakage and the Reproducibility Crisis in ML-based Science

The use of machine learning (ML) methods for prediction and forecasting ...
research
06/20/2021

Machine learning in the social and health sciences

The uptake of machine learning (ML) approaches in the social and health ...
research
07/29/2020

Integrating Machine Learning for Planetary Science: Perspectives for the Next Decade

Machine learning (ML) methods can expand our ability to construct, and d...
research
06/13/2022

Modeling the Machine Learning Multiverse

Amid mounting concern about the reliability and credibility of machine l...
research
09/08/2023

Commentary on Guyll et al. (2023): Misuse of Statistical Method Results in Highly Biased Interpretation of Forensic Evidence

Since the National Academy of Sciences released their report outlining p...
research
08/18/2020

Creating optimal conditions for reproducible data analysis in R with 'fertile'

The advancement of scientific knowledge increasingly depends on ensuring...
research
06/09/2018

A hybrid econometric-machine learning approach for relative importance analysis: Food inflation

A measure of relative importance of variables is often desired by resear...

Please sign up or login with your details

Forgot password? Click here to reset