The challenge of reproducible ML: an empirical study on the impact of bugs

09/09/2021
by   Emilio Rivera-Landos, et al.
0

Reproducibility is a crucial requirement in scientific research. When results of research studies and scientific papers have been found difficult or impossible to reproduce, we face a challenge which is called reproducibility crisis. Although the demand for reproducibility in Machine Learning (ML) is acknowledged in the literature, a main barrier is inherent non-determinism in ML training and inference. In this paper, we establish the fundamental factors that cause non-determinism in ML systems. A framework, ReproduceML, is then introduced for deterministic evaluation of ML experiments in a real, controlled environment. ReproduceML allows researchers to investigate software configuration effects on ML training and inference. Using ReproduceML, we run a case study: investigation of the impact of bugs inside ML libraries on performance of ML experiments. This study attempts to quantify the impact that the occurrence of bugs in a popular ML framework, PyTorch, has on the performance of trained models. To do so, a comprehensive methodology is proposed to collect buggy versions of ML libraries and run deterministic ML experiments using ReproduceML. Our initial finding is that there is no evidence based on our limited dataset to show that bugs which occurred in PyTorch do affect the performance of trained models. The proposed methodology as well as ReproduceML can be employed for further research on non-determinism and bugs.

READ FULL TEXT

page 1

page 2

page 8

research
06/24/2022

Bugs in Machine Learning-based Systems: A Faultload Benchmark

The rapid escalation of applying Machine Learning (ML) in various domain...
research
07/19/2023

Reproducibility in Machine Learning-Driven Research

Research is facing a reproducibility crisis, in which the results and fi...
research
08/11/2021

Why are Some Bugs Non-Reproducible? An Empirical Investigation using Data Fusion

Software developers attempt to reproduce software bugs to understand the...
research
07/14/2022

Leakage and the Reproducibility Crisis in ML-based Science

The use of machine learning (ML) methods for prediction and forecasting ...
research
06/10/2023

An Empirical Study of Bugs in Quantum Machine Learning Frameworks

Quantum computing has emerged as a promising domain for the machine lear...
research
12/15/2020

Run, Forest, Run? On Randomization and Reproducibility in Predictive Software Engineering

Machine learning (ML) has been widely used in the literature to automate...
research
05/27/2020

(Non)-neutrality of science and algorithms: Machine Learning between fundamental physics and society

The impact of Machine Learning (ML) algorithms in the age of big data an...

Please sign up or login with your details

Forgot password? Click here to reset