Reproducibility in NLP: What Have We Learned from the Checklist?

06/16/2023
by   Ian Magnusson, et al.
0

Scientific progress in NLP rests on the reproducibility of researchers' claims. The *CL conferences created the NLP Reproducibility Checklist in 2020 to be completed by authors at submission to remind them of key information to include. We provide the first analysis of the Checklist by examining 10,405 anonymous responses to it. First, we find evidence of an increase in reporting of information on efficiency, validation performance, summary statistics, and hyperparameters after the Checklist's introduction. Further, we show acceptance rate grows for submissions with more Yes responses. We find that the 44 submissions that gather new data are 5 that did not; the average reviewer-rated reproducibility of these submissions is also 2 claim to open-source their code, though submissions that do have 8 reproducibility score relative to those that do not, the most for any item. We discuss what can be inferred about the state of reproducibility in NLP, and provide a set of recommendations for future conferences, including: a) allowing submitting code and appendices one week after the deadline, and b) measuring dataset reproducibility by a checklist of data collection practices.

READ FULL TEXT

page 5

page 6

page 16

page 17

page 18

page 19

page 20

page 21

research
06/07/2023

Investigating Reproducibility at Interspeech Conferences: A Longitudinal and Comparative Perspective

Reproducibility is a key aspect for scientific advancement across discip...
research
04/06/2021

Efficient transfer learning for NLP with ELECTRA

Clark et al. [2020] claims that the ELECTRA approach is highly efficient...
research
04/12/2022

Quantified Reproducibility Assessment of NLP Results

This paper describes and tests a method for carrying out quantified repr...
research
03/28/2023

Reproducibility is Nothing without Correctness: The Importance of Testing Code in NLP

Despite its pivotal role in research experiments, code correctness is of...
research
09/02/2021

Quantifying Reproducibility in NLP and ML

Reproducibility has become an intensely debated topic in NLP and ML over...
research
04/09/2022

A Siren Song of Open Source Reproducibility

As reproducibility becomes a greater concern, conferences have largely c...
research
07/16/2019

Evaluating the Reproducibility of Research in Obstetrics and Gynecology

Objective: Reproducibility is a core tenet of scientific research. A rep...

Please sign up or login with your details

Forgot password? Click here to reset