DeepAI AI Chat
Log In Sign Up

Cracking Double-Blind Review: Authorship Attribution with Deep Learning

11/14/2022
by   Leonard Bauersfeld, et al.
0

Double-blind peer review is considered a pillar of academic research because it is perceived to ensure a fair, unbiased, and fact-centered scientific discussion. Yet, experienced researchers can often correctly guess from which research group an anonymous submission originates, biasing the peer-review process. In this work, we present a transformer-based, neural-network architecture that only uses the text content and the author names in the bibliography to atttribute an anonymous manuscript to an author. To train and evaluate our method, we created the largest authorship-identification dataset to date. It leverages all research papers publicly available on arXiv amounting to over 2 million manuscripts. In arXiv-subsets with up to 2,000 different authors, our method achieves an unprecedented authorship attribution accuracy, where up to 95 are not only able to predict the author of an anonymous work but we also identify weaknesses of the double-blind review process by finding the key aspects that make a paper attributable. We believe that this work gives precious insights into how a submission can remain anonymous in order to support an unbiased double-blind review process.

READ FULL TEXT

page 1

page 2

page 3

page 4

07/04/2018

Case for the double-blind peer review

Peer review is a process designed to produce a fair assessment of resear...
09/05/2017

Effectiveness of Anonymization in Double-Blind Review

Double-blind review relies on the authors' ability and willingness to ef...
02/06/2018

Uptake and outcome of manuscripts in Nature journals by review model and author characteristics

Double-blind peer review has been proposed as a possible solution to avo...
06/22/2017

A Community's Perspective on the Status and Future of Peer Review in Software Engineering

Context: Pre-publication peer review of scientific articles is considere...
06/01/2021

Some Ethical Issues in the Review Process of Machine Learning Conferences

Recent successes in the Machine Learning community have led to a steep i...
03/31/2022

To ArXiv or not to ArXiv: A Study Quantifying Pros and Cons of Posting Preprints Online

Double-blind conferences have engaged in debates over whether to allow a...
03/27/2019

Does My Rebuttal Matter? Insights from a Major NLP Conference

Peer review is a core element of the scientific process, particularly in...