How many preprints have actually been printed and why: a case study of computer science preprints on arXiv

08/03/2023
by Jialiang Lin, et al.

Preprints play an increasingly critical role in academic communities. There are many reasons driving researchers to post their manuscripts to preprint servers before formal submission to journals or conferences, but the use of preprints has also sparked considerable controversy, especially surrounding the claim of priority. In this paper, a case study of computer science preprints submitted to arXiv from 2008 to 2017 is conducted to quantify how many preprints have eventually been printed in peer-reviewed venues. Among those published manuscripts, some are published under different titles and without an update to their preprints on arXiv. In the case of these manuscripts, the traditional fuzzy matching method is incapable of mapping the preprint to the final published version. In view of this issue, we introduce a semantics-based mapping method with the employment of Bidirectional Encoder Representations from Transformers (BERT). With this new mapping method and a plurality of data sources, we find that 66% of all sampled preprints are published under unchanged titles and 11% are published under different titles and with other modifications. A further analysis was then performed to investigate why these preprints but not others were accepted for publication. Our comparison reveals that in the field of computer science, published preprints feature adequate revisions, multiple authorship, detailed abstracts and introductions, extensive and authoritative references, and available source code.
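
To illustrate why character-level fuzzy matching can miss a preprint that was retitled on publication, here is a minimal sketch. It is not the authors' implementation: Python's standard `difflib.SequenceMatcher` stands in for "traditional fuzzy matching", the 0.9 threshold is an arbitrary assumption, and all titles and function names are hypothetical.

```python
import re
from difflib import SequenceMatcher


def normalize_title(title: str) -> str:
    """Lowercase a title and collapse punctuation/whitespace runs into single spaces."""
    return re.sub(r"[^a-z0-9]+", " ", title.lower()).strip()


def title_similarity(a: str, b: str) -> float:
    """Character-level similarity in [0, 1] between two normalized titles."""
    return SequenceMatcher(None, normalize_title(a), normalize_title(b)).ratio()


def fuzzy_match(a: str, b: str, threshold: float = 0.9) -> bool:
    """Treat two titles as the same paper if their similarity reaches the threshold.

    The threshold value is an assumption chosen for illustration only.
    """
    return title_similarity(a, b) >= threshold


# Hypothetical titles, invented for this example.
same_a = "Deep Learning for Graphs: A Survey"
same_b = "deep learning for graphs - a survey"
preprint = "A Study of Preprint Publication Rates"
published = "How Many Preprints Get Printed"

# Same title up to case and punctuation: matched.
print(fuzzy_match(same_a, same_b))
# Same (hypothetical) work under a reworded title: not matched.
print(fuzzy_match(preprint, published))
```

The first pair differs only in case and punctuation, so the string comparison succeeds; the second pair shares a topic but few characters, so it falls below the threshold. This is the kind of case a semantics-based comparison, such as the BERT-based mapping the abstract describes, is meant to handle.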

research
09/04/2023

PreprintResolver: Improving Citation Quality by Resolving Published Versions of ArXiv Preprints using Literature Databases

The growing impact of preprint servers enables the rapid sharing of time...
research
09/16/2019

Proceedings of the 27th International Symposium on Graph Drawing and Network Visualization (GD 2019)

This is the arXiv index for the electronic proceedings of GD 2019, which...
research
05/04/2023

A Monoidal View on Fixpoint Checks

Fixpoints are ubiquitous in computer science as they play a central role...
research
09/10/2021

Proceedings of the 29th International Symposium on Graph Drawing and Network Visualization (GD 2021)

This is the arXiv index for the electronic proceedings of GD 2021, which...
research
09/11/2023

Proceedings of the 31st International Symposium on Graph Drawing and Network Visualization (GD 2023)

This is the arXiv index for the electronic proceedings of GD 2023, which...
research
10/14/2017

Popularity of arXiv.org within Computer Science

It may seem surprising that, out of all areas of science, computer scien...
research
07/14/2017

Sustainable computational science: the ReScience initiative

Computer science offers a large set of tools for prototyping, writing, r...
