PreprintResolver: Improving Citation Quality by Resolving Published Versions of ArXiv Preprints using Literature Databases

09/04/2023
by   Louise Bloch, et al.

The growing impact of preprint servers enables the rapid sharing of time-sensitive research. At the same time, it is becoming increasingly difficult to distinguish high-quality, peer-reviewed research from preprints. Although preprints are often later published in peer-reviewed journals, this information is frequently missing from preprint servers. To overcome this problem, the PreprintResolver was developed. It uses four literature databases (DBLP, SemanticScholar, OpenAlex, and CrossRef / CrossCite) to identify preprint-publication pairs for the arXiv preprint server. The tool primarily targets, but is not limited to, inexperienced researchers and students, especially in the field of computer science. It is based on a fuzzy matching of author surnames, titles, and DOIs. Experiments were performed on a sample of 1,000 arXiv preprints from the research field of computer science without any publication information. With 77.94%, computer science is highly affected by missing publication information in arXiv. The results show that the PreprintResolver was able to resolve 603 of these 1,000 preprints (60.3%). All four literature databases contributed to the final result. In a manual validation, a random sample of 100 resolved preprints was checked. For all preprints, at least one result is plausible. For nine preprints, more than one result was identified, three of which are partially invalid. In conclusion, the PreprintResolver is suitable for individual, manually reviewed requests, but less suitable for bulk requests. The PreprintResolver tool (https://preprintresolver.eu, available from 2023-08-01) and its source code (https://gitlab.com/ippolis_wp3/preprint-resolver, accessed: 2023-07-19) are available online.
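The abstract names fuzzy matching of author surnames, titles, and DOIs but does not spell out the procedure. The following is a minimal sketch of how such a check could look, not the PreprintResolver's actual implementation. The `Record` type, the function names, both thresholds, and the use of the arXiv DOI prefix `10.48550/arXiv` to skip candidates that are the preprint itself are all assumptions made for illustration.

```python
import re
import unicodedata
from dataclasses import dataclass
from difflib import SequenceMatcher
from typing import List, Optional


@dataclass
class Record:
    """Hypothetical metadata record returned by a literature database."""
    title: str
    surnames: List[str]
    doi: Optional[str] = None


def normalize(text: str) -> str:
    """Lowercase, strip accents and punctuation, collapse whitespace."""
    decomposed = unicodedata.normalize("NFKD", text)
    ascii_only = decomposed.encode("ascii", "ignore").decode("ascii")
    cleaned = re.sub(r"[^a-z0-9 ]+", " ", ascii_only.lower())
    return " ".join(cleaned.split())


def title_similarity(a: str, b: str) -> float:
    """Similarity ratio in [0, 1] between two normalized titles."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def surname_overlap(a: List[str], b: List[str]) -> float:
    """Jaccard overlap of the two normalized surname sets."""
    set_a = {normalize(s) for s in a}
    set_b = {normalize(s) for s in b}
    if not set_a or not set_b:
        return 0.0
    return len(set_a & set_b) / len(set_a | set_b)


def is_plausible_pair(
    preprint: Record,
    candidate: Record,
    title_threshold: float = 0.9,    # assumed value, not from the paper
    surname_threshold: float = 0.5,  # assumed value, not from the paper
) -> bool:
    """Decide whether `candidate` could be the published version of `preprint`."""
    # Assumption: a candidate carrying an arXiv DOI is the preprint itself,
    # not a published version, so it is skipped.
    if candidate.doi and candidate.doi.lower().startswith("10.48550/arxiv"):
        return False
    if title_similarity(preprint.title, candidate.title) < title_threshold:
        return False
    return surname_overlap(preprint.surnames, candidate.surnames) >= surname_threshold
```

Requiring both a high title similarity and a surname overlap keeps the check conservative, which fits the conclusion above that results should still be reviewed manually.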
