A multi-level analysis of data quality for formal software citation

06/30/2023
by   David Schindler, et al.
0

Software is a central part of modern science, and knowledge of its use is crucial for the scientific community with respect to reproducibility and attribution of its developers. Several studies have investigated in-text mentions of software and its quality, while the quality of formal software citations has only been analyzed superficially. This study performs an in-depth evaluation of formal software citation based on a set of manually annotated software references. It examines which resources are cited for software usage, to what extend they allow proper identification of software and its specific version, how this information is made available by scientific publishers, and how well it is represented in large-scale bibliographic databases. The results show that software articles are the most cited resource for software, while direct software citations are better suited for identification of software versions. Moreover, we found current practices by both, publishers and bibliographic databases, to be unsuited to represent these direct software citations, hindering large-scale analyses such as assessing software impact. We argue that current practices for representing software citations – the recommended way to cite software by current citation standards – stand in the way of their adaption by the scientific community, and urge providers of bibliographic data to explicitly model scientific software.

READ FULL TEXT

page 1

page 20

page 21

page 22

page 23

page 24

research
11/01/2019

Practice meets Principle: Tracking Software and Data Citations to Zenodo DOIs

Data and software citations are crucial for the transparency of research...
research
11/27/2018

Challenges of measuring the impact of software: an examination of the lme4 R package

The rise of software as a research object is mirrored in the increasing ...
research
07/17/2023

How do software citation formats evolve over time? A longitudinal analysis of R programming language packages

Under the data-driven research paradigm, research software has come to p...
research
08/13/2021

On the evaluation of research software: the CDUR procedure

Background: Evaluation of the quality of research software is a challeng...
research
06/05/2023

Evaluation of software impact designed for biomedical research: Are we measuring what's meaningful?

Software is vital for the advancement of biology and medicine. Analysis ...
research
04/15/2021

A proposal for Transversal Computer-related Strategies Services for Scientific and Training efforts for the LASF4RI

This schematic proposal is looking to give a first view of the different...
research
06/12/2019

Better Code, Better Sharing:On the Need of Analyzing Jupyter Notebooks

By bringing together code, text, and examples, Jupyter notebooks have be...

Please sign up or login with your details

Forgot password? Click here to reset