Citation sentence reuse behavior of scientists: A case study on massive bibliographic text dataset of computer science

05/06/2017
by Mayank Singh, et al.

Our current knowledge of scholarly plagiarism is largely based on the similarity between full-text research articles. In this paper, we propose a novel conceptualization of scholarly plagiarism in the form of reuse of explicit citation sentences in scientific research articles. Note that while full-text plagiarism is an indicator of gross-level behavior, copying of citation sentences is a more nuanced micro-scale phenomenon observed even among well-known researchers. The current work poses several interesting questions and attempts to answer them by empirically investigating a large bibliographic text dataset from computer science containing millions of lines of citation sentences. In particular, we report evidence of massive copying behavior. We also present several striking real examples throughout the paper to showcase the widespread adoption of this undesirable practice. Contrary to popular perception, we find that the copying tendency increases as an author matures. The copying behavior is found in all fields of computer science; however, the theoretical fields show more copying than the applied fields.

research
09/19/2023

Modeling interdisciplinary interactions among Physics, Mathematics & Computer Science

Interdisciplinarity has over the recent years gained tremendous imp...
research
10/09/2017

Characterizing in-text citations in scientific articles: A large-scale analysis

We report characteristics of in-text citations in over five million full...
research
12/22/2019

Viewing Computer Science through Citation Analysis; Salton and Bergmark Redux

Computer science has experienced dramatic growth and diversification ove...
research
06/26/2018

An empirical investigation of the Tribes and their Territories: are research specialisms rural and urban?

We propose an operationalization of the rural and urban analogy introduc...
research
04/30/2019

On the Use of ArXiv as a Dataset

The arXiv has collected 1.5 million pre-print articles over 28 years, ho...
research
08/10/2021

Curatio et Innovatio

The Middle Ages focused obsessively on the old; our era is totally absor...
research
06/14/2022

An analysis of retracted papers in Computer Science

Context: The retraction of research papers, for whatever reason, is a gr...
