18 Million Links in Commit Messages: Purpose, Evolution, and Decay

05/26/2023
by   Tao Xiao, et al.
0

Commit messages contain diverse and valuable types of knowledge in all aspects of software maintenance and evolution. Links are an example of such knowledge. Previous work on "9.6 million links in source code comments" showed that links are prone to decay, become outdated, and lack bidirectional traceability. We conducted a large-scale study of 18,201,165 links from commits in 23,110 GitHub repositories to investigate whether they suffer the same fate. Results show that referencing external resources is prevalent and that the most frequent domains other than github.com are the external domains of Stack Overflow and Google Code. Similarly, links serve as source code context to commit messages, with inaccessible links being frequent. Although repeatedly referencing links is rare (4 unavailable over time; e.g., tutorials or articles and software homepages become unavailable over time. Furthermore, we find that 70 links suffer from decay; the domains that occur the most frequently are related to Subversion repositories. We summarize that links in commits share the same fate as links in code, opening up avenues for future work.

READ FULL TEXT
research
01/22/2019

9.6 Million Links in Source Code Comments: Purpose, Evolution, and Decay

Links are an essential feature of the World Wide Web, and source code re...
research
04/08/2021

An Exploratory Study on the Repeatedly Shared External Links on Stack Overflow

On Stack Overflow, users reuse 11,926,354 external links to share the re...
research
04/21/2022

Message Flow Analysis with Complex Causal Links for Distributed ROS 2 Systems

Distributed robotic systems rely heavily on publish-subscribe frameworks...
research
10/10/2020

Broken External Links on Stack Overflow

Stack Overflow hosts valuable programming-related knowledge with 11,926,...
research
06/11/2019

Contextual Documentation Referencing on Stack Overflow

Software engineering is knowledge-intensive and requires software develo...
research
04/13/2021

Science-Software Linkage: The Challenges of Traceability between Scientific Knowledge and Software Artifacts

Although computer science papers are often accompanied by software artif...

Please sign up or login with your details

Forgot password? Click here to reset