Revisiting Dockerfiles in Open Source Software Over Time

03/23/2021
by Kalvin Eng, et al.

Docker is becoming ubiquitous as a containerization platform for developing and deploying applications. Previous studies have analyzed the Dockerfiles used to create container images in order to better understand how to improve Docker tooling. These studies obtained Dockerfiles from either Docker Hub or GitHub. In this paper, we revisit the findings of previous studies using the largest set of Dockerfiles known to date: over 9.4 million unique Dockerfiles found in the World of Code infrastructure, spanning 2013 to 2020. We contribute a historical view of the Dockerfile format by analyzing the Docker engine changelogs, and we use this history to enhance our analysis of Dockerfiles. We also reconfirm previous findings of a downward trend in the use of OS images and an upward trend in the use of language images. In addition, we reconfirm that Dockerfile smell counts are slightly decreasing, which suggests that Dockerfile authors are likely getting better at following best practices. These findings indicate that prior works were correct in many of their conclusions, and they further substantiate those works' suggestions to build better tools for Docker image creation.
