What Challenges Do Developers Face About Checked-in Secrets in Software Artifacts?

01/29/2023
by Setu Kumar Basak, et al.

Throughout 2021, GitGuardian's monitoring of public GitHub repositories revealed a two-fold increase over 2020 in the number of exposed secrets (database credentials, API keys, and other credentials), for a total of more than six million secrets. To our knowledge, the challenges developers face in avoiding checked-in secrets have not yet been characterized. The goal of our paper is to aid researchers and tool developers in understanding and prioritizing opportunities for future research and tool automation for mitigating checked-in secrets, through an empirical investigation of challenges and solutions related to checked-in secrets. We extract 779 questions related to checked-in secrets on Stack Exchange and apply qualitative analysis to determine the challenges and the solutions proposed by others for each challenge. We identify 27 challenges and 13 solutions. The four most common challenges, in ranked order, are: (i) store/version of secrets during deployment; (ii) store/version of secrets in source code; (iii) ignore/hide of secrets in source code; and (iv) sanitize VCS history. The three most common solutions, in ranked order, are: (i) move secrets out of source code/version control and use a template config file; (ii) secret management in deployment; and (iii) use local environment variables. Our findings indicate that the same solution is often mentioned as a mitigation for multiple challenges. However, our findings also identify an increasing trend in questions lacking accepted solutions, substantiating the need for future research and tool automation on managing secrets.
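As an illustration of the first and third solutions named in the abstract (a template config file kept under version control, with the real values supplied through local environment variables), the following is a minimal sketch and not the paper's method. The key names `DB_USER` and `DB_PASSWORD`, the `TEMPLATE` string, and the helper functions are hypothetical and chosen only for this example.

```python
import os

# Hypothetical template contents. A project might commit a file like this
# (keys only, no values) while keeping the real values out of version control.
TEMPLATE = "DB_USER=\nDB_PASSWORD=\n"


def required_keys(template_text):
    """Return the keys named in a KEY=VALUE template, skipping blanks and comments."""
    keys = []
    for line in template_text.splitlines():
        line = line.strip()
        if line and not line.startswith("#") and "=" in line:
            keys.append(line.split("=", 1)[0].strip())
    return keys


def load_secrets(template_text, environ=None):
    """Resolve every template key from the environment; raise if any is missing or empty."""
    environ = os.environ if environ is None else environ
    keys = required_keys(template_text)
    missing = [k for k in keys if not environ.get(k)]
    if missing:
        raise KeyError("missing secrets: " + ", ".join(missing))
    return {k: environ[k] for k in keys}
```

Failing loudly on a missing key is one possible design choice: it surfaces a misconfigured environment at startup instead of letting an empty credential reach a database or API call.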


