A Comparative Study of Software Secrets Reporting by Secret Detection Tools

07/03/2023
by   Setu Kumar Basak, et al.
0

Background: According to GitGuardian's monitoring of public GitHub repositories, secrets sprawl continued accelerating in 2022 by 67 2021, exposing over 10 million secrets (API keys and other credentials). Though many open-source and proprietary secret detection tools are available, these tools output many false positives, making it difficult for developers to take action and teams to choose one tool out of many. To our knowledge, the secret detection tools are not yet compared and evaluated. Aims: The goal of our study is to aid developers in choosing a secret detection tool to reduce the exposure of secrets through an empirical investigation of existing secret detection tools. Method: We present an evaluation of five open-source and four proprietary tools against a benchmark dataset. Results: The top three tools based on precision are: GitHub Secret Scanner (75 Commercial X (25 and TruffleHog (52 false positives are due to employing generic regular expressions and ineffective entropy calculation. In contrast, false negatives are due to faulty regular expressions, skipping specific file types, and insufficient rulesets. Conclusions: We recommend developers choose tools based on secret types present in their projects to prevent missing secrets. In addition, we recommend tool vendors update detection rules periodically and correctly employ secret verification mechanisms by collaborating with API vendors to improve accuracy.

READ FULL TEXT

page 6

page 7

research
03/12/2023

SecretBench: A Dataset of Software Secrets

According to GitGuardian's monitoring of public GitHub repositories, the...
research
08/13/2020

Sniffing for Codebase Secret Leaks with Known Production Secrets in Industry

Leaked secrets, such as passwords and API keys, in codebases were respon...
research
11/11/2022

Committed by Accident: Studying Prevention and Remediation Strategies Against Secret Leakage in Source Code Repositories

Version control systems for source code, such as Git, are key tools in m...
research
08/24/2022

What are the Practices for Secret Management in Software Artifacts?

Throughout 2021, GitGuardian's monitoring of public GitHub repositories ...
research
12/24/2022

Bernoulli honeywords

Decoy passwords, or “honeywords,” planted in a credential database can a...
research
01/29/2023

What Challenges Do Developers Face About Checked-in Secrets in Software Artifacts?

Throughout 2021, GitGuardian's monitoring of public GitHub repositories ...
research
09/12/2023

Commands as AI Conversations

Developers and data scientists often struggle to write command-line inpu...

Please sign up or login with your details

Forgot password? Click here to reset