If You've Seen One, You've Seen Them All: Leveraging AST Clustering Using MCL to Mimic Expertise to Detect Software Supply Chain Attacks

11/04/2020
by   Marc Ohm, et al.
0

Trojanized software packages used in software supply chain attacks constitute an merging threat. Unfortunately, there is still a lack of scalable approaches that allow automated and timely detection of malicious software packages. However, it has been observed that most attack campaigns comprise multiple packages that share the same or similar malicious code. We leverage that fact to automatically reproduce manually identified clusters of known malicious packages that have been used in real world attacks, thus, reducing the need for expert knowledge and manual inspection. Our approach, AST Clustering using MCL to mimic Expertise (ACME), yields promising results with a F_1 score of 0.99. Signatures are automatically generated based on representative code fragments from clusters and are subsequently used to scan the whole npm registry for unreported malicious packages. We are able to identify and report six malicious packages that have been removed from npm consequentially. Therefore, our approach is able to reproduce clustering based on expert knowledge and hence may be employed by maintainers of package repositories like npm to timely detect possible maliciousness of newly uploaded or updated packages.

READ FULL TEXT
research
05/19/2020

Backstabber's Knife Collection: A Review of Open Source Software Supply Chain Attacks

A software supply chain attack is characterized by the injection of mali...
research
10/08/2022

Towards the Detection of Malicious Java Packages

Open-source software supply chain attacks aim at infecting downstream us...
research
05/31/2023

You Can Run But You Can't Hide: Runtime Protection Against Malicious Package Updates For Node.js

Maliciously prepared software packages are an extensively leveraged weap...
research
09/16/2022

Malicious Source Code Detection Using Transformer

Open source code is considered a common practice in modern software deve...
research
02/17/2023

A Review of Attacks Against Language-Based Package Managers

The liberalization of software licensing has led to unprecedented re-use...
research
02/04/2020

Measuring and Preventing Supply Chain Attacks on Package Managers

Package managers have become a vital part of the modern software develop...
research
10/03/2021

Towards Informative Tagging of Code Fragments to Support the Investigation of Code Clones

Investigating the code fragments of code clones detected by code clone d...

Please sign up or login with your details

Forgot password? Click here to reset