Finding Phish in a Haystack: A Pipeline for Phishing Classification on Certificate Transparency Logs

06/23/2021
by   Arthur Drichel, et al.
0

Current popular phishing prevention techniques mainly utilize reactive blocklists, which leave a “window of opportunity” for attackers during which victims are unprotected. One possible approach to shorten this window aims to detect phishing attacks earlier, during website preparation, by monitoring Certificate Transparency (CT) logs. Previous attempts to work with CT log data for phishing classification exist, however they lack evaluations on actual CT log data. In this paper, we present a pipeline that facilitates such evaluations by addressing a number of problems when working with CT log data. The pipeline includes dataset creation, training, and past or live classification of CT logs. Its modular structure makes it possible to easily exchange classifiers or verification sources to support ground truth labeling efforts and classifier comparisons. We test the pipeline on a number of new and existing classifiers, and find a general potential to improve classifiers for this scenario in the future. We publish the source code of the pipeline and the used datasets along with this paper (https://gitlab.com/rwth-itsec/ctl-pipeline), thus making future research in this direction more accessible.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/21/2018

The Rise of Certificate Transparency and Its Implications on the Internet Ecosystem

In this paper, we analyze the evolution of Certificate Transparency (CT)...
research
03/03/2022

Postcertificates for Revocation Transparency

The modern Internet is highly dependent on trust communicated via certif...
research
11/10/2017

Verifiable Light-Weight Monitoring for Certificate Transparency Logs

Trust in publicly verifiable Certificate Transparency (CT) logs is reduc...
research
01/13/2020

Characterizing the Root Landscape of Certificate Transparency Logs

Internet security and privacy stand on the trustworthiness of public cer...
research
06/22/2018

Aggregation-Based Gossip for Certificate Transparency

Certificate Transparency (CT) is a project that mandates public logging ...
research
03/03/2022

SoK: SCT Auditing in Certificate Transparency

The Web public key infrastructure is essential to providing secure commu...
research
10/05/2020

A Study on Trees's Knots Prediction from their Bark Outer-Shape

In the industry, the value of wood-logs strongly depends on their intern...

Please sign up or login with your details

Forgot password? Click here to reset