Phishing URL Detection Through Top-level Domain Analysis: A Descriptive Approach

05/13/2020
by   Orestis Christou, et al.
0

Phishing is considered to be one of the most prevalent cyber-attacks because of its immense flexibility and alarmingly high success rate. Even with adequate training and high situational awareness, it can still be hard for users to continually be aware of the URL of the website they are visiting. Traditional detection methods rely on blocklists and content analysis, both of which require time-consuming human verification. Thus, there have been attempts focusing on the predictive filtering of such URLs. This study aims to develop a machine-learning model to detect fraudulent URLs which can be used within the Splunk platform. Inspired from similar approaches in the literature, we trained the SVM and Random Forests algorithms using malicious and benign datasets found in the literature and one dataset that we created. We evaluated the algorithms' performance with precision and recall, reaching up to 85 recall in the case of Random Forests while SVM achieved up to 90 88

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/05/2020

Bayesian Optimization with Machine Learning Algorithms Towards Anomaly Detection

Network attacks have been very prevalent as their rate is growing tremen...
research
11/21/2018

Privacy-Preserving Collaborative Prediction using Random Forests

We study the problem of privacy-preserving machine learning (PPML) for e...
research
06/04/2022

Leveraging Machine Learning for Ransomware Detection

The current pandemic situation has increased cyber-attacks drastically w...
research
04/28/2015

Explaining the Success of AdaBoost and Random Forests as Interpolating Classifiers

There is a large literature explaining why AdaBoost is a successful clas...
research
08/09/2018

Code-Mixed Sentiment Analysis Using Machine Learning and Neural Network Approaches

Sentiment Analysis for Indian Languages (SAIL)-Code Mixed tools contest ...
research
03/05/2022

Fuzzy Forests For Feature Selection in High-Dimensional Survey Data: An Application to the 2020 U.S. Presidential Election

An increasingly common methodological issue in the field of social scien...

Please sign up or login with your details

Forgot password? Click here to reset