HTMLPhish: Enabling Accurate Phishing Web Page Detection by Applying Deep Learning Techniques on HTML Analysis

08/28/2019
by   Chidimma Opara, et al.
0

Recently, the development and implementation of phishing attacks require little technical skills and costs. This uprising has led to an ever-growing number of phishing attacks on the World Wide Web daily. Consequently, proactive techniques to fight phishing attacks have become extremely necessary. In this paper, we propose a deep learning model HTMLPhish based on the HTML analysis of a web page for accurate phishing attack detection. By using our proposed HTMLPhish, the experimental results on a dataset of over 300,000 web pages yielded 97.2 learning methods such as Support Vector Machine, Random Forest and Logistics Regression. We also show the advantage of HTMLPhish in the aspect of the temporal stability and robustness by testing our proposed model on a dataset collected after two months when the model was trained. In addition, HTMLPhish is a completely language-independent and client-side strategy which can, therefore, conduct web page phishing detection regardless of the textual language.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/06/2020

Look Before You Leap: Detecting Phishing Web Pages by Exploiting Raw URL And HTML Characteristics

Cybercriminals resort to phishing as a simple and cost-effective medium ...
research
03/06/2022

Adaptive technique for web page change detection using multi-threaded crawlers

World Wide Web is getting dense as many new web pages and resources are ...
research
11/06/2020

Web Application Attack Detection using Deep Learning

Modern web applications are dominated by HTTP/HTTPS messages that consis...
research
04/27/2018

An Element Sensitive Saliency Model with Position Prior Learning for Web Pages

Understanding human visual attention is important for multimedia applica...
research
10/26/2021

Fragment-Based Test Generation For Web Apps

Automated model-based test generation presents a viable alternative to t...
research
11/21/2018

Malicious Web Request Detection Using Character-level CNN

Web parameter injection attacks are common and powerful. In this kind of...
research
10/26/2022

WebCrack: Dynamic Dictionary Adjustment for Web Weak Password Detection based on Blasting Response Event Discrimination

The feature diversity of different web systems in page elements, submiss...

Please sign up or login with your details

Forgot password? Click here to reset